[cache] Skip q4 KV quantization on restored prefix caches (#48) #62
Loading…
Reference in a new issue
No description provided.
Delete branch "fix/48-q4-cache-prefix-restore"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
_apply_quantized_kv()in_do_external_prefill()to skip whenexisting_cacheis providedCloses #48
Test results