[fix] QuantizedKVCache prefix cache and batching fixes (#39 follow-up) #41
Loading…
Reference in a new issue
No description provided.
Delete branch "feature/39-llamacpp-kv-cache-rewrite"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Fixes for issues discovered after PR #40
1. Empty caches in _patched_merge_caches
2. Layer count mismatch in prefix cache
3. Quantization parameters lost during SSD storage
Test Results
Live Test Log