CRITICAL: QuantizedKVCacheHandler reconstruct_cache overrides correct offset with meta_state #42
Bug
QuantizedKVCacheHandler.reconstruct_cache() first correctly sets cache.offset = keys[0].shape[2] (the actual concatenated tensor length), but then overrides it with meta_state[0], which contains the FIRST BLOCK'S offset only.
Impact
Fix
Remove the meta_state offset override. Always use tensor shape for offset, matching KVCacheHandler behavior.
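The following is a minimal sketch of the bug and the proposed fix, not the actual code from type_handlers.py. FakeTensor, FakeCache, reconstruct_buggy and reconstruct_fixed are hypothetical stand-ins, and the layout of meta_state (offset in position 0, followed by other values) is an assumption based on the description above.

```python
from dataclasses import dataclass


@dataclass
class FakeTensor:
    # Hypothetical stand-in for an array; only the shape matters here.
    shape: tuple


@dataclass
class FakeCache:
    # Hypothetical stand-in for the reconstructed cache object.
    offset: int = 0


def reconstruct_buggy(keys, meta_state):
    cache = FakeCache()
    # Correct: sequence length of the concatenated tensor (axis 2).
    cache.offset = keys[0].shape[2]
    # Bug: meta_state comes from the first block only, so this
    # overwrites the correct value with a stale, smaller offset.
    if meta_state:
        cache.offset = int(meta_state[0])
    return cache


def reconstruct_fixed(keys, meta_state):
    cache = FakeCache()
    # Fix: always derive the offset from the tensor shape and
    # never take it from meta_state.
    cache.offset = keys[0].shape[2]
    return cache


# Three blocks of 256 tokens concatenated into one tensor of length 768.
# The shape values and meta_state contents are made up for illustration.
keys = (FakeTensor(shape=(1, 8, 768, 16)),)
meta_state = (256, 64, 4)  # assumed layout: first block's offset comes first

buggy = reconstruct_buggy(keys, meta_state)
fixed = reconstruct_fixed(keys, meta_state)
print(buggy.offset, fixed.offset)
```

With more than one block, the buggy variant reports the first block's offset instead of the full concatenated length, while the fixed variant follows the tensor shape.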
File
omlx/cache/type_handlers.py:501-510