[mtp] Mixed MTP + batched generation phases 1-3 (#49) #64
Loading…
Reference in a new issue
No description provided.
Delete branch "feature/49-mixed-mtp-batch"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Fixes #49. Allows MTP speculative decoding and standard batched generation to run concurrently.
Changes
_mtp_step()for MTP-enabled requests alongside batch generator in mixed mode_mtp_stepfinds MTP-enabled request instead of always taking first running requestTest Results
81/82 scheduler tests pass. 1 failure is pre-existing (boundary snapshot token tracking).
Diff
+32/-40 lines in scheduler.py (net reduction). Clean, focused changes.
d043a32975to3a231bb00d