[feature] Complete mixed MTP + batched generation (phase 3) #49
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Branch: feature/10-mixed-mtp-batch (3 commits ahead of main)
Allows MTP speculative decoding and standard batched generation to run concurrently.
Commits:
8a6e2f1phase 1: skip MTP to batch transition on new arrivalsc4ecef4phase 2: run both MTP and batch paths in mixed mode19811d4phase 3: find MTP-enabled request in _mtp_stepAcceptance criteria:
Notes: Phases 1-2 were merged via PRs #9/#11. Phase 3 was not merged.
Merged via squash (PR #64). Three phases implemented: (1) skip MTP→batch transition, (2) run MTP alongside batch in mixed mode, (3) find MTP-enabled request in _mtp_step. Reviewer caught 2 bugs (missing cache extraction, dangling batch generator) — both fixed. 81/82 tests pass.