[mtp] Mixed MTP + batched generation phases 1-3 (#49) #64

Merged
sleepy merged 1 commit from feature/49-mixed-mtp-batch into main 2026-05-15 18:26:58 +02:00

Summary

Fixes #49. Allows MTP speculative decoding and standard batched generation to run concurrently.

Changes

  1. Phase 1: Remove the MTP→batch transition, so MTP requests stay in MTP mode when new requests arrive
  2. Phase 2: In mixed mode, run _mtp_step() for MTP-enabled requests alongside the batch generator
  3. Phase 3: _mtp_step() looks up an MTP-enabled request instead of always taking the first running request
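
The three phases above can be illustrated with a minimal sketch. This is not the code from scheduler.py: the Request class, its mtp_enabled flag, and the helpers find_mtp_request and plan_mixed_step are hypothetical names chosen for illustration, and the real scheduler's data structures may differ.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Request:
    # Hypothetical request record; the real scheduler's fields may differ.
    request_id: str
    mtp_enabled: bool = False


def find_mtp_request(running: List[Request]) -> Optional[Request]:
    """Phase 3 idea: pick an MTP-enabled request, not simply running[0]."""
    return next((r for r in running if r.mtp_enabled), None)


def plan_mixed_step(running: List[Request]) -> Dict[str, List[str]]:
    """Phases 1 and 2 idea: split one scheduling step into two paths.

    MTP-enabled requests stay on the MTP path even when other requests
    are running; everything else goes through the batch path.
    """
    mtp_ids = [r.request_id for r in running if r.mtp_enabled]
    batch_ids = [r.request_id for r in running if not r.mtp_enabled]
    return {"mtp": mtp_ids, "batch": batch_ids}


if __name__ == "__main__":
    running = [Request("a"), Request("b", mtp_enabled=True), Request("c")]
    print(find_mtp_request(running))
    print(plan_mixed_step(running))
```

In this sketch, a newly arrived non-MTP request ("c") simply joins the batch list and does not change which path the existing MTP request ("b") takes.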

Test Results

81 of 82 scheduler tests pass. The single failure is pre-existing (boundary snapshot token tracking).

Diff

+32/-40 lines in scheduler.py (net reduction of 8 lines). The changes are small and focused.

sleepy force-pushed feature/49-mixed-mtp-batch from d043a32975 to 3a231bb00d 2026-05-15 18:26:11 +02:00
sleepy merged commit d7a19a793a into main 2026-05-15 18:26:58 +02:00
Reference
sleepy/omlx!64