[feature] SpecPrefill attention-based sparse prefill for MoE #58
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The scheduler has SpecPrefill infrastructure (commit
c5081e3) but it needs work to be production-ready for MoE models like Mixtral, Qwen3-MoE.Current state: draft code exists, disabled by default.
Acceptance criteria:
Closed — not prioritized.