Files
llama.cpp/common
Jillis ter Hove 52e5f0a5c1 common : re-arm reasoning budget after DONE on new <think> (#22323)
DONE state absorbs all tokens including a new start tag, causing any think blocks after the first to run unbudgeted. Observed on unsloth/Qwen3.6-27B-GGUF which interleaves multiple <think> blocks per response.

Fixed by advancing start_matcher in DONE branch and re-arming to COUNTING with a fresh budget on match. Adds regression test (test-reasoning-budget: test 6).
2026-04-28 19:15:36 +02:00
..
2026-04-28 09:07:33 +03:00
2026-04-28 09:07:33 +03:00
2026-04-28 09:07:33 +03:00
2026-04-28 09:07:33 +03:00
2026-01-30 18:21:48 +02:00
2026-04-28 09:07:33 +03:00
2026-04-28 09:07:33 +03:00