bug: Model generates garbage output (repetitive tokens) #62
The model produces garbage/repetitive output instead of coherent text.
Reproduction
Output:
和!!!!!!!!!!!!!!!!!!!
Expected: Coherent English text
Observations
Possible Causes
Verification Needed
Related
src/tests/correctness.zig (hardcoded to 4B model)
Merged. Three bugs fixed: linear_norm F32→BF16 conversion, undersized q_gate buffers, and RMS norm formula (weight → 1+weight).
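To illustrate the RMS norm fix (weight → 1+weight), here is a minimal Python sketch, not the project's actual Zig code. It assumes the checkpoint stores norm weights zero-centered, which is what the fix description suggests; the function name `rms_norm`, the `offset` parameter, the epsilon value, and the sample values are all hypothetical.

```python
import math

def rms_norm(x, weight, eps=1e-6, offset=0.0):
    """RMS-normalize x, then scale each element by (offset + weight)."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * (offset + w) for v, w in zip(x, weight)]

# Hypothetical activations and zero-centered weights (assumption, see above).
x = [1.0, -2.0, 3.0, -4.0]
weight = [0.0, 0.0, 0.0, 0.0]

# Scaling by `weight` alone wipes out the signal when weights sit near zero.
plain = rms_norm(x, weight)

# Scaling by `1 + weight` keeps the normalized activations intact.
shifted = rms_norm(x, weight, offset=1.0)
```

With weights near zero, the first form suppresses almost all of the activation, which could plausibly lead to degenerate output like the repeated tokens reported above; the second form leaves a unit-RMS vector.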
Benchmark on M4 Pro Qwen3.5-0.8B: