llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

* quantize : imatrix-fail early + code cleanup

* fix manual override printing

it's in the preliminary loop now, so needs to be on its own line

* revert header changes per ggerganov

* remove old #includes

* clarify naming

rename `tensor_quantization` to `tensor_typo_option` to descirbe its
functionality

* fix per barto

This commit is contained in:

ddh0

2026-03-10 01:16:05 -05:00

committed by

GitHub

parent c96f608d98

commit 1dab5f5a44

2 changed files with 485 additions and 316 deletions

src/llama-quant.cpp

+466 -285

View File

File diff suppressed because it is too large Load Diff