llama.cpp

Files

T

Vishal Singh f1ac84119c ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315 )

* ggml-zendnn : add MUL_MAT_ID op support for MoE models
- Add MUL_MAT_ID op acceleration for Mixture-of-Experts models
- MUL_MAT_ID op fallback to CPU backend if total experts > 32
- Point ZenDNN lib to latest bits ZenDNN-2026-WW13

* ggml-zendnn : add braces to sgemm failure condition for consistency

Co-authored-by: Aaron Teo <taronaeo@gmail.com>

---------

Co-authored-by: Aaron Teo <taronaeo@gmail.com>

2026-04-03 12:19:08 +03:00

snapdragon

chore : correct typos [no ci] (#20041 )

2026-03-05 08:50:21 +01:00

VirtGPU

ggml-virtgpu: Fix some build commands (#20341 )

2026-03-12 15:47:45 +08:00

BLIS.md

make : deprecate (#10514 )

2024-12-02 21:22:53 +02:00

CANN.md

CANN: update docker images to 8.5.0 and improve CANN.md (#20801 )