llama.cpp/ggml/src
Masashi Yoshimura 6da7168312 ggml-webgpu: Add fused RMS_NORM + MUL (#21983)
* fused rms_norm_mul + mul

* Add GGML_WEBGPU_DISABLE_FUSION to allow disabling kernel fusion.

* Decouple num_fused_ops from webgpu_context; misc cleanup

* Fix eps handling and remove disable_fusion.

* Avoid using C++20 initializers.
2026-04-22 10:51:40 -07:00
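
For readers unfamiliar with what fusing RMS_NORM and MUL means, here is a minimal CPU-side sketch in C++. It is not the WebGPU shader from this commit. The function names (`row_scale`, `rms_norm_then_mul`, `rms_norm_mul_fused`) are hypothetical, and the sketch assumes a single non-empty row with a weight vector of the same length. The unfused path materializes the normalized row before scaling it; the fused path produces the same result in one pass over the output.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: 1 / sqrt(mean(x^2) + eps) for one row.
// Assumes x is non-empty.
static float row_scale(const std::vector<float>& x, float eps) {
    double sum_sq = 0.0;
    for (float v : x) {
        sum_sq += static_cast<double>(v) * v;
    }
    const float mean = static_cast<float>(sum_sq / x.size());
    return 1.0f / std::sqrt(mean + eps);
}

// Unfused reference: RMS_NORM writes an intermediate row, then MUL scales it.
// Assumes w.size() == x.size().
std::vector<float> rms_norm_then_mul(const std::vector<float>& x,
                                     const std::vector<float>& w,
                                     float eps) {
    const float scale = row_scale(x, eps);
    std::vector<float> normed(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        normed[i] = x[i] * scale;      // RMS_NORM output (intermediate buffer)
    }
    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = normed[i] * w[i];     // MUL reads the intermediate buffer
    }
    return out;
}

// Fused variant: same math, but no intermediate buffer is written or re-read.
std::vector<float> rms_norm_mul_fused(const std::vector<float>& x,
                                      const std::vector<float>& w,
                                      float eps) {
    const float scale = row_scale(x, eps);
    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = x[i] * scale * w[i];  // normalize and multiply in one step
    }
    return out;
}
```

On a GPU backend, the likely benefit of this kind of fusion is one fewer kernel dispatch and no round trip of the intermediate row through memory; the actual savings depend on the backend and tensor shapes.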