Profile graph fusion effectiveness #34
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Problem
GGML_METAL_FUSION_DISABLE=1 has negligible impact on tok/s. Graph debug shows some ops being fused, but the benefit appears minimal.
Data (9B Q4_0)
Only 0.7 tok/s difference (1.3%).
Questions
Approach