Investigate CPY overhead (159 MB/tick at 9B) #31
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Problem
CPY reads 106 MB and writes 53 MB per tick. These are actual GPU memory copies for type conversion and layout transformation. They are not zero-cost.
Data (9B Q4_0, ctx=256)
Questions
Approach
Investigate CPY overhead 159 MB per tick at 9Bto Investigate CPY overhead (159 MB/tick at 9B)