ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)

* Work towards removing bitcast

* Move rest of existing types over

* Add timeout back to wait and remove synchronous set_tensor/memset_tensor

* move to unpackf16 for wider compatibility

* cleanup

* Remove deadlock condition in free_bufs

* Start work on removing parameter buffer pools

* Simplify and optimize further

* simplify profile futures

* Fix stride

* Try using a single command buffer per batch

* formatting
This commit is contained in:
Reese Levine
2026-04-03 11:40:14 -07:00
committed by GitHub
parent e439700992
commit d006858316
2 changed files with 373 additions and 416 deletions
File diff suppressed because it is too large Load Diff