llama.cpp

Files

Rail Chabdarov 5a32a9b8a5 Fix data race in CUDA's "cpy" kernel (influences GGML's DUP, CONT operations). (#20507)

* Fix data race in CUDA's "cpy" kernel.

* Remove extra barrier by using more shared memory.

2026-03-14 13:19:44 +08:00

2025-08-07 13:45:41 +02:00

2026-03-11 22:46:40 +02:00

2026-03-14 13:19:44 +08:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2026-03-13 14:36:13 +01:00