Georgi Gerganov
|
1da7b76569
|
server : fix speculative decoding with context shift (#10641)
* server : fix speculative decoding with context shift
ggml-ci
* server : take into account speculative limits
ggml-ci
* server : add tests
|
2024-12-04 22:38:20 +02:00 |
|