server: allow router to report child instances sleep status (#20849)

* server: allow router to report child instances sleep status

* refactor

* move sleeping to state

* nits
This commit is contained in:
Xuan-Son Nguyen
2026-03-22 18:33:52 +01:00
committed by GitHub
parent bd3f1d9d65
commit 49bfddeca1
7 changed files with 102 additions and 43 deletions
+7
View File
@@ -1634,6 +1634,13 @@ The `status` object can be:
}
```
```json
"status": {
"value": "sleeping",
"args": ["llama-server", "-ctx", "4096"]
}
```
### POST `/models/load`: Load a model
Load a model