ggml-virtgpu: improve the reliability of the code (#19846)

* ggml-virtgpu-backend: validate the consistency of the received objects

This patch adds consistency checks in the
ggml-virtgpu-backend (running on the host side) to ensure that the
data received from the guest is consistent (valid pointers, valid
sizes and offsets).

* ggml-virtgpu-backend: add fallback/skips for optional ggml backend methods

```
  1. bck->iface.synchronize(bck)
  2. buft->iface.get_alloc_size(buft, op)
  3. buft->iface.get_max_size(buft)
```

these three methods are optional in the GGML interface. `get_max_size`
was already properly defaulted, but `backend sychronize` and `butf
get_max_size` would have segfaulted the backend if not implemented.

* ggml-virtgpu-backend: fix log format missing argument

* ggml-virtgpu-backend: improve the abort message

* ggml-virtgpu-backend: more safety checks

* ggml-virtgpu-backend: new error code

* ggml-virtgpu-backend: initialize all the error codes

* ggml-virtgpu: add a missing comment generated by the code generator

* ggml-virtgpu: add the '[virtgpu]' prefix to the device/buffer names

* ggml-virtgpu: apir_device_buffer_from_ptr: improve the error message

* ggml-virtgpu: shared: make it match the latest api_remoting.h of Virglrenderer APIR

(still unmerged)

* ggml-virtgpu: update the code generator to have dispatch_command_name in a host/guest shared file

* ggml-virtgpu: REMOTE_CALL: fail if the backend returns an error

* docs/backend/VirtGPU.md: indicate that the RAM+VRAM size is limed to 64 GB with libkrun

* ggml-virtgpu: turn off clang-format header ordering for some of the files

Compilation breaks when ordered alphabetically.

* ggml-virtgpu: clang-format

* ggml-virtgpu/backend/shared/api_remoting: better comments for the APIR return codes

This commit is contained in:

Kevin Pouget

2026-02-26 13:00:57 +01:00

committed by

GitHub

parent efba35a860

commit ffaafde16f

30 changed files with 398 additions and 257 deletions

									
										docs/backend/VirtGPU.md
									
		+3
		-1
	
												View File
												
				@@ -152,7 +152,9 @@ Commands and data are serialized using a custom binary protocol with:

				- **VM-specific**: Only works in virtual machines with virtio-gpu support

				- **Host dependency**: Requires properly configured host-side backend

				- **Latency**: Small overhead from VM escaping for each operation

				- **Shared-memory size**: with the `libkrun` hypervisor, the RAM + VRAM

				  addressable memory is limited to 64 GB. So the maximum GPU memory

				  will be `64GB - RAM`, regardless of the hardware VRAM size.

				* This work is pending upstream changes in the VirglRenderer

				  project.