ggml-virtgpu: make the code thread safe (#19204)

* ggml-virtgpu: regenerate_remoting.py: add the ability to deprecate a function

* ggml-virtgpu: deprecate buffer_type is_host remoting

not necessary

* ggml-virtgpu: stop using static vars as cache

The static init isn't thread safe.

* ggml-virtgpu: protect the use of the shared memory to transfer data

* ggml-virtgpu: make the remote calls thread-safe

* ggml-virtgpu: backend: don't continue if couldn't allocate the tensor memory

* ggml-virtgpu: add a cleanup function for consistency

* ggml-virtgpu: backend: don't crash if buft->iface.get_max_size is missing

* fix style and ordering

* Remove the static variable in apir_device_get_count

* ggml-virtgpu: improve the logging

* fix review minor formatting changes

This commit is contained in:

Kevin Pouget

2026-02-04 03:46:18 +01:00

committed by

GitHub

parent 2ceda3f662

commit 015deb9048

27 changed files with 397 additions and 237 deletions

									
										ggml/include/ggml-virtgpu.h
									
		-2
	
												View File
												
				@@ -7,8 +7,6 @@

				extern "C" {

				#endif

				#define GGML_REMOTING_FRONTEND_NAME "RemotingFrontend"

				GGML_BACKEND_API ggml_backend_reg_t ggml_backend_virtgpu_reg();

				#ifdef  __cplusplus