@jeffbolznv (Collaborator):
The goal is to enable the async loading code paths in `llama_model_loader::load_all_data`, originally from #7896. This works, and the loads themselves are faster, but with host-visible vidmem I think the cost of allocating/mapping vidmem moves elsewhere and becomes more expensive, so I don't see a benefit by default. With `GGML_VK_DISABLE_HOST_VISIBLE_VIDMEM=1`, however, I do see a significant improvement in model loading time.

It would be interesting to test on Linux how this interacts with #18012.
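For context, a sketch of how the two configurations could be compared; the binary name and model path are illustrative placeholders, and only the environment variable name comes from this PR:

```shell
# Time model loading on the default path (host-visible vidmem enabled).
# ./llama-cli and model.gguf are placeholders for your build and model.
time ./llama-cli -m model.gguf -p "hi" -n 1

# Same run with host-visible vidmem disabled, where the async loading
# path in this PR showed a significant loading-time improvement.
GGML_VK_DISABLE_HOST_VISIBLE_VIDMEM=1 \
  time ./llama-cli -m model.gguf -p "hi" -n 1
```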

jeffbolznv requested a review from 0cc4m as a code owner on December 15, 2025.
The github-actions bot added the "Vulkan" (issues specific to the Vulkan backend) and "ggml" (changes relating to the ggml tensor library for machine learning) labels on December 15, 2025.