ggml : allow CUDA graphs when using pipeline parallelism #13814

slaren · 2025-05-27T00:05:54Z

Due to changes in the graph in each evaluation when pipeline parallelism is enabled, CUDA graphs could not be used. This change ensures that during generation the graphs will not change, allowing the use of CUDA graphs when pipeline parallelism is enabled.

Fixes #13751

ggerganov · 2025-05-30T10:24:38Z

Not sure, but this change might be causing some of the following issues:

#13879, #13906, #13909

slaren · 2025-05-30T10:27:42Z

I have seen the issues, but I cannot reproduce it.

ggml : allow CUDA graphs when using pipeline parallelism

c17627c

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label May 27, 2025

slaren mentioned this pull request May 27, 2025

Large performance drop when using pipeline parallelism and layer splitting on multiple GPUs #13751

Closed

koush mentioned this pull request May 27, 2025

cuda: fix layer split mode preventing cuda graph compilation #13815

Closed

ggerganov approved these changes May 27, 2025

View reviewed changes

slaren merged commit 952f395 into master May 27, 2025
46 checks passed

slaren deleted the sl/fix-cuda-graphs-pp branch May 27, 2025 11:05

pjguzman mentioned this pull request May 30, 2025

Eval bug: CUDA error: an illegal memory access was encountered on mistral-small-3.1-24b-instruct with mmproj #13879

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml : allow CUDA graphs when using pipeline parallelism #13814

ggml : allow CUDA graphs when using pipeline parallelism #13814

Uh oh!

slaren commented May 27, 2025

Uh oh!

Uh oh!

ggerganov commented May 30, 2025

Uh oh!

slaren commented May 30, 2025

Uh oh!

Uh oh!

ggml : allow CUDA graphs when using pipeline parallelism #13814

ggml : allow CUDA graphs when using pipeline parallelism #13814

Uh oh!

Conversation

slaren commented May 27, 2025

Uh oh!

Uh oh!

ggerganov commented May 30, 2025

Uh oh!

slaren commented May 30, 2025

Uh oh!

Uh oh!