
CUDA: revert part of the RDNA1 optimizations #8309


Merged

Conversation

@daniandtheweb (Contributor) commented Jul 4, 2024

The change to the launch_bounds was causing a small performance drop in prompt processing; apparently that change was only beneficial before I tuned the mmq_y values.

| model | size | params | backend | ngl | test | t/s master | t/s PR | Speedup |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| llama 8B Q5_K - Small | 5.21 GiB | 8.03 B | ROCm | 99 | pp512 | 276.60 ± 0.41 | 300.60 ± 0.46 | 1.09 |
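For context, `__launch_bounds__` tells the CUDA/HIP compiler the maximum threads per block (and optionally a minimum number of resident blocks per SM), which constrains how many registers each thread may use. A minimal sketch of the kind of attribute being reverted here; the kernel name, tile macros, and values below are illustrative, not llama.cpp's actual configuration:

```cuda
// Illustrative tile-size macros (stand-ins for the tuned mmq_y values
// mentioned above; not the real llama.cpp constants).
#define MMQ_Y   128   // hypothetical tile height
#define NWARPS  8     // hypothetical warps per block
#define WARP_SIZE 32

// __launch_bounds__(maxThreadsPerBlock, minBlocksPerMultiprocessor):
// an overly tight bound can force register spills, while a looser one
// lets the compiler allocate more registers per thread. Which is faster
// depends on the tile sizes, which is why reverting it helped here.
__global__ void __launch_bounds__(WARP_SIZE * NWARPS, 2)
mul_mat_q_sketch(const void *x, const void *y, float *dst,
                 int ncols_x, int nrows_x, int ncols_y) {
    // ... tiled quantized matrix multiplication body ...
}
```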

The change to the launch_bounds was causing a small performance drop in prompt processing of about 25 t/s.
@github-actions github-actions bot added the Nvidia GPU Issues specific to Nvidia GPUs label Jul 4, 2024
@JohannesGaessler JohannesGaessler merged commit 0a42380 into ggml-org:master Jul 5, 2024
49 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024
The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s