Suggest setting AMDGPU_TARGETS #4011

ardfork · 2023-11-09T18:41:04Z

Currently, when building llama.cpp with CMake and the hipBLAS option, it builds for gfx900, gfx906, gfx908, gfx90a, and gfx1030 with ROCm 5.6.1 or older, and only for gfx906 with ROCm 5.7.0 or 5.7.1 (there the default is set to empty, which results in building only for gfx906).

The former has already raised multiple issues reporting that the CMake build doesn't work on RDNA3 cards, and once ROCm 5.7.0 or 5.7.1 is available in popular Linux distributions, the latter will surely raise issues for most AMD card owners.

This issue was fixed upstream 3 days ago; the upstream fix does something similar to this suggestion.

It might be better to implement AMD GPU target selection in CMakeLists.txt, defaulting to the devices present in the computer, with an option for package maintainers to set the GPU targets explicitly.
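To illustrate the "detected devices by default, explicit override for packagers" idea, here is a minimal shell sketch. It is not what this PR implements: the function name, the ARCH_CMD hook, and reading the override from an AMDGPU_TARGETS environment variable are all assumptions made for the example. It relies only on amdgpu-arch printing one ISA per line.

```shell
#!/bin/sh
# Sketch only: pick AMDGPU targets for a hipBLAS build.
# Assumptions (not from this PR): the function name, the ARCH_CMD hook,
# and the AMDGPU_TARGETS environment override are hypothetical.

# Print a semicolon-separated, deduplicated list of GPU ISAs.
# An explicit AMDGPU_TARGETS value (e.g. set by a package maintainer) wins;
# otherwise the list is built from the arch-listing command, which is
# expected to print one ISA per line (amdgpu-arch by default).
select_amdgpu_targets() {
    if [ -n "${AMDGPU_TARGETS:-}" ]; then
        printf '%s\n' "$AMDGPU_TARGETS"
    else
        "${ARCH_CMD:-amdgpu-arch}" | sort -u | paste -sd ';' -
    fi
}
```

The result could then be passed to the build as -DAMDGPU_TARGETS="$(select_amdgpu_targets)". Sorting and deduplicating means a machine with two identical GPUs still yields a single target.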

~~Also, I noticed that the way the Makefile selects GPU targets probably doesn't work on a computer with multiple GPUs that use different ISAs.~~

ardfork · 2023-11-09T21:52:56Z

I also discovered that you can pass native to AMDGPU_TARGETS or --offload-arch, but I'm not sure whether it works with multiple GPUs using different ISAs. That would be cleaner than calling the amdgpu-arch binary and could also be applied to the Makefile.

Edit: After looking into it further, the implementation does check for multiple GPUs; it basically calls the same binary. Also, I was wrong about the Makefile: it handles multiple ISAs correctly. I tested this by replacing the amdgpu-arch binary with something that simply returns gfx1030 and gfx1100; both CMake with -DAMDGPU_TARGETS=native and the current Makefile correctly build binaries for gfx1030 and gfx1100.

ardfork force-pushed the amdgpu-targets branch from b070b36 to 811c26a · November 9, 2023 23:16

ardfork mentioned this pull request Nov 10, 2023

Make hipBLAS CMake more similar to cuBLAS #4024

Closed

ggerganov closed this Jan 13, 2024
