cuda : fix 2-bit quants on amd hip #5105

Merged · 2 commits · Jan 24, 2024

Conversation

Engininja2 (Contributor)

Fixes evaluation of iq2_xxs and iq2_xs quants on AMD HIP, where HIP's half2 components appear to be shorts instead of half. Without this fix, main would generate '######' and some MUL_MAT_ID and MUL_MAT ops would fail.

@JohannesGaessler (Collaborator) left a comment


There are functions __low2half and __high2half that do the exact same thing as .x and .y on NVIDIA but, notably, also do the correct thing on AMD. I suggest you use those instead; check the rest of the code for examples.
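The pattern under discussion can be sketched as below. This is an illustrative device function, not the actual llama.cpp kernel code; __low2float, __low2half, and __high2half are real CUDA half-precision intrinsics, but the function name and context here are hypothetical.

```cuda
#include <cuda_fp16.h>

// Hypothetical helper illustrating portable half2 component access.
// On NVIDIA, h.x / h.y are __half, so direct member access works; under
// HIP, the half2 components can behave like shorts, so a cast such as
// (float)h.x may reinterpret the bit pattern instead of converting the
// half-precision value. The intrinsics below are correct on both.
__device__ float scale_from_half2(const half2 h) {
    // Portable: extract the low half and convert it to float in one step.
    return __low2float(h);  // equivalent to __half2float(__low2half(h))
}

__device__ float sum_half2(const half2 h) {
    // Portable access to both components via the intrinsics.
    return __half2float(__low2half(h)) + __half2float(__high2half(h));
}
```

Since the extracted value is immediately consumed as a float in the affected kernels, __low2float avoids a separate conversion step.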

@Artefact2 (Collaborator)

Thanks for this! I can confirm this works on gfx1030 on Linux/ROCm 5.7.1.

@sorasoras

Tested on Windows with ROCm 5.7.1. I can confirm it works on my 7900 XTX.

@Engininja2 (Contributor, Author)

I used __low2float since the value was immediately being cast to float, and that function is already used elsewhere in the code.

@JohannesGaessler JohannesGaessler merged commit cd4fddb into ggml-org:master Jan 24, 2024
@Engininja2 Engininja2 deleted the fix-2bit-quants-amd branch January 31, 2024 14:29
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Feb 3, 2024
* cuda : fix 2-bit quants on amd hip

* use __low2float intrinsic function for new quants
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
* cuda : fix 2-bit quants on amd hip

* use __low2float intrinsic function for new quants