[SYCL] add concat through dim 1/2 #8483


Merged 2 commits into master on Jul 15, 2024

Conversation

@airMeng (Collaborator) commented Jul 15, 2024

@github-actions bot added the labels ggml (changes relating to the ggml tensor library for machine learning) and SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language) on Jul 15, 2024
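For context on the PR title: ggml's concat op joins two 4-D tensors along one dimension, with all other dimensions required to match, and this change extends the SYCL backend's support beyond a single dimension. A minimal shape-level sketch of that precondition and result (illustrative only; `concat_shape` is a hypothetical helper, not part of ggml or this PR):

```python
def concat_shape(a, b, dim):
    """Result shape of concatenating 4-D shapes a and b along `dim`.
    All dimensions other than `dim` must match, mirroring ggml's
    concat precondition."""
    assert len(a) == len(b) == 4
    assert all(a[i] == b[i] for i in range(4) if i != dim)
    out = list(a)
    out[dim] = a[dim] + b[dim]
    return out

# Concatenating along dim 1: only ne[1] may differ between inputs.
print(concat_shape([4, 3, 2, 1], [4, 5, 2, 1], dim=1))  # [4, 8, 2, 1]
```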
@airMeng requested a review from @NeoZhangJianyu on July 15, 2024, 02:47
@airMeng (Collaborator, Author) commented Jul 15, 2024

@characharm, can you try DeepSeek on this branch? Thank you in advance; your support is our motivation.

@airMeng requested a review from @OuadiElfarouki on July 15, 2024, 03:37
@airMeng (Collaborator, Author) commented Jul 15, 2024

@OuadiElfarouki, I'm wondering who I should tag for PR reviews at Codeplay. Could you or someone else represent Codeplay for the following PRs? For any Intel-related issues, feel free to tag me.

@NeoZhangJianyu (Collaborator) left a comment


I tested the UT successfully, and verified that DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M.gguf passes on Arc770.

@OuadiElfarouki (Contributor) commented

@airMeng Thanks for the tag. You can reach out to @joeatodd (Team PO at Codeplay), or to @Alcpz or myself.

@airMeng requested a review from @joeatodd on July 15, 2024, 09:31
@OuadiElfarouki (Contributor) left a comment


LGTM Thanks!

@airMeng merged commit 16bdfa4 into master on Jul 15, 2024
54 checks passed
@airMeng deleted the sycl-concat branch on July 15, 2024, 11:32
@characharm (Contributor) commented Jul 15, 2024

> I test UT successfully. And verify DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M.gguf on Arc770 is passed.

Could you please check this model: DeepSeek-Coder-V2-Lite-Instruct-Q6_K? (link) I have tried several times, but the computer completely froze every time after loading the model into memory. Tested on build b3398.

@airMeng (Collaborator, Author) commented Jul 16, 2024

> Could you please check this model: DeepSeek-Coder-V2-Lite-Instruct-Q6_K? link I have tried several times, but the computer completely froze every time after loading the model into memory. Tested on build b3398.

It seems the model file itself is 14.1 GB; how much VRAM does your GPU have?
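For reference, a back-of-envelope estimate of why a 14.1 GB model file can be tight on a 16 GB GPU: the weights are only part of the footprint, since the KV cache and the backend's compute buffers also need VRAM. Every number below other than the file size is an illustrative assumption, not a measurement from this setup:

```python
# Rough VRAM estimate; kv_cache_gb and compute_buf_gb are assumed
# values for illustration, not measured for this model/backend.
model_file_gb  = 14.1   # Q6_K weights; roughly what ends up in VRAM
kv_cache_gb    = 1.3    # grows with context length and model config
compute_buf_gb = 0.5    # backend scratch/compute buffers
total_gb = model_file_gb + kv_cache_gb + compute_buf_gb
print(f"~{total_gb:.1f} GB needed vs 16 GB available")
```

Under these assumptions the total lands just under the 16 GB limit, so even a small extra overhead (longer context, larger batch) can push the run into out-of-memory territory.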

@characharm (Contributor) commented

> seems the model file itself is 14.1GB, how much vram your GPU has?

16 GB, but other models with the same or even slightly larger file size work fine.

@characharm (Contributor) commented Jul 18, 2024

On b3408 it now shuts down at this point; I tried smaller quants, with the same result.

[screenshot of the crash output omitted]
