Skip to content

Commit df68d4f

Browse files
authored
[SYCL] Update README-sycl.md for Chapter "Recommended release" and "News" (#7946)
* Update README-sycl.md * Update README-sycl.md * Update README-sycl.md * Update README-sycl.md
1 parent 43b35e3 commit df68d4f

File tree

1 file changed

+16
-0
lines changed

1 file changed

+16
-0
lines changed

README-sycl.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
# llama.cpp for SYCL
22

33
- [Background](#background)
4+
- [Recommended Release](#recommended-release)
45
- [News](#news)
56
- [OS](#os)
67
- [Hardware](#hardware)
@@ -31,8 +32,23 @@ When targeting **Intel CPU**, it is recommended to use llama.cpp for [Intel oneM
3132

3233
It has the similar design of other llama.cpp BLAS-based paths such as *OpenBLAS, cuBLAS, etc..*. In beginning work, the oneAPI's [SYCLomatic](https://github.com/oneapi-src/SYCLomatic) open-source migration tool (Commercial release [Intel® DPC++ Compatibility Tool](https://www.intel.com/content/www/us/en/developer/tools/oneapi/dpc-compatibility-tool.html)) was used for this purpose.
3334

35+
## Recommended Release
36+
37+
The SYCL backend would be broken by some PRs due to no online CI.
38+
39+
The following release is verified with good quality:
40+
41+
|Commit ID|Tag|Release|Verified Platform|
42+
|-|-|-|-|
43+
|fb76ec31a9914b7761c1727303ab30380fd4f05c|b3038 |[llama-b3038-bin-win-sycl-x64.zip](https://github.com/ggerganov/llama.cpp/releases/download/b3038/llama-b3038-bin-win-sycl-x64.zip) |Arc770/Linux/oneAPI 2024.1<br>MTL Arc GPU/Windows 11/oneAPI 2024.1|
44+
45+
3446
## News
3547

48+
- 2024.5
49+
- Performance is increased: 34 -> 37 tokens/s of llama-2-7b.Q4_0 on Arc770.
50+
- Arch Linux is verified successfully.
51+
3652
- 2024.4
3753
- Support data types: GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M.
3854

0 commit comments

Comments
 (0)