* update guide for make installation, memory, gguf model link, rm todo for windows build
* add vs install requirement
* update for gpu device check
* update help of llama-bench
* fix grammar issues
|Intel iGPU| Support| iGPU in i5-1250P, i7-1165G7|
+Note: If the iGPU has fewer than 80 EUs (Execution Units), inference will be too slow to be usable.
+
+### Memory
+
+Memory is a limiting factor when running LLMs on GPUs.
+
+When llama.cpp runs, it logs how much GPU memory it allocates, so you can see how much memory your case needs. For example: `llm_load_tensors: buffer size = 3577.56 MiB`.
+
+For iGPU, please make sure enough host memory is available to be shared with the GPU. For llama-2-7b.Q4_0, 8GB+ of host memory is recommended.
+
+For dGPU, please make sure the device memory is sufficient. For llama-2-7b.Q4_0, 4GB+ of device memory is recommended.
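As a rough illustration of using that log line, the sketch below extracts the MiB value from `llm_load_tensors` lines and compares it with a memory budget. The function names `buffer_sizes_mib` and `fits_in_mib` are hypothetical helpers, not part of llama.cpp, and the sketch assumes the log line has the same shape as the example quoted in the Memory section.

```shell
# Hypothetical helpers (not part of llama.cpp); they read a saved log on stdin.

# Print the number that follows "=" on each llm_load_tensors "buffer size" line.
buffer_sizes_mib() {
  awk '/llm_load_tensors:/ && /buffer size =/ {
         for (i = 1; i < NF; i++)
           if ($i == "=") { print $(i + 1); break }
       }'
}

# Succeed when the required size in MiB ($1) fits within the budget in MiB ($2).
fits_in_mib() {
  awk -v need="$1" -v have="$2" 'BEGIN { exit !(need + 0 <= have + 0) }'
}
```

With the log saved to a file (a hypothetical `run.log`), `buffer_sizes_mib < run.log` prints the reported sizes, and `fits_in_mib 3577.56 4096 && echo "fits"` checks the example value against a 4096 MiB budget.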
-There should be one or more level-zero devices. Like **[ext_oneapi_level_zero:gpu:0]**.
+There should be one or more level-zero devices. Please confirm that at least one GPU is present, like **[ext_oneapi_level_zero:gpu:0]**.
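The check described here can be scripted. The sketch below is only an illustration: `has_level_zero_gpu` and `count_level_zero_gpus` are hypothetical helper names, and the pattern assumes device entries are printed in the `[ext_oneapi_level_zero:gpu:N]` form quoted in the text.

```shell
# Hypothetical helpers: both read a device listing (e.g. from sycl-ls) on stdin.

# Succeed if at least one Level Zero GPU entry is present.
has_level_zero_gpu() {
  grep -q 'ext_oneapi_level_zero:gpu:[0-9]'
}

# Print how many lines mention a Level Zero GPU (prints 0 when none do).
count_level_zero_gpus() {
  grep -c 'ext_oneapi_level_zero:gpu:[0-9]' || true
}
```

In an environment where oneAPI is enabled, `sycl-ls | has_level_zero_gpu && echo "Level Zero GPU found"` performs the same confirmation without reading the listing by eye.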
Output (example):
```
@@ -152,6 +165,8 @@ Note:
1. Put the model file in the **models** folder

+You can download [llama-2-7b.Q4_0.gguf](https://huggingface.co/TheBloke/Llama-2-7B-GGUF/blob/main/llama-2-7b.Q4_0.gguf) as an example.
+
2. Enable oneAPI running environment
```
@@ -223,7 +238,13 @@ Using device **0** (Intel(R) Arc(TM) A770 Graphics) as main device
Please install the Intel GPU driver by following the official guide: [Install GPU Drivers](https://www.intel.com/content/www/us/en/products/docs/discrete-gpus/arc/software/drivers.html).

-2. Install Intel® oneAPI Base toolkit.
+Note: **The driver is mandatory for compute functionality**.
+
+2. Install Visual Studio.
+
+Please install [Visual Studio](https://visualstudio.microsoft.com/), which the oneAPI environment setup on Windows depends on.
+
+3. Install Intel® oneAPI Base toolkit.

a. Please follow the procedure in [Get the Intel® oneAPI Base Toolkit](https://www.intel.com/content/www/us/en/developer/tools/oneapi/base-toolkit.html).
@@ -252,23 +273,29 @@ In oneAPI command line:
sycl-ls
```
-There should be one or more level-zero devices. Like **[ext_oneapi_level_zero:gpu:0]**.
+There should be one or more level-zero devices. Please confirm that at least one GPU is present, like **[ext_oneapi_level_zero:gpu:0]**.
+a. Download & install cmake for Windows: https://cmake.org/download/

-a. Download & install cmake for windows: https://cmake.org/download/
+b. Download & install make for Windows provided by mingw-w64

-b. Download & install make for windows provided by mingw-w64: https://www.mingw-w64.org/downloads/
+- Download the binary package for Windows from https://github.com/niXman/mingw-builds-binaries/releases.

+For example, [x86_64-13.2.0-release-win32-seh-msvcrt-rt_v11-rev1.7z](https://github.com/niXman/mingw-builds-binaries/releases/download/13.2.0-rt_v11-rev1/x86_64-13.2.0-release-win32-seh-msvcrt-rt_v11-rev1.7z).
+
+- Unzip the binary package. In the **bin** sub-folder, rename **xxx-make.exe** to **make.exe**.
+
+- Add the **bin** folder path to the Windows system PATH environment variable.
### Build locally:
@@ -309,6 +336,8 @@ Note:
1. Put the model file in the **models** folder

+You can download [llama-2-7b.Q4_0.gguf](https://huggingface.co/TheBloke/Llama-2-7B-GGUF/blob/main/llama-2-7b.Q4_0.gguf) as an example.
+
2. Enable oneAPI running environment

- In Search, input 'oneAPI'.
@@ -419,8 +448,25 @@ Using device **0** (Intel(R) Arc(TM) A770 Graphics) as main device
The oneAPI running environment has not been enabled.

-## Todo
+- A compile error occurs.
+
+Remove the **build** folder and try again.
+
+- I can **not** see **[ext_oneapi_level_zero:gpu:0]** after installing the GPU driver in Linux.

-- Support to build in Windows.
+Please run **sudo sycl-ls**.
+
+If the device appears in the output, please add your user to the video and render groups:
+
+```
+sudo usermod -aG render username
+sudo usermod -aG video username
+```
+
+Then **log in again**.
+
+If the device still does not appear, please check the GPU installation steps again.
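Before running the `usermod` commands, it can help to confirm which of the two groups the current user is actually missing. The sketch below is one possible way to do that; `missing_gpu_groups` is a hypothetical helper name, and it assumes the group list is passed as a single space-separated string such as the output of `id -nG`.

```shell
# Hypothetical helper: print each of the render/video groups that is absent
# from the space-separated group list given as the first argument.
missing_gpu_groups() {
  for wanted in render video; do
    case " $1 " in
      *" $wanted "*) ;;        # already a member, nothing to report
      *) echo "$wanted" ;;     # not a member, report the group name
    esac
  done
}
```

Running `missing_gpu_groups "$(id -nG)"` prints nothing when the user already belongs to both groups. Group changes made with `usermod` only show up in `id -nG` after logging in again.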