You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -462,11 +476,9 @@ Building the program with BLAS support may lead to some performance improvements
462
476
- Using manual oneAPI installation:
463
477
By default, `LLAMA_BLAS_VENDOR` is set to `Generic`, so if you already sourced intel environment script and assign `-DLLAMA_BLAS=ON`in cmake, the mkl version of Blas will automatically been selected. Otherwise please install oneAPI and follow the below steps:
464
478
```bash
465
-
mkdir build
466
-
cd build
467
479
source /opt/intel/oneapi/setvars.sh # You can skip this step if in oneapi-basekit docker image, only required for manual installation
@@ -487,10 +499,8 @@ Building the program with BLAS support may lead to some performance improvements
487
499
- Using `CMake`:
488
500
489
501
```bash
490
-
mkdir build
491
-
cd build
492
-
cmake .. -DLLAMA_CUDA=ON
493
-
cmake --build . --config Release
502
+
cmake -B build -DLLAMA_CUDA=ON
503
+
cmake --build build --config Release
494
504
```
495
505
496
506
The environment variable [`CUDA_VISIBLE_DEVICES`](https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars) can be used to specify which GPU(s) will be used. The following compilation options are also available to tweak performance:
@@ -517,8 +527,8 @@ Building the program with BLAS support may lead to some performance improvements
517
527
- Using `CMake`for Linux (assuming a gfx1030-compatible AMD GPU):
On Linux it is also possible to use unified memory architecture (UMA) to share main memory between the CPU and integrated GPU by setting `-DLLAMA_HIP_UMA=ON"`.
524
534
However, this hurts performance for non-integrated GPUs (but enables working with integrated GPUs).
@@ -564,15 +574,14 @@ Building the program with BLAS support may lead to some performance improvements
0 commit comments