You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/build.md
+35-11Lines changed: 35 additions & 11 deletions
Original file line number
Diff line number
Diff line change
@@ -197,28 +197,52 @@ The following compilation options are also available to tweak performance:
197
197
198
198
## MUSA
199
199
200
-
This provides GPU acceleration using the MUSA cores of your Moore Threads MTT GPU. Make sure to have the MUSA SDK installed. You can download it from here: [MUSA SDK](https://developer.mthreads.com/sdk/download/musa).
200
+
This provides GPU acceleration using a Moore Threads GPU. Make sure to have the [MUSA SDK](https://developer.mthreads.com/musa/musa-sdk) installed.
201
201
202
-
- Using `CMake`:
202
+
#### Download directly from Moore Threads
203
203
204
-
```bash
205
-
cmake -B build -DGGML_MUSA=ON
206
-
cmake --build build --config Release
207
-
```
204
+
You may find the official downloads here: [Moore Threads developer site](https://developer.mthreads.com/sdk/download/musa).
208
205
209
-
For static build:
206
+
### Compilation
210
207
211
-
```bash
208
+
```bash
209
+
cmake -B build -DGGML_MUSA=ON
210
+
cmake --build build --config Release
211
+
```
212
+
213
+
#### Override Compute Capability Specifications
214
+
215
+
By default, all supported compute capabilities are enabled. To customize this behavior, you can specify the `MUSA_ARCHITECTURES` option in the CMake command:
The environment variable [`MUSA_VISIBLE_DEVICES`](https://docs.mthreads.com/musa-sdk/musa-sdk-doc-online/programming_guide/Z%E9%99%84%E5%BD%95/) can be used to specify which GPU(s) will be used.
234
+
### Runtime MUSA environmental variables
218
235
219
-
The environment variable `GGML_CUDA_ENABLE_UNIFIED_MEMORY=1` can be used to enable unified memory in Linux. This allows swapping to system RAM instead of crashing when the GPU VRAM is exhausted.
236
+
You may set the [musa environmental variables](https://docs.mthreads.com/musa-sdk/musa-sdk-doc-online/programming_guide/Z%E9%99%84%E5%BD%95/) at runtime.
220
237
221
-
Most of the compilation options available for CUDA should also be available for MUSA, though they haven't been thoroughly tested yet.
238
+
```bash
239
+
# Use `MUSA_VISIBLE_DEVICES` to hide the first compute device.
The environment variable `GGML_CUDA_ENABLE_UNIFIED_MEMORY=1` can be used to enable unified memory in Linux. This allows swapping to system RAM instead of crashing when the GPU VRAM is exhausted.
0 commit comments