docker/README.md
16 additions & 5 deletions
@@ -1,10 +1,21 @@
-# Dockerfiles for building the llama-cpp-python server
--`Dockerfile.openblas_simple` - a simple Dockerfile for non-GPU OpenBLAS
--`Dockerfile.cuda_simple` - a simple Dockerfile for CUDA accelerated CuBLAS
--`hug_model.py` - a Python utility for interactively choosing and downloading the latest `5_1` quantized models from [huggingface.co/TheBloke](https://huggingface.co/TheBloke)
+# Simple Dockerfiles for building the llama-cpp-python server with external model bin files
+-`./openblas_simple/Dockerfile` - a simple Dockerfile for non-GPU OpenBLAS, where the model is located outside the Docker image
+-`cd ./openblas_simple`
+-`docker build -t openblas_simple .`
+-`docker run -e USE_MLOCK=0 -e MODEL=/var/model/<model-path> -v <model-root-path>:/var/model -t openblas_simple`
+where `<model-root-path>/<model-path>` is the full path to the model file on the Docker host system.
+-`./cuda_simple/Dockerfile` - a simple Dockerfile for CUDA accelerated CuBLAS, where the model is located outside the Docker image
+-`cd ./cuda_simple`
+-`docker build -t cuda_simple .`
+-`docker run -e USE_MLOCK=0 -e MODEL=/var/model/<model-path> -v <model-root-path>:/var/model -t cuda_simple`
+where `<model-root-path>/<model-path>` is the full path to the model file on the Docker host system.
+
+# "Bot-in-a-box" - a method to build a Docker image by choosing a model to be downloaded and loading into a Docker image
+-`cd ./auto_docker`:
+-`hug_model.py` - a Python utility for interactively choosing and downloading the latest `5_1` quantized models from [huggingface.co/TheBloke](https://huggingface.co/TheBloke)
 -`Dockerfile` - a single OpenBLAS and CuBLAS combined Dockerfile that automatically installs a previously downloaded model `model.bin`
 
-# Get model from Hugging Face
+##Get model from Hugging Face
 `python3 ./hug_model.py`
 
 You should now have a model in the current directory and `model.bin` symlinked to it for the subsequent Docker build and copy step. e.g.
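
Editorial sketch, not part of the diff: the README text above expects `model.bin` to be a symlink to the downloaded model before the Docker build. A minimal Python check of that layout might look like the following. The function name `resolve_model_link` is hypothetical, and nothing here is taken from `hug_model.py`, whose contents this diff does not show.

```python
from pathlib import Path


def resolve_model_link(directory: str, link_name: str = "model.bin") -> Path:
    """Return the file that `link_name` points to, or raise if the layout is wrong.

    Hypothetical helper: it only inspects the filesystem and assumes the
    symlink name `model.bin` mentioned in the README.
    """
    link = Path(directory) / link_name
    # is_symlink() is False both for a missing path and for a regular file.
    if not link.is_symlink():
        raise FileNotFoundError(f"{link} is missing or is not a symlink")
    # resolve() follows the link; a dangling link resolves to a path that does not exist.
    target = link.resolve()
    if not target.is_file():
        raise FileNotFoundError(f"{link} points to {target}, which is not a file")
    return target
```

Calling `resolve_model_link(".")` from the directory where the model was downloaded would return the resolved model path, or raise `FileNotFoundError` if the symlink is absent or dangling.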