You may want to pass in some different `ARGS`, depending on the CUDA environment supported by your container host, as well as the GPU architecture.
The defaults are:

- `CUDA_VERSION` set to `11.7.1`
- `CUDA_DOCKER_ARCH` set to `all`
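For illustration, a minimal sketch of overriding these defaults at build time. The image tag and Dockerfile path assume the repository's `.devops` layout, and the specific `CUDA_VERSION` and `CUDA_DOCKER_ARCH` values below are placeholders, not recommendations:

```bash
# Example: build the full CUDA image with non-default build arguments.
# The version and architecture values are placeholders; pick ones matching
# your host's CUDA toolkit and your GPU (e.g. sm_86 for an RTX 30-series card).
docker build -t local/llama.cpp:full-cuda \
  --build-arg CUDA_VERSION=12.1.1 \
  --build-arg CUDA_DOCKER_ARCH=sm_86 \
  -f .devops/full-cuda.Dockerfile .
```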
The resulting images are essentially the same as the non-CUDA images:
1. `local/llama.cpp:full-cuda`: This image includes both the main executable file and the tools to convert LLaMA models into ggml and convert into 4-bit quantization.
2. `local/llama.cpp:light-cuda`: This image only includes the main executable file.
#### Usage
After building locally, usage is similar to the non-CUDA examples, but you'll need to add the `--gpus` flag. You will also want to use the `--n-gpu-layers` flag.
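As a sketch of a typical invocation with the light image (the model path, filename, and prompt are placeholders): `--gpus all` exposes the host GPUs to the container, and `--n-gpu-layers` controls how many model layers are offloaded to the GPU.

```bash
# Placeholder model path and prompt; adjust to your setup.
docker run --gpus all -v /path/to/models:/models local/llama.cpp:light-cuda \
  -m /models/7B/ggml-model-q4_0.bin \
  -p "Building a website can be done in 10 simple steps:" \
  --n-gpu-layers 32
```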