@@ -279,7 +280,7 @@ case visit our [customization guide](docs/model_customization.md).
 
 To run in a python enviroment, use the generate subcommand like before, but include the dso file.
 
-```
+```bash
 python3 torchchat.py generate llama3.1 --dso-path exportedModels/llama3.1.so --prompt "Hello my name is"
 ```
 **Note:** Depending on which accelerator is used to generate the .dso file, the command may need the device specified: `--device (cuda | cpu)`.
@@ -292,9 +293,14 @@ To run in a C++ enviroment, we need to build the runner binary.
 scripts/build_native.sh aoti
 ```
 
-Then run the compiled executable, with the exported DSO from earlier.
+To compile the AOTI generated artifacts into a `.so`:
+```bash
+make -C exportedModels/llama3_1_artifacts
+```
+
+Then run the compiled executable, with the compiled DSO.
 ```bash
-cmake-out/aoti_run exportedModels/llama3.1.so -z `python3 torchchat.py where llama3.1`/tokenizer.model -l 3 -i "Once upon a time"
+cmake-out/aoti_run exportedModels/llama3_1_artifacts/llama3_1_artifacts.so -z `python3 torchchat.py where llama3.1`/tokenizer.model -l 3 -i "Once upon a time"
 ```
 **Note:** Depending on which accelerator is used to generate the .dso file, the runner may need the device specified: `-d (CUDA | CPU)`.
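
The updated C++ flow in this hunk has two steps: compile the exported artifacts with `make -C`, then pass the resulting `.so` to `aoti_run`. A minimal Python sketch of that sequence follows. The helper name `aoti_commands` is hypothetical and not part of torchchat, and the rule that the library is named after its artifacts directory is an assumption inferred from the single example path in the diff.

```python
from pathlib import PurePosixPath
from typing import List, Optional


def aoti_commands(artifacts_dir: str, tokenizer_path: str, prompt: str,
                  device: Optional[str] = None) -> List[List[str]]:
    """Return the build and run invocations as argument lists (hypothetical helper)."""
    artifacts = PurePosixPath(artifacts_dir)
    # Assumption: the compiled library shares its name with the artifacts
    # directory, as in exportedModels/llama3_1_artifacts/llama3_1_artifacts.so.
    dso = artifacts / f"{artifacts.name}.so"

    # Step 1: compile the AOTI generated artifacts into a .so.
    build = ["make", "-C", str(artifacts)]

    # Step 2: run the compiled executable with the compiled DSO.
    run = ["cmake-out/aoti_run", str(dso), "-z", tokenizer_path,
           "-l", "3", "-i", prompt]
    if device is not None:
        # The runner may need the device specified: -d (CUDA | CPU).
        run += ["-d", device]
    return [build, run]
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` in order. The tokenizer path is taken as a parameter here; in the shell command it comes from the output of `python3 torchchat.py where llama3.1`.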
-"code generation needs to choose different implementations for DSO and PTE path. Please only use one export option, and call export twice if necessary!"
+"code generation needs to choose different implementations for AOTI and PTE path. Please only use one export option, and call export twice if necessary!"

-"code generation needs to choose different implementations for DSO and PTE path. Please only use one export option, and call export twice if necessary!"
+"code generation needs to choose different implementations for AOTI and PTE path. Please only use one export option, and call export twice if necessary!"