You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please refer to [this tutorial](https://pytorch.org/executorch/main/llm/llama-demo-android.html) to for full instructions on building the Android LLAMA Demo App.
201
+
199
202
# What is coming next?
200
203
## Quantization
201
204
- Enabling FP16 model to leverage smaller groupsize for 4-bit quantization.
0 commit comments