-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Support Qwen3 and Qwen3MoE #12828
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Qwen3 and Qwen3MoE #12828
Conversation
Don't see information about Qwen3 - maybe not published yet? |
We’ll update the blog once the model is officially released—hopefully very soon! |
Qwen3 is the nextgen AI |
Excited for it. Hope it has native multimodal support and a huge boost in creative writing (lacks in that department imo) Anyways, kudos on implementing support so early! Others should take note. |
lol awesome |
Good guy Devs, added day 1 support, Thank you! |
It most likely won't. In the transformers commit they had no processors, and if I understand correctly, no vision here. We'll have to wait some for vision modules to be integrated |
Hopefully it will be a native lean causal inference model rather than a bloated multimodal model, |
🔥 day-0 support for Qwen3 + Qwen3MoE, looking forward to the release of the weight!! |
Why not both. Gemma 3 is native multimodal, but you don't have to download the mmproj adapter, so there's no bloat for those who don't care about vision. And more importantly, pretraining on images allows the model to get more information about the world, enhancing its general performance. |
* add qwen3 & qwen3moe support. * fix --------- Co-authored-by: bozheng-hit <[email protected]>
* add qwen3 & qwen3moe support. * fix --------- Co-authored-by: bozheng-hit <[email protected]>
* add qwen3 & qwen3moe support. * fix --------- Co-authored-by: bozheng-hit <[email protected]>
* add qwen3 & qwen3moe support. * fix --------- Co-authored-by: bozheng-hit <[email protected]>
* add qwen3 & qwen3moe support. * fix --------- Co-authored-by: bozheng-hit <[email protected]>
Adding Qwen3
This PR adds the support of codes for the coming Qwen3 and Qwen3MoE models. For information about Qwen, please visit https://github.com/QwenLM/Qwen2.5. @ggerganov