Skip to content

glm4-4-0414 : add Glm4Model implementation for GLM-4-0414 #12867

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 12 commits into from
Apr 11, 2025

Conversation

zRzRzRzRzRzRzR
Copy link
Contributor

This PR adds the implementation of Glm4Model, based on the open-source model GLM-4-0414 by Zhipu AI, scheduled for release on April 14, 2025.

The GLM-4 family is a multilingual, multitask autoregressive language model developed by Zhipu AI. The GLM-4-0414 checkpoint represents one of the most recent and publicly accessible variants of the series.

References

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR requested a review from ngxson as a code owner April 10, 2025 08:43
@github-actions github-actions bot added examples python python script changes server labels Apr 10, 2025
@ngxson
Copy link
Collaborator

ngxson commented Apr 10, 2025

I tested https://huggingface.co/THUDM/glm-4-9b-hf, it works. Just to be sure, you don't yet released instruction-tuned model, right? Sorry I haven't notice, the release is 14th april

This PR can be merged once you fix the failed CI job

@zRzRzRzRzRzRzR
Copy link
Contributor Author

Yes, the model for 0414 has not been released yet.

@ngxson ngxson merged commit 06bb53a into ggml-org:master Apr 11, 2025
53 checks passed
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Apr 11, 2025
…2867)

* GLM-4-0414

* use original one

* Using with tensor map

* fix bug

* change order

* change order

* format with flask8
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Apr 12, 2025
…2867)

* GLM-4-0414

* use original one

* Using with tensor map

* fix bug

* change order

* change order

* format with flask8
colout pushed a commit to colout/llama.cpp that referenced this pull request Apr 21, 2025
…2867)

* GLM-4-0414

* use original one

* Using with tensor map

* fix bug

* change order

* change order

* format with flask8
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
examples python python script changes server
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants