Add support for BitnetForCausalLM (new model / new datatype) #7931
Merged
Changes from 34 commits
38 commits
076b4a1  hf bitnet v1  (Eddie-Wang1120)
57dfc3b  hf bitnet e2e v2  (Eddie-Wang1120)
1f2e0ee  finish bitnet e2e  (Eddie-Wang1120)
5e59660  finish f16 hf bitnet e2e  (Eddie-Wang1120)
2a01a7c  remove unsed  (Eddie-Wang1120)
4e1ab50  finish bitnet i2 e2e  (Eddie-Wang1120)
ca09085  move i2s to quantize v1  (Eddie-Wang1120)
dbee0a8  move i2 to quantize
1c5a8b7  clean code
3a0f8b0  clean code 2
97d22be  fix codestyle  (Eddie-Wang1120)
344467f  fix code  (Eddie-Wang1120)
65ac3a3  fix  (Eddie-Wang1120)
abd798d  fix code  (Eddie-Wang1120)
841c903  Merge branch 'ggerganov:master' into bitnet  (Eddie-Wang1120)
c0fd4df  fix merge  (Eddie-Wang1120)
de1d507  remove unused  (Eddie-Wang1120)
2322e9d  Merge branch 'ggerganov:master' into bitnet  (Eddie-Wang1120)
c0cd08d  Merge branch 'ggerganov:master' into bitnet  (Eddie-Wang1120)
f395dd9  change table name  (Eddie-Wang1120)
5e5eee7  fix whitespace  (Eddie-Wang1120)
7a8961f  delete redundant  (Eddie-Wang1120)
95dced0  i2_s to absmax  (Eddie-Wang1120)
569a03e  finish i2_s/i8_s vec_dot x86 simd  (Eddie-Wang1120)
a03eff3  i2s->q22  (Eddie-Wang1120)
4edc958  fix code  (Eddie-Wang1120)
89c7e4c  remove block scale  (Eddie-Wang1120)
fcf2da4  add dequantize  (Eddie-Wang1120)
fa9a742  fix seq  (Eddie-Wang1120)
230396b  update avx2  (Eddie-Wang1120)
2b09768  remove q2_2  (Eddie-Wang1120)
a58cf0d  remove q22_grid  (Eddie-Wang1120)
abcdc50  Merge branch 'ggerganov:master' into bitnet  (Eddie-Wang1120)
c6ddfa7  fix whitespace  (Eddie-Wang1120)
55a57a5  reuse llm_build_kv  (Eddie-Wang1120)
0520d88  Merge branch 'ggerganov:master' into bitnet  (Eddie-Wang1120)
16f0c30  Merge branch 'ggerganov:master' into bitnet  (Eddie-Wang1120)
226c5ee  fix bo  (Eddie-Wang1120)