Skip to content

correction of the attn.v.weight quantization for IQ3_XS #6209

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Mar 22, 2024

Conversation

Nexesenex
Copy link
Contributor

@Nexesenex Nexesenex commented Mar 21, 2024

IQ3_XS was not mentioned in the quant strategy list for the attn.v.weight tensor, IQ3_S and IQ3_M were present twice.

That PR corrects this in the manner which was probably intended initially.

IQ3_XS was not mentioned, IQ3_S and IQ3_M were present twice.

That PR corrects this in the manner which was probably intended initially.
@ggerganov ggerganov requested a review from ikawrakow March 22, 2024 11:10
Copy link
Contributor

@ikawrakow ikawrakow left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess, this is what I meant to do. Thanks!

@ggerganov ggerganov merged commit e80f06d into ggml-org:master Mar 22, 2024
@Nexesenex Nexesenex deleted the patch-1 branch March 23, 2024 16:58
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
…-org#6209)

IQ3_XS was not mentioned, IQ3_S and IQ3_M were present twice.

That PR corrects this in the manner which was probably intended initially.
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 3, 2024
…-org#6209)

IQ3_XS was not mentioned, IQ3_S and IQ3_M were present twice.

That PR corrects this in the manner which was probably intended initially.
tybalex pushed a commit to rubra-ai/tools.cpp that referenced this pull request Apr 17, 2024
…-org#6209)

IQ3_XS was not mentioned, IQ3_S and IQ3_M were present twice.

That PR corrects this in the manner which was probably intended initially.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants