
convert: remove most of the n_mult usage in convert.py #3098


Merged · 3 commits · Sep 10, 2023

Conversation

Green-Sky (Collaborator)

A little bit of cleanup of n_mult: it is only used to calculate n_ff, and the formula only works for LLaMA and close derivatives (e.g. not for Falcon).
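For context, the n_ff that n_mult feeds into is the LLaMA feed-forward hidden size. A minimal sketch of that calculation, assuming the parameter names from the original LLaMA reference code (the exact convert.py wiring may differ):

```python
def llama_n_ff(n_embd, multiple_of, ffn_dim_multiplier=None):
    """LLaMA-style feed-forward hidden size (n_ff).

    multiple_of corresponds to what convert.py called n_mult.
    """
    hidden = 4 * n_embd
    hidden = int(2 * hidden / 3)
    if ffn_dim_multiplier is not None:
        hidden = int(ffn_dim_multiplier * hidden)
    # round up to the next multiple of multiple_of
    return multiple_of * ((hidden + multiple_of - 1) // multiple_of)

# e.g. LLaMA-7B: n_embd=4096, multiple_of=256 -> n_ff=11008
```

Falcon does not follow this formula, which is why the n_mult-based derivation only holds for LLaMA-like models.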

@Green-Sky Green-Sky changed the title convert: remove most n_mult usage convert: remove most of the n_mult usage in convert.py Sep 9, 2023
@Green-Sky Green-Sky force-pushed the convert_reduce_n_mult branch from f862b9e to ecd7bed Compare September 9, 2023 15:51
@Green-Sky Green-Sky requested a review from cebtenzzre September 9, 2023 15:53
KerfuffleV2 (Collaborator)

Any reason not to remove the find_n_mult function? Nothing uses it after your changes.

Green-Sky (Collaborator, Author)

Any reason not to remove the find_n_mult function? Nothing uses it after your changes.

hm true. ... time to remove my hack then :)
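For reference, the removed helper brute-forced an n_mult that reproduces a known n_ff from n_embd; a hedged reconstruction of that kind of search (the exact range and constants in convert.py may differ):

```python
def find_n_mult(n_ff, n_embd):
    # try candidate multiples from large to small until one rounds
    # (8 * n_embd) // 3 up to exactly n_ff
    for n_mult in range(8192, 1, -1):
        calc_ff = (((8 * n_embd) // 3 + n_mult - 1) // n_mult) * n_mult
        if calc_ff == n_ff:
            return n_mult
    raise ValueError(f"failed to find n_mult for n_ff={n_ff}, n_embd={n_embd}")
```

Because several candidates can reproduce the same n_ff, the value found is not necessarily the original multiple_of, which is part of what made this a hack.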

goerch (Collaborator) commented Sep 9, 2023

Hm, a layman's view: I know of multiple_of as an input for computing the hidden dimensions; is this mapped to ffn_dim_multiplier now? If you are going to support different model classes, I'd find it easier to follow if you back-referenced the original models.

@Green-Sky Green-Sky force-pushed the convert_reduce_n_mult branch from bda804a to 2f50a58 Compare September 10, 2023 11:52
Green-Sky (Collaborator, Author)

Hm, a layman's view: I know of multiple_of as an input for computing the hidden dimensions; is this mapped to ffn_dim_multiplier now? If you are going to support different model classes, I'd find it easier to follow if you back-referenced the original models.

I am not sure where they come from, but yes, they AND the formulas describe a relationship in the architecture. But multiple_of could be a different value too; there are always multiple solutions. They also seem to be specific to LLaMA: Falcon, for example, just uses 4 * hidden_size. Hugging Face models don't contain this information at all; they just have the intermediate_size param.
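The "multiple solutions" point is easy to demonstrate. A small sketch using LLaMA-7B numbers (the helper name is hypothetical, not convert.py code):

```python
def n_ff_from(n_embd, multiple_of):
    # LLaMA-style rounding of the 2/3 * 4 * n_embd hidden size
    hidden = int(2 * (4 * n_embd) / 3)
    return multiple_of * ((hidden + multiple_of - 1) // multiple_of)

# Which multiple_of values reproduce LLaMA-7B's n_ff of 11008?
solutions = [m for m in range(1, 512) if n_ff_from(4096, m) == 11008]
# several distinct values work, so multiple_of cannot be recovered
# uniquely from n_ff; Falcon, by contrast, just uses n_ff = 4 * hidden_size
```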

@cebtenzzre cebtenzzre merged commit 6eeb4d9 into ggml-org:master Sep 10, 2023