-
Notifications
You must be signed in to change notification settings - Fork 12.1k
Option to split during conversion #6942
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
73 commits
Select commit
Hold shift + click to select a range
874c341
support splits in convert.py
christianazinn 72cbd4e
Support split by size and dry run to write estimated shards/filesizes
christianazinn 702a744
Move split functionality to new GGUFManager class
christianazinn c33bdf3
fix improper function signature
christianazinn b7c6120
tentative push of convert-hf-to-gguf support
christianazinn 14b3291
Merge branch 'master' into convert-split
mofosyne 87a98a5
resolve merge + SplitArguments for easier parsing
christianazinn 2dd7841
Merge remote-tracking branch 'origin' into convert-split
christianazinn 3ff27ef
Fix eager tensor memory leak and remove convert.py changes
christianazinn 6b5c375
refactor SplitStrategy to be a deque
christianazinn 09baf2f
fix Q8 quantization
christianazinn 240243e
remove unnecessary imports in gguf_manager
christianazinn 140eb52
Merge branch 'master' into convert-split
christianazinn a9c7703
fix final? merge issue
christianazinn efead04
fix gguf_writer placement and remove comments
christianazinn c8ecbc6
oops, actually fix gguf_writer placement
christianazinn 3e9430d
reduce duplicated code from gguf_writer
christianazinn f6fd3ea
further simplify GGUFManager
christianazinn bb5ee02
simplify even further and standardize with GGUFWriter
christianazinn 5ad397d
reduce diffs with master
christianazinn ce7e698
form shards while adding tensors, SHA256 sums agree with master
christianazinn 706bd69
re-add type hint
christianazinn 6a05183
GGUFWriter compatibility fix
christianazinn 3328b0a
Shard dataclass and un-negative dont_add_architecture
christianazinn 1cbab22
type consistency in format_n_bytes_to_str
christianazinn 2037eab
move kv keys to constants.py
christianazinn 83e4a3f
make pathlib explicit
christianazinn 13ffe22
base-1024 bytes to base-1000
christianazinn 6d3a256
rename GGUFManager to GGUFWriterSplit
christianazinn 1312e28
Update gguf-py/gguf/constants.py
christianazinn 5f29d4a
fix convert-hf-to-gguf.py permissions
christianazinn 0283fc1
fix line endings
christianazinn dc5cf5f
Update gguf-py/gguf/gguf_writer_split.py
christianazinn e093dfb
convert-hf : restore executable file permission
compilade 9576965
examples/convert-legacy-llama.py: restore executable file permission
christianazinn c6ae1d6
reinstate original gguf package import and fix type annotation
christianazinn 2e70fa1
attempt to appease the linter
christianazinn 891b19c
attempt 2 to appease the linter
christianazinn 02be0dd
attempt 3 to appease the linter
christianazinn f658e91
comma consistency
christianazinn 079dfe3
Update convert-hf-to-gguf.py
christianazinn 282e71f
edit cmd line args
christianazinn 666bb09
Merge branch 'master' into convert-split
christianazinn 03cc9bc
use simplification from #7827
christianazinn 97dd416
kv/ti data are still wrong
christianazinn ff2dd7d
try to refactor kv data (still fails)
christianazinn ba1be97
fix ti data messiness
christianazinn 69d6e7a
Merge branch 'master' into convert-split
christianazinn 0779f2f
tidy up
christianazinn a234bf8
fix linting
christianazinn 49b9fbe
actually make the linter happy
christianazinn 0471f67
cleanup round 1
christianazinn 5a96b8f
remove SplitStrategy, SplitArguments
christianazinn f7ecd99
appease linter
christianazinn 9d7f694
fix typing and clean up
christianazinn 0417104
fix linting
christianazinn 70a6bc9
Update gguf-py/gguf/gguf_writer.py
christianazinn 1e2d9cb
progress bar, fix split logic
christianazinn f7e7983
Update gguf-py/gguf/gguf_writer.py
christianazinn 79bd2bf
catch oversights
christianazinn 7eea552
Update gguf-py/gguf/gguf_writer.py
christianazinn 99f9a24
Update gguf-py/gguf/gguf_writer.py
christianazinn ad02c94
Update gguf-py/gguf/gguf_writer.py
christianazinn c1b1a29
Update gguf-py/gguf/gguf_writer.py
christianazinn 4550826
Update gguf-py/gguf/gguf_writer.py
christianazinn efa0609
swap bar orders
christianazinn b843445
Update gguf-py/gguf/gguf_writer.py
christianazinn 854bd64
Update gguf-py/gguf/gguf_writer.py
christianazinn 05b183f
compatibility fix
christianazinn e9895d2
Update gguf-py/gguf/gguf_writer.py
christianazinn 4e4e376
Merge branch 'master' into convert-split
christianazinn 163712e
Update convert-hf-to-gguf.py
mofosyne 6e4182c
Merge branch 'master' into convert-split
christianazinn File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
christianazinn marked this conversation as resolved.
Show resolved
Hide resolved
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.