Skip to content

Commit 1af511f

Browse files
authored
Add convert.py removal to hot topics (#7662)
1 parent 0541f06 commit 1af511f

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,8 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)
2222

2323
### Hot topics
2424

25-
- **Initial Flash-Attention support: https://github.com/ggerganov/llama.cpp/pull/5021**
25+
- **`convert.py` has been deprecated and moved to `examples/convert-legacy-llama.py`, please use `convert-hf-to-gguf.py` https://github.com/ggerganov/llama.cpp/pull/7430
26+
- Initial Flash-Attention support: https://github.com/ggerganov/llama.cpp/pull/5021
2627
- BPE pre-tokenization support has been added: https://github.com/ggerganov/llama.cpp/pull/6920
2728
- MoE memory layout has been updated - reconvert models for `mmap` support and regenerate `imatrix` https://github.com/ggerganov/llama.cpp/pull/6387
2829
- Model sharding instructions using `gguf-split` https://github.com/ggerganov/llama.cpp/discussions/6404

0 commit comments

Comments
 (0)