Skip to content

Understanding memory usage #1876

Answered by africalimedrop
siddhsql asked this question in Q&A
Discussion options

You must be logged in to vote

Suspect this will help answer; bolded relevant part:

  • --no-mmap: Do not memory-map the model. By default, models are mapped into memory, which allows the system to load only the necessary parts of the model as needed. However, if the model is larger than your total amount of RAM or if your system is low on available memory, using mmap might increase the risk of pageouts, negatively impacting performance. Disabling mmap results in slower load times but may reduce pageouts if you're not using --mlock. Note that if the model is larger than the total amount of RAM, turning off mmap would prevent the model from loading at all.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by KerfuffleV2
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants