Don't perform memory check if client sets use_mmap true. #8895

rick-github · 2025-02-06T19:44:25Z

If the client overrides use_mmap, don't prevent the model from loading due to apparent over-commit.

On Linux, a mmap'd file doesn't use swap backing store unless modified, so there's no need for the check. Windows has dynamic swap and so falls i 8000 n to the same bucket as darwin. Inference on deepseek-r1:671b-1.5b runs at ~0.15 t/s where the model requires swap on SSD, ~0.3 t/s with mmap instead of swap on the the same SSD, and ~1.4 t/s when the model is mapped on an NVME drive.

Also add OLLAMA_USE_MMAP for global configuration.

DrShadow34 · 2025-03-16T20:48:51Z

Any chance that will be merged in one regular human lifetime?

jmv2009 · 2025-03-20T15:57:18Z

This one I circumvent with generating a large zram swap, which is useful anyway. I normally load the models into a ramdrive anyway on a live linux. Then the models are already in memory, and duplication is avoided with mmap. I actually need to modify line 213 as well to not get no_mmap.

In this scenario line 213 acts insane: If the model is so small that it fits again into memory, it works, but uses mmap, and it actually does not duplicate and does not end up using that extra memory. If the model is so big that it does not fit again, it does not use mmap, and, with the zram swap, runs out of memory.

Don't perform memory check of client sets use_mmap true.

e5e1ded

BruceMacD requested a review from dhiltgen February 6, 2025 21:01

This was referenced Feb 7, 2025

deepseek-r1:671b Q4_K_M: error="model requires more system memory (446.3 GiB) than is available #8667

Open

Available memory check should be disabled when mmap is in use #8654

Open

rick-github and others added 8 commits May 1, 2025 22:57

Merge branch 'ollama:main' into mmap

ec0ef40

Merge branch 'ollama:main' into mmap

836153d

Merge branch 'ollama:main' into mmap

a374fbd

Merge branch 'ollama:main' into mmap

93113a7

Add environment variable OLLAMA_USE_MMAP.

eba78f9

Merge branch 'ollama:main' into mmap

4d5ba52

Merge branch 'mmap' of https://github.com/rick-github/ollama into mmap

22c7219

Merge branch 'ollama:main' into mmap

1c8af56

rick-github mentioned this pull request Jun 6, 2025

Allow "use_mmap" to be set at a global level using enviroment variables. #10539

Open

rick-github added 2 commits June 17, 2025 15:40

Merge branch 'ollama:main' into mmap

583a8ac

Merge branch 'ollama:main' into mmap

295a80e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Don't perform memory check if client sets use_mmap true. #8895

Don't perform memory check if client sets use_mmap true. #8895

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Don't perform memory check if client sets use_mmap true. #8895

Are you sure you want to change the base?

Don't perform memory check if client sets use_mmap true. #8895

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!