feat: Qwen3, Gemma3 and Llama4 support #1002
Merged
- New mistralrs and llamacpp versions
- mistralrs: handle Gemma 3 and Llama 4 as vision models
- Update the dynamo-run docs to use Qwen 3
- Our pre-processor now supports Llama 4's newer multi-modal config.json
- Upgrade minijinja to handle Qwen 3's prompt template
For Llama 4 we'll need to limit the max seq len. vLLM says:
I was able to run Llama 4 with llamacpp and a quantized GGUF, with Dynamo doing the pre-processing.