Release Dynamo Release v0.3.1 · ai-dynamo/dynamo · GitHub

Dynamo Release v0.3.1

nv-anants released this on 01 Jul 17:59 · 1 commit to release/0.3.1 since this release · e117295

Dynamo is an open-source project released under the Apache 2.0 license. It is distributed primarily as pip wheels with minimal binary size. The ai-dynamo GitHub organization hosts two repositories: Dynamo and NIXL. Dynamo is designed as the next-generation inference server, building on the foundation of NVIDIA® Triton Inference Server™. Triton focuses on single-node inference deployments, and we are integrating its capabilities into Dynamo over the next several months. We will continue to support Triton and will provide a clear migration path for existing users once Dynamo achieves feature parity.

As a vendor-neutral serving framework, Dynamo supports multiple large language model (LLM) inference engines to varying degrees:

  • NVIDIA TensorRT-LLM
  • vLLM
  • SGLang

Dynamo v0.3.1 features:

  • Functional DeepSeek R1 disaggregated serving with wide expert parallelism (EP) using SGLang
  • Functional encode/prefill/decode (EPD) disaggregation with a video model (Llava video 7B)
  • Proof-of-concept inference gateway support
  • Prebuilt Dynamo + vLLM container
    • We plan to release these prebuilt containers in the coming days
  • Amazon Linux support

Future Plans
See the Dynamo Roadmap.

Known Issues

  • KVBM (KV Block Manager) is supported only with Python 3.12
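
Given this constraint, a deployment script may want to confirm the interpreter version before enabling KVBM. The following is a minimal sketch, not part of Dynamo: the helper name `python_supports_kvbm` and the constant `REQUIRED_PYTHON` are hypothetical and exist only for illustration. It assumes the requirement means exactly Python 3.12 (major and minor version).

```python
import sys

# Hypothetical constant for illustration; taken from the known issue above.
REQUIRED_PYTHON = (3, 12)


def python_supports_kvbm(version_info=sys.version_info):
    """Return True when the major.minor version is exactly 3.12.

    This is an illustrative helper, not a Dynamo API.
    """
    return tuple(version_info[:2]) == REQUIRED_PYTHON


if __name__ == "__main__":
    if not python_supports_kvbm():
        found = f"{sys.version_info.major}.{sys.version_info.minor}"
        print(f"KVBM is supported only with Python 3.12; found Python {found}")
```

Passing the version tuple as a parameter keeps the check easy to test without switching interpreters.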

What's Changed

🚀 Features & Improvements

🐛 Bug Fixes

📚 Documentation

🛠️ Build, CI and Test

New Contributors

Full Changelog: v0.3.0...v0.3.1
