feat: Add TensorRT-LLM example for dynamo serve/run by tanmayv25 · Pull Request #456 · ai-dynamo/dynamo · GitHub

feat: Add TensorRT-LLM example for dynamo serve/run #456

Merged
14 commits merged on Apr 3, 2025

@tanmayv25 tanmayv25 commented Apr 1, 2025

Overview:

Adds back limited TensorRT-LLM examples that were removed by this commit: 9aedc85

These new examples follow the dynamo serve SDK and can start the processes automatically for the user.

Details:

The TRT-LLM examples were removed because we wanted to put our best UX forward. The outdated examples still reside in the trtllm_llmapi_v1 branch. I am adding these examples back to the main repository so they are more accessible.

Currently, these changes work and have been tested only with the aggregated example on single-node, single-GPU deployments.

TODO:

  1. Support single-node multiple GPUs with MPI launches in dynamo serve
  2. Support disaggregated serving on a single-node system.

Where should the reviewer start?

Start with the components folder within the example. The example closely follows the vLLM example.

copy-pr-bot bot commented Apr 1, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@tanmayv25 tanmayv25 requested review from nnshah1 and ryanolson April 2, 2025 19:06
@tanmayv25 tanmayv25 requested a review from NVShreyas April 2, 2025 22:00
@tanmayv25 tanmayv25 requested a review from nnshah1 April 2, 2025 22:13
@nnshah1 nnshah1 left a comment

Great start! Will save further cleanup for a future TODO.

@tanmayv25 tanmayv25 enabled auto-merge (squash) April 2, 2025 23:47
@tanmayv25 tanmayv25 merged commit fe597df into main Apr 3, 2025
6 checks passed
@tanmayv25 tanmayv25 deleted the tanmayv-trtllm branch April 3, 2025 19:08
kylehh pushed a commit to kylehh/dynamo that referenced this pull request Apr 11, 2025
Co-authored-by: Neelay Shah <neelays@nvidia.com>