8000 refactor: refactor dynamo serve part-1/N by biswapanda · Pull Request #788 · ai-dynamo/dynamo · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

refactor: refactor dynamo serve part-1/N #788

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 29 commits into from
Apr 25, 2025
Merged

Conversation

biswapanda
Copy link
Contributor
@biswapanda biswapanda commented Apr 23, 2025

Overview:

This PR is part 1/N to refactor dynamo serve path and introduce loose coupling with circusd

  • enable api fastapi based web server. note: fastapi based service is for hello world demo only, we're using rust based http server in llm example.
  • functional llm/hello world examples

credits: related PR #721 from @ishandhanani

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

@biswapanda biswapanda changed the title refactor: dynamo clean up feat: remove bento dependency from dynamo serve Apr 23, 2025
@biswapanda biswapanda force-pushed the bis/dynamo-http-api branch from 31173c2 to c01e8fe Compare April 23, 2025 21:22
@biswapanda biswapanda force-pushed the bis/dynamo-http-api branch from cd5ba88 to 40911e8 Compare April 25, 2025 02:34
@ishandhanani ishandhanani enabled auto-merge (squash) April 25, 2025 02:36
@biswapanda biswapanda disabled auto-merge April 25, 2025 02:37
@biswapanda biswapanda enabled auto-merge (squash) April 25, 2025 02:38
@biswapanda biswapanda disabled auto-merge April 25, 2025 03:07
@biswapanda biswapanda enabled auto-merge (squash) April 25, 2025 03:09
@biswapanda biswapanda merged commit 16310b2 into main Apr 25, 2025
11 checks passed
@biswapanda biswapanda deleted the bis/dynamo-http-api branch April 25, 2025 12:10
Copy link
Contributor
@nnshah1 nnshah1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM - will also review after additional refactors

Comment on lines +53 to +56
dynamo={
"enabled": True,
"namespace": "dynamo",
},
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@biswapanda was this a newly required change for Frontend to work with these refactor changes?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. On trtllm example, it seems like http and llmctl commands are no longer getting run -- I noticed you didn't add the dynamo service section in examples/tensorrt_llm/components/frontend.py
  2. After I added this to try to fix it locally, I'm getting errors now:
2025-04-30T01:10:50.985Z  INFO serve_dynamo.worker: [Frontend:1] Registering component dynamo/Frontend   
2025-04-30T01:10:50.985Z  INFO serve_dynamo.worker: [Frontend:1] Created Frontend component   
2025-04-30T01:10:50.986Z ERROR serve_dynamo.worker: [Frontend:1] FATAL ERROR: No Dynamo endpoints found in service Frontend!   
2025-04-30T01:10:50.986Z ERROR serve_dynamo.worker: [Frontend:1] Error in Dynamo component setup: FATAL ERROR: No Dynamo endpoints found in service Frontend!   
2025-04-30T01:10:50.986Z ERROR serve_dynamo: [Frontend:1] Error in main: FATAL ERROR: No Dynamo endpoints found in service Frontend!   

Seems like these changes broke the usage of http+llmctl as frontend if no dynamo endpoint is defined in it?

This was referenced Apr 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants
0