-
Notifications
You must be signed in to change notification settings - Fork 388
feat: Add unified x86 / aarch64 (ARM) build for TRTLLM image #803
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Manually verified build/run/test on both Linux ARM64 and x86 machines with this change, see PR description for details @saturley-hall @dmitry-tokarev-nv |
Signed-off-by: Ryan McCormick <rmccormick@nvidia.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM - few suggestions / questions
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Question - Is there official or something x86 equivalent trtllm base image for arm build?
Also jfyi, triggered gitlab with trtllm job - should validate default x86 flow
Overview:
Adds ARM64 support to TRTLLM docker build via docker build ARGS supplied from
--platform linux/arm64
:By default, the build args are set to x86/amd64 equivalents.
Details
ARM
Manually verified build, run, and inference on an A100 Linux ARM machine
Build
Run
Inference
Reproducibility
x86
Build
Run
# Same as ARM steps above
Inference
# Same as ARM steps above
Reproducibility