gloo: fix building system gloo with CUDA/HIP by nlbrown2 · Pull Request #146637 · pytorch/pytorch · GitHub
gloo: fix building system gloo with CUDA/HIP #146637


Closed
nlbrown2 wants to merge 1 commit


nlbrown2 (Contributor) commented Feb 6, 2025

Fix incorrect linking of Gloo's libraries when building with system Gloo. Previously, either Gloo's native library or Gloo's CUDA library was linked, but never both. However, Gloo has since changed so that all users of Gloo must link the native library and may additionally link the CUDA or HIP library for Gloo + CUDA/HIP support.
This had already been updated for builds that link vendored Gloo, but not for builds that use system Gloo.
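
The linking rule described above can be sketched as a small selection function. This is only an illustration of the logic, not the CMake change in this PR: the function names are hypothetical, and the library names `gloo`, `gloo_cuda`, and `gloo_hip` are assumptions used for the example.

```python
from typing import List, Optional


def old_gloo_link_libraries(gpu_backend: Optional[str] = None) -> List[str]:
    """Model of the behavior described as incorrect: link either the
    native library or the CUDA library, never both.

    Library names are illustrative assumptions.
    """
    if gpu_backend == "cuda":
        return ["gloo_cuda"]  # native library is missing here
    return ["gloo"]


def gloo_link_libraries(gpu_backend: Optional[str] = None) -> List[str]:
    """Model of the intended behavior: always link the native library,
    then optionally add the CUDA or HIP library.

    Library names are illustrative assumptions.
    """
    libs = ["gloo"]  # the native library is required for every user of Gloo
    if gpu_backend == "cuda":
        libs.append("gloo_cuda")
    elif gpu_backend == "hip":
        libs.append("gloo_hip")
    elif gpu_backend is not None:
        raise ValueError(f"unsupported GPU backend: {gpu_backend!r}")
    return libs


if __name__ == "__main__":
    for backend in (None, "cuda", "hip"):
        print(backend, gloo_link_libraries(backend))
```

In this model the old selection for a CUDA build omits the native library, which is consistent with the kind of undefined-reference link error reported in the linked issue.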

Fixes: #146239

Reported-by: Adam J Stewart <ajstewart426@gmail.com>

cc @malfet @seemethere @ptrblck @msaroufim @eqy @jerryzh168

Signed-off-by: Nathan Brown <nathan.brown@arm.com>
pytorch-bot (bot) commented Feb 6, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/146637

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e9bfe6f with merge base 5d81bc3:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

nlbrown2 (Contributor, Author) commented Feb 6, 2025

@pytorchbot label "topic: not user facing"

pytorch-bot (bot) added the "topic: not user facing" label Feb 6, 2025
cpuhrsch requested a review from malfet February 8, 2025 01:39
cpuhrsch added the "module: build", "module: cuda", and "triaged" labels Feb 8, 2025
nlbrown2 (Contributor, Author) commented

Hello,
Any updates on this PR? Any desired changes?

Thanks,
Nathan

adamjstewart (Contributor) commented

Pinging for review

Contributor

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

github-actions (bot) added the Stale label Jun 15, 2025
adamjstewart (Contributor) commented

Waiting on @malfet or others for review

msaroufim requested a review from d4l3k June 15, 2025 15:54
github-actions (bot) closed this Jul 15, 2025
adamjstewart (Contributor) commented

Still waiting for review...

Labels
module: build (Build system issues)
module: cuda (Related to torch.cuda, and CUDA support in general)
open source
Stale
topic: not user facing (topic category)
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Development

Successfully merging this pull request may close these issues.

torch_shm_manager: undefined reference to gloo
4 participants