8000 Add SM100/B200 support for torch._grouped_mm · Issue #155434 · pytorch/pytorch · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
Add SM100/B200 support for torch._grouped_mm #155434
Open
@drisspg

Description

@drisspg

Summary

torch._grouped_mm currently only supports compute capability 9.0, but should also support SM100/B200 cards.

Current Behavior

torch._grouped_mm(...)
# Error: torch._grouped_mm is only supported on CUDA devices with compute capability = 9.0

Expected Behavior

torch._grouped_mm should work on SM100/B200 hardware (compute capability 9.0+).

Environment

  • Hardware: SM100/B200 (Blackwell)
  • PyTorch: current main

cc @ptrblck @msaroufim @eqy @jerryzh168

Metadata

Metadata

Assignees

No one assigned

    Labels

    BlackwellSpecific failures or issues related to sm100 + Cuda archesmodule: cudaRelated to torch.cuda, and CUDA support in generaltriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0