Popular repositories Loading
-
Megatron-DeepSpeed-TT
Megatron-DeepSpeed-TT PublicForked from deepspeedai/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python
-
-
astra-sim 3CAC
astra-sim PublicForked from astra-sim/astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
C++
-
chakra
chakra PublicForked from mlcommons/chakra
Repository for MLCommons Chakra schema and tools
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.