Hetu-Galvatron Public
Forked from PKU-DAIR/Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).
Python · Apache License 2.0 · Updated Dec 22, 2024
Hetu Public
Forked from PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
Python · Apache License 2.0 · Updated Nov 4, 2024
sona Public
Forked from Angel-ML/sona
Spark On Angel, arming Spark with a powerful Parameter Server, which enables Spark to train very big models.
Scala · Apache License 2.0 · Updated Oct 14, 2019
SketchML Public
Accelerating Distributed Machine Learning with Data Sketches
angel Public
Forked from Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning