-
Amazon Web Services
- Bellevue Washington
- http://zhuangwang.org/
-
zhuangwang93.github.io Public
Forked from SebastinSanty/minimal-research-themeJust a plain, simple and elegant one-page theme for research/academia.
HTML UpdatedJun 24, 2025 -
-
mamba Public
Forked from state-spaces/mambaMamba SSM architecture
Python Apache License 2.0 UpdatedAug 3, 2024 -
LoRA Public
Forked from microsoft/LoRACode for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Python MIT License UpdatedJan 9, 2024 -
Espresso Public
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '23)
-
Gemini_SOSP23 Public
Artifact Evaluation of Gemini SOSP 2023
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedAug 22, 2023 -
Cupcake Public
Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)
-
dragonn Public
DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks (ICML '22)
-
research-course Public
Forked from noise-lab/research-course"How to Do Great Research" Course for Ph.D. Students
TeX Other UpdatedApr 12, 2023 -
byteps Public
Forked from bytedance/bytepsA high performance and generic framework for distributed DNN training
Python Other UpdatedAug 20, 2022 -
pytorch-image-models Public
Forked from huggingface/pytorch-image-modelsPyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Python Apache License 2.0 UpdatedJul 22, 2022 -
8000 -
Model-Compression-Papers Public
Forked from chester256/Model-Compression-PapersPapers for deep neural network compression and acceleration
UpdatedJun 21, 2021 -
cuda-samples Public
Forked from NVIDIA/cuda-samplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
C Other UpdatedMay 4, 2021 -
code-samples Public
Forked from NVIDIA-developer-blog/code-samplesSource code examples from the Parallel Forall Blog
HTML BSD 3-Clause "New" or "Revised" License UpdatedJan 14, 2021 -
detectron2 Public
Forked from facebookresearch/detectron2Detectron2 is FAIR's next-generation platform for object detection and segmentation.
Python Apache License 2.0 UpdatedJan 13, 2021 -
Best_AI_paper_2020 Public
Forked from louisfb01/Best_AI_paper_2020A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
MIT License UpdatedDec 22, 2020 -
gitignore Public
Forked from github/gitignoreA collection of useful .gitignore templates
Creative Commons Zero v1.0 Universal UpdatedOct 24, 2020 -
grace Public
Forked from sands-lab/graceGRACE - GRAdient ComprEssion for distributed deep learning
Python BSD 2-Clause "Simplified" License UpdatedOct 24, 2020 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedSep 30, 2020 -
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesDeep Learning Examples
Python UpdatedSep 25, 2020 -
training-bottleneck Public
Forked from netx-repo/training-bottleneckAnalyze network performance in distributed training
Python Apache License 2.0 UpdatedAug 25, 2020 examples Public
Forked from pytorch/examplesA set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 3, 2020 attention-is-all-you-need-pytorch Public
Forked from jadore801120/attention-is-all-you-need-pytorchA PyTorch implementation of the Transformer model in "Attention is All You Need".
Python MIT License UpdatedJul 8, 2020 nccl-tests Public
Forked from NVIDIA/nccl-testsNCCL Tests
Cuda BSD 3-Clause "New" or "Revised" License UpdatedJun 24, 2020 deep-gradient-compression Public
Forked from synxlin/deep-gradient-compression[ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Python Other UpdatedJun 22, 2020 parallax Public
Forked from snuspl/parallaxA Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
Python Apache License 2.0 UpdatedMay 14, 2020 smhasher Public
Forked from aappleby/smhasherAutomatically exported from code.google.com/p/smhasher
C++ UpdatedMar 4, 2020 tutorials Public
Forked from p4lang/tutorialsP4 language tutorials
Python Apache License 2.0 UpdatedNov 9, 2019 Previous Next