zhuangwang93

Zhuang Wang zhuangwang93

Applied Scientist at AWS AI

20 followers · 2 following

Amazon Web Services
Bellevue Washington
http://zhuangwang.org/

Achievements

zhuangwang93.github.io Public
Forked from SebastinSanty/minimal-research-theme

Just a plain, simple and elegant one-page theme for research/academia.

HTML Updated Jun 24, 2025
ZEN Public

ZEN for sparse tensor communication [OSDI '25]

Python 1 Updated May 20, 2025
mamba Public
Forked from state-spaces/mamba

Mamba SSM architecture

Python Apache License 2.0 Updated Aug 3, 2024
LoRA Public
Forked from microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python MIT License Updated Jan 9, 2024
Espresso Public

Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '23)

Python 15 3 Other Updated Sep 21, 2023
Gemini_SOSP23 Public

Artifact Evaluation of Gemini SOSP 2023

Python 1 MIT License Updated Aug 25, 2023
DeepSpeed Public
Forked from deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python Apache License 2.0 Updated Aug 22, 2023
Cupcake Public

Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)

Python 9 2 Updated Jul 13, 2023
dragonn Public

DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks (ICML '22)

Python 6 Updated Jun 19, 2023
research-course Public
Forked from noise-lab/research-course

"How to Do Great Research" Course for Ph.D. Students

TeX Other Updated Apr 12, 2023
byteps Public
Forked from bytedance/byteps

A high performance and generic framework for distributed DNN training

Python Other Updated Aug 20, 2022
pytorch-image-models Public
Forked from huggingface/pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Python Apache License 2.0 Updated Jul 22, 2022
virtualQ Public

Shell Updated Mar 7, 2022

8000
Model-Compression-Papers Public
Forked from chester256/Model-Compression-Papers

Papers for deep neural network compression and acceleration

Updated Jun 21, 2021
cuda-samples Public
Forked from NVIDIA/cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C Other Updated May 4, 2021
code-samples Public
Forked from NVIDIA-developer-blog/code-samples

Source code examples from the Parallel Forall Blog

HTML BSD 3-Clause "New" or "Revised" License Updated Jan 14, 2021
detectron2 Public
Forked from facebookresearch/detectron2

Detectron2 is FAIR's next-generation platform for object detection and segmentation.

Python Apache License 2.0 Updated Jan 13, 2021
Best_AI_paper_2020 Public
Forked from louisfb01/Best_AI_paper_2020

A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code

MIT License Updated Dec 22, 2020
gitignore Public
Forked from github/gitignore

A collection of useful .gitignore templates

Creative Commons Zero v1.0 Universal Updated Oct 24, 2020
grace Public
Forked from sands-lab/grace

GRACE - GRAdient ComprEssion for distributed deep learning

Python BSD 2-Clause "Simplified" License Updated Oct 24, 2020
fairseq Public
Forked from facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python MIT License Updated Sep 30, 2020
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamples

Deep Learning Examples

Python Updated Sep 25, 2020
training-bottleneck Public
Forked from netx-repo/training-bottleneck

Analyze network performance in distributed training

Python Apache License 2.0 Updated Aug 25, 2020
examples Public
Forked from pytorch/examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python BSD 3-Clause "New" or "Revised" License Updated Aug 3, 2020
attention-is-all-you-need-pytorch Public
Forked from jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python MIT License Updated Jul 8, 2020
nccl-tests Public
Forked from NVIDIA/nccl-tests

NCCL Tests

Cuda BSD 3-Clause "New" or "Revised" License Updated Jun 24, 2020
deep-gradient-compression Public
Forked from synxlin/deep-gradient-compression

[ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

Python Other Updated Jun 22, 2020
parallax Public
Forked from snuspl/parallax

A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.

Python Apache License 2.0 Updated May 14, 2020
smhasher Public
Forked from aappleby/smhasher

Automatically exported from code.google.com/p/smhasher

C++ Updated Mar 4, 2020
tutorials Public
Forked from p4lang/tutorials

P4 language tutorials

Python Apache License 2.0 Updated Nov 9, 2019

Zhuang Wang zhuangwang93

Achievements

Achievements

zhuangwang93.github.io Public

Uh oh!

ZEN Public

Uh oh!

mamba Public

Uh oh!

LoRA Public

Uh oh!

Espresso Public

Uh oh!

Gemini_SOSP23 Public

Uh oh!

DeepSpeed Public

Uh oh!

Cupcake Public

Uh oh!

dragonn Public

Uh oh!

research-course Public

Uh oh!

byteps Public

Uh oh!

pytorch-image-models Public

Uh oh!

virtualQ Public

Uh oh!

Model-Compression-Papers Public

Uh oh!

cuda-samples Public

Uh oh!

code-samples Public

Uh oh!

detectron2 Public

Uh oh!

Best_AI_paper_2020 Public

Uh oh!

gitignore Public

Uh oh!

grace Public

Uh oh!

fairseq Public

Uh oh!

DeepLearningExamples Public

Uh oh!

training-bottleneck Public

Uh oh!

examples Public

Uh oh!

attention-is-all-you-need-pytorch Public

Uh oh!

nccl-tests Public

Uh oh!

deep-gradient-compression Public

Uh oh!

parallax Public

Uh oh!

smhasher Public

Uh oh!

tutorials Public

Uh oh!