-
Tsinghua University
-
23:09
(UTC +08:00) - https://pacman.cs.tsinghua.edu.cn/~zjd/author/shizhi-tang/
- https://orcid.org/0000-0002-6543-0859
Highlights
- Pro
-
-
-
FreeTensor Public
A language and compiler for irregular tensor programs.
-
fairscale Public
Forked from facebookresearch/fairscalePyTorch extensions for high performance and large scale training.
Python Other UpdatedNov 26, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedSep 10, 2024 -
-
benchmark Public
Forked from google/benchmarkA microbenchmark support library
C++ Apache License 2.0 UpdatedAug 13, 2024 -
taskflow Public
Forked from taskflow/taskflowA General-purpose Task-parallel Programming System using Modern C++
C++ Other UpdatedAug 13, 2024 -
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQAn easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python MIT License UpdatedAug 9, 2024 -
pybind11 Public
Forked from pybind/pybind11Seamless operability between C++11 and Python
C++ Other UpdatedAug 9, 2024 -
googletest Public
Forked from google/googletestGoogleTest - Google Testing and Mocking Framework
C++ BSD 3-Clause "New" or "Revised" License UpdatedAug 7, 2024 -
-
llm-awq Public
Forked from mit-han-lab/llm-awq[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License UpdatedJul 16, 2024 -
EETQ Public
Forked from NetEase-FuXi/EETQEasy and Efficient Quantization for Transformers
C++ Apache License 2.0 UpdatedJul 15, 2024 -
fastmoe Public
Forked from laekov/fastmoeA fast MoE impl for PyTorch
Python Apache License 2.0 UpdatedJun 3, 2024 -
pytorch-benchmark Public
Forked from pytorch/benchmarkTorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 2, 2024 -
ADBench Public
Forked from microsoft/ADBenchBenchmarking various AD tools.
-
checkout Public
Forked from actions/checkoutAction for checking out a repo
TypeScript MIT License UpdatedJan 12, 2024 -
FreeTensor_experiments Public
Experiments on FreeTensor
-
Enzyme Public
Forked from EnzymeAD/EnzymeHigh-performance automatic differentiation of LLVM and MLIR.
LLVM Other UpdatedJul 6, 2023 -
Open standard for machine learning interoperability
C++ Apache License 2.0 UpdatedOct 8, 2021 -
taco Public
Forked from tensor-compiler/tacoThe Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
C++ Other UpdatedDec 16, 2020 -
transformers Public
Forked from huggingface/transformers🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Python Apache License 2.0 UpdatedDec 7, 2020 -
incubator-tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedSep 17, 2020 -
longformer Public
Forked from allenai/longformerLongformer: The Long-Document Transformer
Python Apache License 2.0 UpdatedJun 6, 2020 -
async-syscall-app Public
Userspace for roastduck/linux:async. Working in progress.
C++ UpdatedMay 26, 2020 -
-
-