HandH1998

63 followers · 55 following

Beijing
UTC +08:00
https://scholar.google.com/citations?hl=zh-CN&user=MBR97ZIAAAAJ

Achievements

x2 x2 x3

sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 1 Apache License 2.0 Updated May 23, 2025
QQQ Public

QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.

Python 130 15 Updated Apr 7, 2025
DeepGEMM Public
Forked from deepseek-ai/DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda MIT License Updated Feb 28, 2025
compressed-tensors Public
Forked from neuralmagic/compressed-tensors

A safetensors extension to efficiently store sparse quantized tensors on disk

Python Apache License 2.0 Updated Feb 20, 2025
llm-compressor Public
Forked from vllm-project/llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python Apache License 2.0 Updated Feb 19, 2025
HandH1998 Public

Updated Feb 17, 2025
ao Public
Forked from pytorch/ao

PyTorch native quantization and sparsity for training and inference

Python BSD 3-Clause "New" or "Revised" License Updated Nov 14, 2024
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Sep 5, 2024
lmdeploy Public
Forked from InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python Apache License 2.0 Updated Aug 29, 2024
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 2 Apache License 2.0 Updated Jul 31, 2024
marlin Public
Forked from IST-DASLab/marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 2 Apache License 2.0 Updated Jun 20, 2024
smoothquant Public
Forked from mit-han-lab/smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python MIT License Updated Nov 29, 2023
Megatron-DeepSpeed Public
Forked from deepspeedai/Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python Other Updated Aug 28, 2023
manifold_distillation Public

Python 1 Updated Mar 10, 2023
mct_former Public

Python 1 Updated Mar 8, 2023
pregenerate_bert_train_corpus Public

Python 1 Updated Mar 3, 2023
books_and_wiki_en_clean_format_and_shard Public

Python Updated Mar 3, 2023
tmp_bert_mlkd Public

Python Updated Mar 3, 2023
lightseq Public
Forked from bytedance/lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ Other Updated Dec 30, 2022
HandH1998.github.io Public

Personal homepage

HTML Updated Jan 5, 2022
academicpages.github.io Public
Forked from academicpages/academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript MIT License Updated Jan 5, 2022
net2net Public
Forked from zhengjian2322/net2net

Python Updated Jul 19, 2021
JS_learn Public

HTML Updated May 23, 2021
NN-CUDA-Example Public template
Forked from godweiyang/NN-CUDA-Example

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python Apache License 2.0 Updated Apr 29, 2021
NLP-Tutorials Public
Forked from MorvanZhou/NLP-Tutorials

Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com

Python MIT License Updated Mar 7, 2021
carInsurancePred Public

Python Updated Jan 25, 2021
soln-ml Public
Forked from thomas-young-2013/mindware

A research framework for fast prototyping of automl algorithms.

Python MIT License Updated Jan 1, 2021
ML_practice Public

Python Updated Nov 23, 2020
matplotlib Public

Python Updated Nov 20, 2020
easy-scrape Public

Python Updated Nov 18, 2020
