A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,407 420 Updated May 14, 2025

open-mmlab / mmdeploy

OpenMMLab Model Deployment Framework

Python 2,941 663 Updated Sep 30, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,341 540 Updated May 13, 2025

NVIDIA / CUDALibrarySamples

CUDA Library Samples

Cuda 1,925 386 Updated May 12, 2025

CVCUDA / CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,497 229 Updated May 2, 2025

mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,402 173 Updated Jul 12, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,045 5,222 Updated Jun 27, 2024

bitsandbytes-foundation / bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python 7,013 694 Updated May 13, 2025

NVIDIA / FasterTransformer

Transformer-related optimization, including BERT, GPT

C++ 6,153 904 Updated Mar 27, 2024

zijie0 / HumanSystemOptimization

Study healthily to the age of 150: an incomplete guide to tuning the human system

14,136 1,029 Updated May 9, 2024

Jack47 / hack-SysML

The road to hack SysML and become a system expert

Emacs Lisp 483 59 Updated Sep 25, 2024

YehLi / xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

Python 969 105 Updated Feb 27, 2023

google / gemmlowp

Low-precision matrix multiplication

C++ 1,803 457 Updated Jan 29, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 28,944 3,587 Updated Jul 23, 2024

MilesCranmer / symbolic_deep_learning

Code for "Discovering Symbolic Models from Deep Learning with Inductive Biases"

Python 750 136 Updated Nov 20, 2023

rguo12 / awesome-causality-algorithms

An index of algorithms for learning causality with data

3,135 468 Updated Jan 22, 2025

amit-sharma / causal-inference-tutorial

Repository with code and slides for a tutorial on causal inference.

Jupyter Notebook 575 111 Updated Sep 23, 2019

hyz-xmaster / VarifocalNet

VarifocalNet: An IoU-aware Dense Object Detector

Python 353 52 Updated Mar 5, 2021

thunlp / GNNPapers

Must-read papers on graph neural networks (GNN)

16,397 3,010 Updated Dec 20, 2023

王波超(Bochao Wang) sergeywong

Stars

morpho-matters / morpholib

vietnh1009 / ASCII-generator

terrastruct / d2

facebookresearch / chameleon

xlite-dev / Awesome-LLM-Inference

2noise / ChatTTS

Dao-AILab / flash-attention

ggml-org / llama.cpp

microsoft / unilm

NVIDIA / NeMo-Aligner

HazyResearch / ThunderKittens

NVIDIA / TransformerEngine