Stars
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
2025年全网最全即插即用模块,免费分享!CVPR2025,AAAI2025,ICLR2025,TNNLS2025,arXiv2025......包含人工智能全领域(机器学习、深度学习等),适用于图像分类、目标检测、实例分割、语义分割、全景分割、姿态识别、医学图像分割、视频目标分割、图像抠图、图像编辑、单目标跟踪、多目标跟踪、行人重识别、RGBT、图像去噪、去雨、去雾、去阴影、去模糊、超分辨…
Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Unofficial implementation of LSQ-Net, a neural network quantization framework
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precision"
Official source code for "Oscillations Make Neural Networks Robust to Quantization". Wenshøj et al. 2025 https://arxiv.org/abs/2502.00490
The official implementation of the ICML 2023 paper OFQ-ViT
Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training" (IEEE T-PAMI2025)
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Papers and codes about Quantized Networks for easier survey and reference.
Efficient computing methods developed by Huawei Noah's Ark Lab
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…
[ICCV 2023] Overcoming Forgetting Catastrophe in Quantization-Aware Training
The PyTorch implementation of of Neural Networks for Low-precision Integer Hardware (LLSQ) in ICLR2020 (unofficial)
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]
Offical implementation of "Deep Directly-Trained Spiking Neural Networks for Object Detection" (ICCV2023)
A binary neural network (BNN) for object detection called BOB-YOLO to achieve a balanced performance in terms of computational speed, model size, and detection accuracy.
Offical Implementation of "CLIF: Complementary Leaky Integrate-and-Fire Neuron for Spiking Neural Networks" (ICML 2024 spotlight)
Official code for "EC-SNN: Splitting Deep Spiking Neural Networks on Edge Devices" (IJCAI2024)