Stars
Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA, and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
✨✨Latest Advances in Multimodal Large Language Models
Download DeepMind's Kinetics dataset.
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
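The core trick in fixed-point-only arithmetic is easy to show in isolation: values carry an implicit power-of-two scale (a fractional length), products are computed in wide integers, and rescaling is just a bit shift. A minimal sketch below, illustrative only and not F8Net's exact scheme:

```python
# Minimal sketch of fixed-point 8-bit multiplication with power-of-two
# scales (fractional lengths). Illustrative only; not F8Net's exact scheme.
import numpy as np

def to_fixed(x, frac_bits):
    """Quantize a float array to int8 with 2**-frac_bits resolution."""
    q = np.round(x * (1 << frac_bits))
    return np.clip(q, -128, 127).astype(np.int8)

def fixed_mul(a, fa, b, fb, f_out):
    """Multiply two int8 fixed-point arrays; rescale by a right shift."""
    prod = a.astype(np.int32) * b.astype(np.int32)   # exact in int32
    shift = fa + fb - f_out                          # assumed >= 0 here
    return np.clip(prod >> shift, -128, 127).astype(np.int8)

x = to_fixed(np.array([0.5, -0.25]), frac_bits=6)    # Q1.6 format
w = to_fixed(np.array([0.75, 0.5]), frac_bits=6)
y = fixed_mul(x, 6, w, 6, f_out=6)
print(y / (1 << 6))                                  # ~[0.375, -0.125]
```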
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation (CVPR 2022).
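Quantizers like this train through a non-differentiable rounding step by using a straight-through estimator (STE). A minimal sketch of the vanilla STE follows; N2UQ's *generalized* STE reshapes this backward pass, so treat this only as the base idea:

```python
# Minimal sketch of the (vanilla) straight-through estimator for a uniform
# quantizer: round in the forward pass, pass gradients through in backward.
import torch

class RoundSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        return torch.round(x)

    @staticmethod
    def backward(ctx, grad_out):
        return grad_out  # identity gradient: d round(x)/dx := 1

def fake_quant(x, bits=2):
    levels = 2 ** bits - 1
    xc = x.clamp(0, 1)                 # assume inputs normalized to [0, 1]
    return RoundSTE.apply(xc * levels) / levels

x = torch.rand(4, requires_grad=True)
fake_quant(x).sum().backward()
print(x.grad)  # ones: the gradient flowed straight through the rounding
```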
An Open-Source Library for Training Binarized Neural Networks
PyTorch implementation of our ICCV 2021 paper "ReCU: Reviving the Dead Weights in Binary Neural Networks" (http://arxiv.org/abs/2103.12369).
An official PyTorch implementation of the paper "Distance-aware Quantization", ICCV 2021.
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
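The shift itself is a cheap tensor operation: a fraction of channels moves one step along the time axis in each direction, with zero padding at clip boundaries. A small sketch (the shapes and fold_div=8 default are illustrative):

```python
# Minimal sketch of the temporal shift at the heart of TSM: a fraction of
# channels is shifted one step forward/backward along the time axis, with
# zero padding at the clip boundaries.
import torch

def temporal_shift(x, fold_div=8):
    """x: (N, T, C, H, W) video features."""
    n, t, c, h, w = x.shape
    fold = c // fold_div
    out = torch.zeros_like(x)
    out[:, :-1, :fold] = x[:, 1:, :fold]                # shift toward the past
    out[:, 1:, fold:2 * fold] = x[:, :-1, fold:2 * fold]  # shift toward the future
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]           # rest untouched
    return out

feats = torch.randn(2, 8, 64, 7, 7)                     # toy clip features
print(temporal_shift(feats).shape)                      # (2, 8, 64, 7, 7)
```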
Official PyTorch implementation of "BiDet: An Efficient Binarized Object Detector" (CVPR 2020).
PyHessian is a PyTorch library for second-order (Hessian-based) analysis and training of neural networks.
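The workhorse behind such second-order analysis is the Hessian-vector product computed by double backpropagation (Pearlmutter's trick), which never materializes the Hessian. A minimal sketch of that building block, not PyHessian's actual API:

```python
# Minimal sketch of the Hessian-vector product (Pearlmutter's trick) that
# second-order analyses like PyHessian's build on; not PyHessian's API.
import torch

def hvp(loss, params, vec):
    """Compute H @ vec without materializing the Hessian."""
    grads = torch.autograd.grad(loss, params, create_graph=True)
    dot = sum((g * v).sum() for g, v in zip(grads, vec))
    return torch.autograd.grad(dot, params)

w = torch.randn(3, requires_grad=True)
loss = (w ** 2).sum()          # Hessian is 2 * I
v = [torch.ones(3)]
print(hvp(loss, [w], v))       # (tensor([2., 2., 2.]),)
```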
Neural Network Quantization With Fractional Bit-widths
[ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inference" by Yonggan Fu, Qixuan Yu, Meng Li, Vikas Chandra, Yingya…
A PyTorch re-implementation of the official EfficientDet, with real-time SOTA performance and pretrained weights.
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
Using ideas from product quantization for state-of-the-art neural network compression.
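The compression idea is to split each weight matrix into small sub-vectors and replace them with entries from a learned codebook, storing only the codebook and per-block indices. A toy sketch of that basic product-quantization step with plain k-means; the repo's method goes further than this (e.g., fine-tuning codebooks through the network):

```python
# Toy sketch of product quantization for weight compression: split weight
# rows into sub-vectors, learn a small k-means codebook, store codebook +
# indices instead of floats. Illustrative only, not the repo's full method.
import torch

def kmeans(x, k, iters=20):
    centers = x[torch.randperm(len(x))[:k]].clone()
    for _ in range(iters):
        assign = torch.cdist(x, centers).argmin(dim=1)
        for j in range(k):
            pts = x[assign == j]
            if len(pts):                       # skip empty clusters
                centers[j] = pts.mean(dim=0)
    return centers, assign

W = torch.randn(64, 32)                    # toy weight matrix
d, k = 8, 16                               # sub-vector dim, codebook size
blocks = W.reshape(-1, d)                  # 256 sub-vectors of length 8
codebook, codes = kmeans(blocks, k)
W_hat = codebook[codes].reshape(W.shape)   # reconstructed weights
print((W - W_hat).pow(2).mean())           # quantization error
```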
An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.
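EWGS replaces the identity backward pass of the STE with an element-wise scale based on the discretization error. A minimal sketch, assuming the scaling rule g_x = g_q(1 + δ·sign(g_q)(x − x_q)); δ is a small hyperparameter, and the exact rule should be checked against the paper:

```python
# Sketch of an EWGS-style backward pass: each gradient element is scaled by
# the rounding error instead of passing through unchanged. The rule below is
# an assumption drawn from the paper's published form; verify against source.
import torch

class EWGSQuant(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, delta):
        x_q = torch.round(x)
        ctx.save_for_backward(x - x_q)     # discretization error
        ctx.delta = delta
        return x_q

    @staticmethod
    def backward(ctx, g_q):
        (err,) = ctx.saved_tensors
        g_x = g_q * (1 + ctx.delta * torch.sign(g_q) * err)
        return g_x, None                   # no gradient for delta

x = torch.tensor([0.3, 1.7], requires_grad=True)
EWGSQuant.apply(x, 0.1).sum().backward()
print(x.grad)   # STE gradient (1) scaled per element by the rounding error
```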
PyTorch implementation of FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation.
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously improving. Welcome to PR works (papers, repositories) that the repo has missed.
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)