8000 talenz / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View talenz's full-sized avatar

Block or report talenz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM.

Python 523 42 Updated Jul 4, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 16,042 1,900 Updated Dec 25, 2024

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 509 58 Updated Jul 3, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,374 846 Updated Aug 12, 2024

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 252 39 Updated Jan 29, 2023

✨✨Latest Advances on Multimodal Large Language Models

15,729 1,023 Updated Jul 1, 2025

Download DeepMind's Kinetics dataset.

Python 266 85 Updated Jun 7, 2022

Model Quantization Benchmark

Python 819 143 Updated Apr 20, 2025

[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization

Python 95 14 Updated May 5, 2022

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

Python 133 12 Updated Apr 28, 2022

An Open-Source Library for Training Binarized Neural Networks

Python 720 86 Updated Aug 12, 2024

Pytorch implementation of our paper accepted by ICCV 2021 -- ReCU: Reviving the Dead Weights in Binary Neural Networks http://arxiv.org/abs/2103.12369

Python 39 8 Updated Dec 7, 2021

An official PyTorch implementation of the paper "Distance-aware Quantization", ICCV 2021.

Python 48 9 Updated Nov 1, 2024

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python 2,127 419 Updated Jul 11, 2024

This is the official pytorch implementation for paper: BiDet: An Efficient Binarized Object Detector, which is accepted by CVPR2020.

Python 174 35 Updated Jul 7, 2021

PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks

Jupyter Notebook 744 124 Updated Apr 16, 2024

Neural Network Quantization With Fractional Bit-widths

Python 12 6 Updated Feb 19, 2021

[ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inference" by Yonggan Fu, Qixuan Yu, Meng Li, Vikas Chandra, Yingya…

Python 14 4 Updated Feb 13, 2022

The pytorch re-implement of the official efficientdet with SOTA performance in real time and pretrained weights.

Jupyter Notebook 5,255 1,264 Updated Oct 24, 2021

PyTorch implementation of Towards Efficient Training for Neural Network Quantization

Python 15 2 Updated Jan 16, 2020

Using ideas from product quantization for state-of-the-art neural network compression.

Python 145 15 Updated Aug 14, 2021

An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.

Python 93 17 Updated Jul 14, 2023

LeetCode 101:力扣刷题指南

9,528 1,237 Updated Dec 8, 2024

Pytorch implementation for FAT: learning low-bitwidth parametric representation via frequency-aware transformation

Jupyter Notebook 27 6 Updated May 2, 2021
Python 47 8 Updated Jan 21, 2022
Python 354 67 Updated Sep 12, 2023

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,358 406 Updated Jul 4, 2025

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,153 225 Updated Mar 4, 2025

Pytorch implementation of BRECQ, ICLR 2021

Python 276 59 Updated Aug 1, 2021

BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)

Python 40 9 Updated Jan 12, 2021
Next
0