8000 Mingxue-Xu (Mingxue (Mercy) Xu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Mingxue-Xu's full-sized avatar
đź’ˇ
đź’ˇ

Block or report Mingxue-Xu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,701 7,506 Updated May 20, 2025

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 794 104 Updated Aug 20, 2024

For releasing code related to compression methods for transformers, accompanying our publications

Python 428 52 Updated Jan 16, 2025

đź’Ť Efficient tensor decomposition-based filter pruning

Jupyter Notebook 16 4 Updated Jun 28, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,365 130 Updated May 20, 2025
Jupyter Notebook 53 5 Updated Oct 3, 2024

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu, Jiawei Zhao, Yuandong Tian, Zhangyang Wang

Python 47 1 Updated Apr 21, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,271 5,998 Updated May 20, 2025

A thoroughly investigated survey for tensorial neural networks.

134 14 Updated Jan 15, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,523 1,439 Updated May 20, 2025

TensorLy: Tensor Learning in Python.

Python 1,609 292 Updated May 5, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,580 29,010 Updated May 20, 2025

Causal Reasoning for Membership Inference Attacks

Python 11 2 Updated Oct 21, 2022

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

HTML 12,570 2,952 Updated Oct 19, 2024

MLPerf™ Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers

C 404 98 Updated May 19, 2025

Computational statistics and machine learning reading group at Imperial College London (2019-2020)

SCSS 24 2 Updated Feb 27, 2025

ÎĽNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.

Python 80 14 Updated Jan 26, 2021
Python 897 143 Updated Nov 29, 2023

[SenSys 2019] Neuro.ZERO: A Zero-Energy Neural Network Accelerator for Embedded Sensing and Inference Systems

C 6 2 Updated May 28, 2020

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

Python 607 83 Updated Nov 19, 2021

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,903 1,912 Updated May 16, 2025
0