-
Imperial College London
- London, United Kingdom
- https://mingxue-xu.github.io/
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
For releasing code related to compression methods for transformers, accompanying our publications
đź’Ť Efficient tensor decomposition-based filter pruning
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu, Jiawei Zhao, Yuandong Tian, Zhangyang Wang
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A thoroughly investigated survey for tensorial neural networks.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
MLPerf™ Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers
Computational statistics and machine learning reading group at Imperial College London (2019-2020)
ÎĽNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.
[SenSys 2019] Neuro.ZERO: A Zero-Energy Neural Network Accelerator for Embedded Sensing and Inference Systems
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…