lifeiteng

Feiteng lifeiteng

Full stack Algorithm Engineer

503 followers · 95 following

Achievements

x3 x2

Achievements

x3 x2

Stars

pytorch / torchcodec

PyTorch video decoding

Python 534 33 Updated May 5, 2025

WeichenFan / CFG-Zero-star

Official repo for CFG-Zero*

Python 524 19 Updated May 2, 2025

taco-group / DecAlign

A novel cross-modal decoupling and alignment framework for multimodal representation learning.

JavaScript 20 1 Updated Mar 19, 2025

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 2,913 151 Updated May 6, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,416 1,406 Updated May 6, 2025

SilentView / GigaTok

Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"

Python 142 1 Updated Apr 22, 2025

knik0 / faad2

Freeware Advanced Audio (AAC) Decoder faad2 mirror

C 186 76 Updated Mar 4, 2025

axiomatic-systems / Bento4

Full-featured MP4 format, MPEG DASH, HLS, CMAF SDK and tools

C++ 2,153 500 Updated Nov 15, 2024

jonghwanhyeon / python-ffmpeg

A python binding for FFmpeg which provides sync and async APIs

Python 334 53 Updated Jul 31, 2024

NVIDIA / free-threaded-python

No-GIL Python environment featuring NVIDIA Deep Learning libraries.

Dockerfile 59 3 Updated Apr 14, 2025

AudioLLMs / AudioBench

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 202 7 Updated Apr 1, 2025

MatthewCYM / VoiceBench

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 186 10 Updated May 6, 2025

OpenBMB / UltraEval-Audio

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 91 2 Updated Apr 17, 2025

dynamic-superb / dynamic-superb

The official repository of Dynamic-SUPERB.

Python 180 90 Updated Mar 15, 2025

YuqingWang1029 / TokenBridge

TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge

Python 107 3 Updated May 6, 2025

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

Python 1,017 233 Updated May 2, 2025

NVIDIA / multi-storage-client

Unified high-performance Python client for object and file stores.

Python 24 3 Updated May 5, 2025

timofurrer / colorful

Terminal string styling done right, in Python 🐍 🎉

Python 533 23 Updated Jan 7, 2024

xiaomi-research / r1-aqa

🤗 R1-AQA Model: mispeech/r1-aqa

Python 245 21 Updated Mar 28, 2025

RobinWu218 / SimDINO

Implementation for SimDINO/SimDINOv2

Python 126 9 Updated Mar 15, 2025

mbzuai-oryx / LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Python 243 25 Updated Mar 20, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 9,077 947 Updated Apr 9, 2025

xzhang9308 / OLVQ

2 Updated Dec 3, 2024

gradio-app / fastrtc

The python library for real-time communication

JavaScript 3,829 330 Updated Apr 23, 2025

Audio-WestlakeU / CleanMel

Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".

Python 50 2 Updated Apr 15, 2025

Jiang-Yidi / UniCodec

UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound

118 2 Updated Feb 28, 2025

lmxue / Audio-FLAN

Audio-FLAN

142 4 Updated Mar 6, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,516 829 Updated Apr 29, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,225 723 Updated May 4, 2025

sp-nitech / diffsptk

A differentiable version of SPTK

Python 182 16 Updated Apr 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feiteng lifeiteng

Achievements

Achievements

Block or report lifeiteng

Stars

pytorch / torchcodec

WeichenFan / CFG-Zero-star

taco-group / DecAlign

SandAI-org / MAGI-1

NVIDIA / TensorRT-LLM

SilentView / GigaTok

knik0 / faad2

axiomatic-systems / Bento4

jonghwanhyeon / python-ffmpeg

NVIDIA / free-threaded-python

AudioLLMs / AudioBench

MatthewCYM / VoiceBench

OpenBMB / UltraEval-Audio

dynamic-superb / dynamic-superb

YuqingWang1029 / TokenBridge

lhotse-speech / lhotse

NVIDIA / multi-storage-client

timofurrer / colorful

xiaomi-research / r1-aqa

RobinWu218 / SimDINO

mbzuai-oryx / LLMVoX

SparkAudio / Spark-TTS

xzhang9308 / OLVQ

gradio-app / fastrtc

Audio-WestlakeU / CleanMel

Jiang-Yidi / UniCodec

lmxue / Audio-FLAN

deepseek-ai / FlashMLA

QwenLM / Qwen2.5-VL

sp-nitech / diffsptk