bigpon

🎯

Focusing

Yi-Chiao WU bigpon

🎯

Focusing

Research Scientist @ Meta Reality Labs Research topics: Neural codec Voice Conversion, Speech Synthesis, Speech Enhancement.

98 followers · 9 following

Meta
New York City, NY, US
03:36 (UTC -04:00)
https://bigpon.github.io/

Achievements

Stars

ddlBoJack / MMAR

Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Python 141 4 Updated Jun 6, 2025

helblazer811 / Diffusion-Explorer

Interactive visualizations of the geometric intuition behind diffusion models.

Svelte 781 32 Updated Jun 17, 2025

wavlab-speech / versa

Versatile Evaluation of Speech and Audio

Python 288 31 Updated Jul 1, 2025

pnlong / PDMX

PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing

Python 62 4 Updated Jun 1, 2025

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 1,073 93 Updated May 16, 2025

facebookresearch / FlowDec

An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.

Python 178 14 Updated Mar 22, 2025

stepfun-ai / Step-Audio

Python 4,392 358 Updated Jun 12, 2025

yaoxunji / gen-se

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Python 142 21 Updated Feb 28, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 281 33 Updated Jun 15, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 526 37 Updated Jun 5, 2025

JusperLee / SonicSim

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Python 237 23 Updated Jan 22, 2025

JusperLee / TIGER

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 282 45 Updated May 22, 2025

etzinis / heterogeneous_separation

Code and data recipes for the paper: Heterogeneous Target Speech Separation

Python 42 1 Updated Dec 6, 2022

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,079 204 Updated Jul 2, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,921 166 Updated May 28, 2025

justinsalamon / scaper

A library for soundscape synthesis and augmentation

Python 401 65 Updated May 4, 2022

asteroid-team / asteroid

The PyTorch-based audio source separation toolkit for researchers

Python 2,410 436 Updated Jan 11, 2025

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,073 834 Updated Jul 4, 2025

alibabasglab / MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

Python 132 9 Updated Nov 28, 2024

wenet-e2e / wesep

Target Speaker Extraction Toolkit

Python 179 20 Updated Jul 4, 2025

JusperLee / SPMamba

Python 172 23 Updated Dec 5, 2024

JusperLee / Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

Python 485 77 Updated May 26, 2023

mpariente / pywsj0-mix

wsj0-{2, 3, 4, 5} mix generation scripts, in Python.

Python 60 6 Updated Mar 17, 2021

JorisCos / LibriMix

An open source dataset for source separation

Python 431 71 Updated Feb 9, 2024

sh-lee-prml / PeriodWave

The official Implementation of PeriodWave and PeriodWave-Turbo

Python 200 13 Updated Apr 14, 2025

sp-uhh / ears_benchmark

Generation scripts for EARS-WHAM and EARS-Reverb

Python 34 4 Updated Jul 4, 2025

mdeff / fma

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,423 452 Updated Jan 5, 2023

MME-Benchmarks / Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

586 24 Updated May 8, 2025

soham97 / PAM

PAM is a no-reference audio quality metric for audio generation tasks

Python 65 6 Updated Jul 19, 2024

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 808 135 Updated Dec 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yi-Chiao WU bigpon

Achievements

Achievements

Block or report bigpon

Stars

ddlBoJack / MMAR

helblazer811 / Diffusion-Explorer

wavlab-speech / versa

pnlong / PDMX

KinWaiCheuk / nnAudio

facebookresearch / FlowDec

stepfun-ai / Step-Audio

yaoxunji / gen-se

zhenye234 / X-Codec-2.0

facebookresearch / audiobox-aesthetics

JusperLee / SonicSim

JusperLee / TIGER

etzinis / heterogeneous_separation

iver56 / audiomentations

facebookresearch / flow_matching

justinsalamon / scaper

asteroid-team / asteroid

modelscope / modelscope

alibabasglab / MossFormer2

wenet-e2e / wesep

JusperLee / SPMamba

JusperLee / Conv-TasNet

mpariente / pywsj0-mix

JorisCos / LibriMix

sh-lee-prml / PeriodWave

sp-uhh / ears_benchmark

mdeff / fma

MME-Benchmarks / Video-MME

soham97 / PAM

gabrielmittag / NISQA