8000 infected4098 (Yong Joon Lee) / Starred · GitHub

More Web Proxy on the site http://driver.im/

infected4098

Follow

Yong Joon Lee infected4098

Follow

Masters' student in KAIST EE

7 followers · 5 following

Highlights

Pro

Stars

ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement

Python 367 64 Updated May 3, 2024

seongq / flowmse

flow matching based speech enhancement

Python 14 Updated Jun 30, 2025

sh-lee-prml / PeriodWave

The official Implementation of PeriodWave and PeriodWave-Turbo

Python 199 13 Updated Apr 14, 2025

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 946 112 Updated Aug 7, 2024

vivian556123 / NeurIPS2024-CoVoMix

Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

Python 57 3 Updated Jan 16, 2025

alessandroragano / scoreq

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 73 4 Updated Jun 27, 2025

tarepan / SpeechMOS

Easy-to-Use Speech MOS predictors

Python 291 16 Updated Oct 24, 2023

luotianze666 / WaveFM

[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching

Python 100 7 Updated Mar 27, 2025

kaistmm / fregrad

Python 33 4 Updated May 13, 2024

ncsoft / avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Python 152 19 Updated Feb 1, 2023

JasonSWFu / VQscore

Python 53 5 Updated Dec 2, 2024

vdng9338 / audio-embedding-sensitivity

Jupyter Notebook 2 Updated Jan 29, 2025

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,497 147 Updated Jun 24, 2025

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,055 134 Updated Sep 5, 2024

rishikksh20 / Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python 120 15 Updated Jul 14, 2022

microsoft / fadtk

A simple library for Fréchet Audio Distance (FAD) calculation

Python 222 24 Updated May 26, 2025

msmsajjadi / precision-recall-distributions

Assessing Generative Models via Precision and Recall (official repository)

Python 105 12 Updated Nov 21, 2022

infected4098 / Wave-U-Mamba

An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.

Python 15 1 Updated Jun 4, 2025

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 794 72 Updated Jul 30, 2024

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,309 233 Updated May 21, 2023

szagoruyko / pytorchviz

A small package to create visualizations of PyTorch execution graphs

Jupyter Notebook 3,389 284 Updated Dec 30, 2024

kyegomez / VisionMamba

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…

Python 459 22 Updated Jul 1, 2025

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,476 243 Updated Feb 13, 2025

pyyush / SpecAugment

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Python 83 15 Updated Sep 5, 2020

maum-ai / nuwave

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021

Python 284 20 Updated Jul 22, 2022

haoheliu / ssr_eval

Evaluation and Benchmarking of Speech Super-resolution Methods

Python 150 12 Updated Jun 17, 2022

state-spaces / s4

Structured state space sequence models

Jupyter Notebook 2,672 325 Updated Jul 17, 2024

Jin1025 / Timbre2Vec

7th deep daiv. 은근예민

Python 3 Updated Apr 27, 2024

sujaykundu777 / devlopr-jekyll

(FREE SITE GENERATOR) - A Customizable/Hackable portfolio jekyll theme where you can blog using Markdown or CMS 🚀 in minutes built for developers. (with CMS) ✨

SCSS 723 1,013 Updated Apr 29, 2025

0