- Hanam-si, Gyeonggi-Do, South Korea
- https://velog.io/@infected4098/posts
- https://infected4098.github.io
Highlights
- Pro
Stars
Conformer-based Metric GAN for speech enhancement
The official Implementation of PeriodWave and PeriodWave-Turbo
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Official PyTorch implementation of BigVGAN (ICLR 2023)
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
A simple library for Fréchet Audio Distance (FAD) calculation
Assessing Generative Models via Precision and Recall (official repository)
An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.
Collection of audio-focused loss functions in PyTorch
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
A small package to create visualizations of PyTorch execution graphs
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021
Evaluation and Benchmarking of Speech Super-resolution Methods
Structured state space sequence models
(FREE SITE GENERATOR) - A Customizable/Hackable portfolio jekyll theme where you can blog using Markdown or CMS 🚀 in minutes built for developers. (with CMS) ✨