8000 infected4098 (Yong Joon Lee) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View infected4098's full-sized avatar

Highlights

  • Pro

Block or report infected4098

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Conformer-based Metric GAN for speech enhancement

Python 367 64 Updated May 3, 2024

flow matching based speech enhancement

Python 14 Updated Jun 30, 2025

The official Implementation of PeriodWave and PeriodWave-Turbo

Python 199 13 Updated Apr 14, 2025

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 946 112 Updated Aug 7, 2024

Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations

Python 57 3 Updated Jan 16, 2025

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 73 4 Updated Jun 27, 2025

Easy-to-Use Speech MOS predictors

Python 291 16 Updated Oct 24, 2023

[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching

Python 100 7 Updated Mar 27, 2025
Python 33 4 Updated May 13, 2024

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Python 152 19 Updated Feb 1, 2023
Python 53 5 Updated Dec 2, 2024
Jupyter Notebook 2 Updated Jan 29, 2025

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,497 147 Updated Jun 24, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,055 134 Updated Sep 5, 2024

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python 120 15 Updated Jul 14, 2022

A simple library for Fréchet Audio Distance (FAD) calculation

Python 222 24 Updated May 26, 2025

Assessing Generative Models via Precision and Recall (official repository)

Python 105 12 Updated Nov 21, 2022

An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.

Python 15 1 Updated Jun 4, 2025

Collection of audio-focused loss functions in PyTorch

Python 794 72 Updated Jul 30, 2024

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,309 233 Updated May 21, 2023

A small package to create visualizations of PyTorch execution graphs

Jupyter Notebook 3,389 284 Updated Dec 30, 2024

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…

Python 459 22 Updated Jul 1, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,476 243 Updated Feb 13, 2025

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Python 83 15 Updated Sep 5, 2020

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021

Python 284 20 Updated Jul 22, 2022

Evaluation and Benchmarking of Speech Super-resolution Methods

Python 150 12 Updated Jun 17, 2022

Structured state space sequence models

Jupyter Notebook 2,672 325 Updated Jul 17, 2024

7th deep daiv. 은근예민

Python 3 Updated Apr 27, 2024

(FREE SITE GENERATOR) - A Customizable/Hackable portfolio jekyll theme where you can blog using Markdown or CMS 🚀 in minutes built for developers. (with CMS) ✨

SCSS 723 1,013 Updated Apr 29, 2025
0