mtxing

🎯

Focusing

mtxing mtxing

🎯

Focusing

8 followers · 13 following

Achievements

GPT-SoVITS-V3-Infer-API Public
Forked from CNFlyCat/GPT-SoVITS-V3-Infer-API

Convenient for developers to call inference models from version v1 to v3 through API, supporting streaming transmission and specified type file transfer.

Python MIT License Updated Feb 19, 2025
S3Tokenizer Public
Forked from xingchensong/S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python Apache License 2.0 Updated Oct 12, 2024
The_C_Programming_Language Public

C_Programming_Language submit

C Updated May 6, 2023
vits_chinese Public
Forked from PlayVoice/vits_chinese

Best TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Also for voice clone!

Python Updated Mar 20, 2023
espnet Public
Forked from espnet/espnet

End-to-End Speech Processing Toolkit

Python Apache License 2.0 Updated Sep 8, 2022
metrics Public
Forked from Lightning-AI/torchmetrics

Machine learning metrics for distributed, scalable PyTorch applications.

Python Apache License 2.0 Updated Jul 27, 2022
speechbrain Public
Forked from speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Python Apache License 2.0 Updated Mar 25, 2022
DNS-Challenge Public
Forked from microsoft/DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python Creative Commons Attribution 4.0 International Updated Mar 22, 2022
BLOOM-Net Public
Forked from kimsunwiub/BLOOM-Net

Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"

Python MIT License Updated Feb 13, 2022
HGCN Public
Forked from wangtianrui/HGCN

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

Python Updated Jan 31, 2022
FullSubNet Public
Forked from Audio-WestlakeU/FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python MIT License Updated Jan 21, 2022
AFRCNN-For-Speech-Separation Public
Forked from JusperLee/AFRCNN-For-Speech-Separation

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Python 1 MIT License Updated Dec 4, 2021
sudo_rm_rf Public template
Forked from etzinis/sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of sep…

Jupyter Notebook MIT License Updated Nov 21, 2021
aps Public
Forked from funcwj/aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

Python Apache License 2.0 Updated Nov 5, 2021
voicefixer_main Public
Forked from haoheliu/voicefixer_main

General Speech Restoration

Python GNU Affero General Public License v3.0 Updated Nov 2, 2021
conformer Public
Forked from sooftware/conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python Apache License 2.0 Updated Oct 9, 2021
generative_inpainting Public
Forked from JiahuiYu/generative_inpainting

DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral

Python Other Updated Aug 29, 2021
pyloudnorm Public
Forked from csteinmetz1/pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python MIT License Updated Aug 28, 2021
IguanaTexMac Public
Forked from tsung-ju/IguanaTexMac

IguanaTex for mac

VBA Updated Jun 12, 2021
DeepLearning-500-questions Public
Forked from scutan90/DeepLearning-500-questions

深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，50余万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系scutjy2015@163.com 版权所有，违权必究 Tan 2018.06

JavaScript GNU General Public License v3.0 Updated May 30, 2021
deep_avsr Public
Forked from smeetrs/deep_avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Python MIT License Updated May 20, 2021
hifi-gan Public
Forked from jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python MIT License Updated Apr 28, 2021
SpecAugment Public
Forked from DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Python Apache License 2.0 Updated Apr 27, 2021
performer-pytorch Public
Forked from lucidrains/performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

Python MIT License Updated Apr 21, 2021
MetricGAN Public
Forked from JasonSWFu/MetricGAN

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)

MATLAB Updated Apr 19, 2021
hifigan-denoiser Public
Forked from rishikksh20/hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python Apache License 2.0 Updated Apr 8, 2021
MS-SNSD Public
Forked from microsoft/MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML MIT License Updated Mar 15, 2021
torch-dct Public
Forked from zh217/torch-dct

DCT (discrete cosine transform) functions for pytorch

Python MIT License Updated Mar 9, 2021
Parrotron Public
Forked from 8secz-johndpope/Parrotron

Python Updated Mar 5, 2021
pytorch-inpainting-with-partial-conv Public
Forked from naoto0804/pytorch-inpainting-with-partial-conv

Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]

Python MIT License Updated Dec 4, 2020

mtxing mtxing

Achievements

Achievements

GPT-SoVITS-V3-Infer-API Public

Uh oh!

S3Tokenizer Public

Uh oh!

The_C_Programming_Language Public

Uh oh!

vits_chinese Public

Uh oh!

espnet Public

Uh oh!

metrics Public

Uh oh!

speechbrain Public

Uh oh!

DNS-Challenge Public

Uh oh!

BLOOM-Net Public

Uh oh!

HGCN Public

Uh oh!

FullSubNet Public

Uh oh!

AFRCNN-For-Speech-Separation Public

Uh oh!

sudo_rm_rf Public template

Uh oh!

aps Public

Uh oh!

voicefixer_main Public

Uh oh!

conformer Public

Uh oh!

generative_inpainting Public

Uh oh!

pyloudnorm Public

Uh oh!

IguanaTexMac Public

Uh oh!

DeepLearning-500-questions Public

Uh oh!

deep_avsr Public

Uh oh!

hifi-gan Public

Uh oh!

SpecAugment Public

Uh oh!

performer-pytorch Public

Uh oh!

MetricGAN Public

Uh oh!

hifigan-denoiser Public

Uh oh!

MS-SNSD Public

Uh oh!

torch-dct Public

Uh oh!

Parrotron Public

Uh oh!

pytorch-inpainting-with-partial-conv Public

Uh oh!