saber5433

saber5433

1 follower · 23 following

Achievements

MoonCast Public
Forked from jzq2000/MoonCast

Python MIT License Updated Apr 3, 2025
DNSMOSPro Public
Forked from fcumlin/DNSMOSPro

Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).

Python MIT License Updated Mar 17, 2025
S3Tokenizer Public
Forked from xingchensong/S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python Apache License 2.0 Updated Dec 27, 2024
InspireMusic Public
Forked from FunAudioLLM/InspireMusic

InspireMusic: A fundamental toolkit for music, song and audio generation.

Python Apache License 2.0 Updated Dec 11, 2024
Music-Source-Separation-Training Public
Forked from ZFTurbo/Music-Source-Separation-Training

Repository for training models for music source separation.

Python MIT License Updated Dec 3, 2024
speech-trident Public
Forked from ga642381/speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

Updated Dec 2, 2024
MuCodec Public
Forked from tencent-ailab/MuCodec

Python MIT License Updated Nov 22, 2024
MuseTalk Public
Forked from TMElyralab/MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python Other Updated Nov 15, 2024
python-audio-separator Public
Forked from nomadkaraoke/python-audio-separator

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python MIT License Updated Nov 4, 2024
bitsandbytes Public
Forked from bitsandbytes-foundation/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python MIT License Updated Oct 22, 2024
DiariZen Public
Forked from BUTSpeechFIT/DiariZen

A toolkit for speaker diarization.

Jupyter Notebook MIT License Updated Oct 21, 2024
FCPE Public
Forked from CNChTu/FCPE

Python MIT License Updated Oct 18, 2024
F5-TTS Public
Forked from SWivid/F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python MIT License Updated Oct 16, 2024
to-jyutping Public
Forked from CanCLID/to-jyutping

粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool

TypeScript BSD 2-Clause "Simplified" License Updated Sep 30, 2024
ToJyutping Public
Forked from CanCLID/ToJyutping

粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool

Python BSD 2-Clause "Simplified" License Updated Sep 24, 2024
ctc-forced-aligner Public
Forked from MahmoudAshraf97/ctc-forced-aligner

Text to speech alignment using CTC forced alignment

Python Updated Sep 22, 2024
NeMo-text-processing Public
Forked from NVIDIA/NeMo-text-processing

NeMo text processing for ASR and TTS

Python Apache License 2.0 Updated Sep 19, 2024
BigCodec Public
Forked from Aria-K-Alethia/BigCodec

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python MIT License Updated Sep 19, 2024
super-monotonic-align Public
Forked from supertone-inc/super-monotonic-align

Python MIT License Updated Sep 14, 2024
LangSegment Public
Forked from JaccoSu/juntaosun_LangSegment

It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言（97种语言）混合文本内容自动分词工具。

Python Updated Sep 7, 2024
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python Apache License 2.0 Updated Sep 2, 2024
text-labeler Public
Forked from fishaudio/text-labeler

A simple svs labeling tool

TypeScript Apache License 2.0 Updated Aug 19, 2024
SimpleSpeech Public
Forked from yangdongchao/SimpleSpeech

The open source code for SimpleSpeech series

Python Updated Aug 19, 2024
DeepFilterNet Public
Forked from Rikorose/DeepFilterNet

Noise supression using deep filtering

Python Other Updated Jul 31, 2024
Supercodec Public
Forked from exercise-book-yq/Supercodec

Python MIT License Updated Jul 31, 2024
open_clip Public
Forked from mlfoundations/open_clip

An open source implementation of CLIP.

Python Other Updated Jul 4, 2024
AudioLDM2 Public
Forked from haoheliu/AudioLDM2

Text-to-Audio/Music Generation

Python Other Updated Jun 27, 2024
LibriTTS-P Public
Forked from line/LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

Updated Jun 13, 2024
mamba Public
Forked from state-spaces/mamba

Mamba SSM architecture

Python Apache License 2.0 Updated Jun 5, 2024
lina-speech Public
Forked from theodorblackbird/lina-speech

lina-speech : linear attention based text-to-speech

Jupyter Notebook Other Updated Jun 3, 2024

saber5433

Achievements

Achievements

MoonCast Public

Uh oh!

DNSMOSPro Public

Uh oh!

S3Tokenizer Public

Uh oh!

InspireMusic Public

Uh oh!

Music-Source-Separation-Training Public

Uh oh!

speech-trident Public

Uh oh!

MuCodec Public

Uh oh!

MuseTalk Public

Uh oh!

python-audio-separator Public

Uh oh!

bitsandbytes Public

Uh oh!

DiariZen Public

Uh oh!

FCPE Public

Uh oh!

F5-TTS Public

Uh oh!

to-jyutping Public

Uh oh!

ToJyutping Public

Uh oh!

ctc-forced-aligner Public

Uh oh!

NeMo-text-processing Public

Uh oh!

BigCodec Public

Uh oh!

super-monotonic-align Public

Uh oh!

LangSegment Public

Uh oh!

LLaMA-Factory Public

Uh oh!

text-labeler Public

Uh oh!

SimpleSpeech Public

Uh oh!

DeepFilterNet Public

Uh oh!

Supercodec Public

Uh oh!

open_clip Public

Uh oh!

AudioLDM2 Public

Uh oh!

LibriTTS-P Public

Uh oh!

mamba Public

Uh oh!

lina-speech Public

Uh oh!