Stars
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Large, modern dataset for speech recognition
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
E2E system with LF-MMI; word N-gram for Mandarin
Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.
Command line utility for forced alignment using Kaldi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
A code library for training acoustic models.
Facebook AI Research's Automatic Speech Recognition Toolkit
Single Headed Attention RNN - "Stop thinking with your head"
Download and preparation tool for free speech corpora.
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A Python module that converts written Chinese strings into their spoken (read-aloud) form.
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.
Large Scale Chinese Corpus for NLP