Sundy1219 / Starred · GitHub
Python 1 Updated Dec 6, 2023

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,128 325 Updated Nov 14, 2023

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,026 570 Updated Oct 27, 2023

A simple, portable decoder

C++ 10 7 Updated Oct 25, 2018

Large, modern dataset for speech recognition

Shell 675 62 Updated Feb 26, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,909 1,912 Updated May 21, 2025

A CRF-based ASR Toolkit

Python 334 76 Updated Aug 13, 2024

E2E system with LF-MMI; word N-gram for Mandarin

Python 165 45 Updated Apr 29, 2022
Python 1,105 337 Updated May 21, 2025

Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.

Python 347 61 Updated Sep 28, 2022

A minimized Kaldi nnet3 chain decoder

C++ 45 19 Updated Jan 10, 2020

Command line utility for forced alignment using Kaldi

Python 1,477 254 Updated Mar 25, 2025

A python package for calculating the PESQ.

Python 381 71 Updated Apr 24, 2023

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

MATLAB 509 125 Updated Feb 17, 2022

A code library for training acoustic models

Python 26 15 Updated May 20, 2020

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ 6,427 1,012 Updated Nov 23, 2024

A repository for storing commonly used NLP models

Python 145 80 Updated May 19, 2020

Single Headed Attention RNN - "Stop thinking with your head"

Python 1,183 133 Updated Nov 27, 2021

Download and preparation tool for free speech corpora.

Python 16 1 Updated Apr 28, 2019

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

TypeScript 3,367 850 Updated May 21, 2025

Lingvo

Python 2,838 451 Updated May 8, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,448 6,531 Updated Jan 9, 2025

A Python module that converts written-form Chinese strings to spoken-form strings.

Python 120 34 Updated Oct 8, 2019

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

C++ 401 106 Updated Apr 8, 2020

Tools for Speech Enhancement integrated with Kaldi

Python 413 91 Updated Jul 6, 2023

Large Scale Chinese Corpus for NLP

9,711 1,558 Updated May 23, 2024

Custom decoders for Kaldi

C++ 79 26 Updated Jun 10, 2019

Research on acoustic models for speech recognition based on convolutional neural networks

Python 174 48 Updated Jul 22, 2019