Highlights
- Pro
Starred repositories
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
CMU multilingual speech repository
"Deep Generative Modeling": Introductory Examples
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
2021/3/30 ~ 2021/7/12 に行われる企画「競プロ典型 90 問」の問題・解説・ソースコードなどの資料をアップロードしています。
Neural network-based singing voice synthesis library for research
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
Tools for handling speech data in machine learning projects.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.
Vue drag-and-drop component based on Sortable.js
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Best Practices, code samples, and documentation for Computer Vision.
A simple library for querying the URIEL typological database.
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages