-
LIA, Avignon University
- France
- in/manh-tuan-nguyen-595898203
Lists (1)
Sort Name ascending (A-Z)
Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A PyTorch Library for Multi-Task Learning
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
Command line utility for forced alignment using Kaldi
The Munich Open-Source Large-Scale Multimedia Feature Extractor
Sources for my PhD dissertation on the Raft consensus algorithm
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Manipulate audio with a simple and easy high level interface
feature extraction from speech signals
[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Lime: Explaining the predictions of any machine learning classifier
Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: http://www.idris.fr/eng/jean-zay/
Simulation framework for accelerating research in Private Federated Learning
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Extensive acceptance rates and information of main AI conferences
Contains links to publicly available datasets for modeling health outcomes using speech and language.
Acceptance rates for the major AI conferences
Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning (ICASSP 2023)"