[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
Updated Nov 1, 2024 - Python
SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR system that captures human expressions including emotive sounds and non-verbal cues.