Hadj Ali et al., 2020 - Google Patents

DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis

Hadj Ali et al., 2020

Document ID: 15675310687252653261
Author: Hadj Ali I; Mnasri Z; Lachiri Z
Publication year: 2020
Publication venue: International Journal of Speech Technology

External Links

Cited by

Snippet

Arabic text-to-speech synthesis from non-diacritized text is still a big challenge, because of unique Arabic language rules and characteristics. Indeed, the diacritic and gemination signs, which are special characters representing respectively short vowels and consonant …

Continue reading at www.academia.edu (PDF) (other versions)

230000015572 biosynthetic process 0 title abstract description 31

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G06F17/2881—Natural language generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G06F17/2217—Character encodings
- G06F17/2223—Handling non-latin characters, e.g. kana-to-kanji conversion
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2863—Processing of non-latin text
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information

Similar Documents

Publication	Publication Date	Title
Littell et al.	2018	Indigenous language technologies in Canada: Assessment, challenges, and successes
El-Imam	2004	Phonetization of Arabic: rules and algorithms
Hadj Ali et al.	2020	DNN-based grapheme-to-phoneme conversion for Arabic text-to-speech synthesis
Hasegawa-Johnson et al.	2020	Grapheme-to-phoneme transduction for cross-language ASR
Narasimhan et al.	2004	Schwa-deletion in Hindi text-to-speech synthesis
Khomitsevich et al.	2015	A bilingual Kazakh-Russian system for automatic speech recognition and synthesis
Hajj et al.	2022	Comparing nlp solutions for the disambiguation of french heterophonic homographs for end-to-end tts systems
Lőrincz et al.	2023	RoLEX: The development of an extended Romanian lexical dataset and its evaluation at predicting concurrent lexical information
Kłosowski	2017	Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling
Rajendran et al.	2019	A robust syllable centric pronunciation model for Tamil text to speech synthesizer
Singh et al.	2024	MECOS: A bilingual Manipuri–English spontaneous code-switching speech corpus for automatic speech recognition
Park et al.	2019	Jejueo datasets for machine translation and speech synthesis
Yakovenko et al.	2018	Algorithms for automatic accentuation and transcription of russian texts in speech recognition systems
Cherifi et al.	2021	Arabic grapheme-to-phoneme conversion based on joint multi-gram model
Ali et al.	2019	Gemination prediction using DNN for Arabic text-to-speech synthesis
Wasala et al.	2006	Sinhala grapheme-to-phoneme conversion and rules for schwa epenthesis
Ibrahim et al.	2015	The problems, issues and future challenges of automatic speech recognition for quranic verse recitation: A review
Jariwala et al.	2018	A system for the conversion of digital Gujarati text-to-speech for visually impaired people
Samuel Manoharan	2022	A novel text-to-speech synthesis system using syllable-based HMM for Tamil language
Asahiah	2021	Comparison of rule-based and data-driven approaches for syllabification of simple syllable languages and the effect of orthography
Nair et al.	2021	English to Indian Language and Back Transliteration with Phonetic Transcription for Computational Linguistics Tools based on Conventional Transliteration Schemes
Jůzová et al.	2019	Using Auto-Encoder BiLSTM Neural Network for Czech Grapheme-to-Phoneme Conversion
Hendessi et al.	2005	A speech synthesizer for Persian text using a neural network with a smooth ergodic HMM
Babych	2016	Graphonological levenshtein edit distance: Application for automated cognate identification
Ghosh et al.	2023	Boosting Rule-Based Grapheme-to-Phoneme Conversion with Morphological Segmentation and Syllabification in Bengali