Kouroupetroglou, 2013 - Google Patents
Incorporating typographic, logical and layout knowledge of documents into text-to-speechKouroupetroglou, 2013
View PDF- Document ID
- 8050326278621151533
- Author
- Kouroupetroglou G
- Publication year
- Publication venue
- Assistive Technology: from Research to Practice
External Links
Snippet
Abstract Although Text-to-Speech (TtS) is considered a mature technology capable to produce synthetic speech of very high quality, current TtS systems do not include effective acoustic provision of the semantics and the cognitive aspects of the visual (such as the …
- 230000000007 visual effect 0 abstract description 10
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Taking into account non-speech caracteristics
- G10L2015/228—Taking into account non-speech caracteristics of application context
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Weisser | Practical corpus linguistics: An introduction to corpus-based language analysis | |
Waller | Graphic literacies for a digital age: The survival of layout | |
US20080077870A1 (en) | Method and apparatus for producing structured sgml/xml student compositions | |
Macovski | Dialogue and Critical Discourse: Language, Culture, Critical Theory | |
Tono et al. | A frequency dictionary of Japanese | |
Baker et al. | Corpus linguistics and South Asian languages: Corpus creation and tool development | |
Kukulska-Hulme | Language and communication: Essential concepts for user interface and documentation design | |
Waller | Graphic literacies for a digital age | |
Karan | Writing system development and reform: A process | |
Condorelli et al. | The Cambridge Handbook of Historical Orthography | |
Tsonos et al. | Prosodic mapping of text font based on the dimensional theory of emotions: a case study on style and size | |
Kouroupetroglou et al. | Multimodal accessibility of documents | |
Kouroupetroglou | Incorporating typographic, logical and layout knowledge of documents into text-to-speech | |
Kouroupetroglou | Text signals and accessibility of educational documents | |
Zeisler | Why Ladakhi must not be written-Being part of the Great Tradition: another kind of global thinking | |
Farag | Conversation-analytic transcription of Arabic-German talk-in-interaction | |
Rivera | A Stylistic Study on the Selected Poems of Rupi Kaur‟ s “Milk and Honey” | |
Pae | Written languages, East-Asian scripts, and cross-linguistic influences | |
Yamaguchi et al. | Accessible authoring tool for DAISY ranging from mathematics to others | |
Kouroupetroglou | Acoustic mapping of visual text signals through advanced text-to-speech: the case of font size | |
Wyatt et al. | Type matters: The rhetoricity of letterforms | |
Kouroupetroglou et al. | Rendering web-content text signals through advanced Text-to-Speech | |
Spitzmüller | Schematizing information: the macrotypographic framing of text | |
Kramer | Icono: a universal language that shows what it says | |
Kouroupetroglou et al. | DocEmoX: a system for the typography-derived emotional annotation of documents |