Kouroupetroglou, 2013 - Google Patents

Incorporating typographic, logical and layout knowledge of documents into text-to-speech

Kouroupetroglou, 2013

Document ID: 8050326278621151533
Author: Kouroupetroglou G
Publication year: 2013
Publication venue: Assistive Technology: from Research to Practice

External Links

Cited by

Snippet

Abstract Although Text-to-Speech (TtS) is considered a mature technology capable to produce synthetic speech of very high quality, current TtS systems do not include effective acoustic provision of the semantics and the cognitive aspects of the visual (such as the …

Continue reading at www.researchgate.net (PDF) (other versions)

230000000007 visual effect 0 abstract description 10

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Taking into account non-speech caracteristics
- G10L2015/228—Taking into account non-speech caracteristics of application context
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch

Similar Documents

Publication	Publication Date	Title
Weisser	2015	Practical corpus linguistics: An introduction to corpus-based language analysis
Waller	2012	Graphic literacies for a digital age: The survival of layout
US20080077870A1 (en)	2008-03-27	Method and apparatus for producing structured sgml/xml student compositions
Macovski	1997	Dialogue and Critical Discourse: Language, Culture, Critical Theory
Tono et al.	2013	A frequency dictionary of Japanese
Baker et al.	2004	Corpus linguistics and South Asian languages: Corpus creation and tool development
Kukulska-Hulme	1999	Language and communication: Essential concepts for user interface and documentation design
Waller	2017	Graphic literacies for a digital age
Karan	2006	Writing system development and reform: A process
Condorelli et al.	2023	The Cambridge Handbook of Historical Orthography
Tsonos et al.	2016	Prosodic mapping of text font based on the dimensional theory of emotions: a case study on style and size
Kouroupetroglou et al.	2008	Multimodal accessibility of documents
Kouroupetroglou	2013	Incorporating typographic, logical and layout knowledge of documents into text-to-speech
Kouroupetroglou	2015	Text signals and accessibility of educational documents
Zeisler	2006	Why Ladakhi must not be written-Being part of the Great Tradition: another kind of global thinking
Farag	2019	Conversation-analytic transcription of Arabic-German talk-in-interaction
Rivera	2023	A Stylistic Study on the Selected Poems of Rupi Kaur‟ s “Milk and Honey”
Pae	2018	Written languages, East-Asian scripts, and cross-linguistic influences
Yamaguchi et al.	2012	Accessible authoring tool for DAISY ranging from mathematics to others
Kouroupetroglou	2015	Acoustic mapping of visual text signals through advanced text-to-speech: the case of font size
Wyatt et al.	2017	Type matters: The rhetoricity of letterforms
Kouroupetroglou et al.	2015	Rendering web-content text signals through advanced Text-to-Speech
Spitzmüller	2017	Schematizing information: the macrotypographic framing of text
Kramer	2023	Icono: a universal language that shows what it says
Kouroupetroglou et al.	2009	DocEmoX: a system for the typography-derived emotional annotation of documents