default search action
Oscar Saz-Torralba
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2010 – 2019
- 2019
- [j11]Salil Deena, Madina Hasan, Mortaza Doulaty, Oscar Saz, Thomas Hain:
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 572-582 (2019) - 2018
- [j10]Oscar Saz, Salil Deena, Mortaza Doulaty, Madina Hasan, Bilal Khaliq, Rosanna Milner, Raymond W. M. Ng, Julia Olcoz, Thomas Hain:
Lightly supervised alignment of subtitles on multi-genre broadcasts. Multim. Tools Appl. 77(23): 30533-30550 (2018) - 2017
- [j9]Oscar Saz, Thomas Hain:
Acoustic adaptation to dynamic background conditions with asynchronous transformations. Comput. Speech Lang. 41: 180-194 (2017) - [c38]Erfan Loweimi, Jon Barker, Oscar Saz-Torralba, Thomas Hain:
Robust Source-Filter Separation of Speech Signal in the Phase Domain. INTERSPEECH 2017: 414-418 - [c37]Chenhao Wu, Raymond W. M. Ng, Oscar Saz-Torralba, Thomas Hain:
Analysing acoustic model changes for active learning in automatic speech recognition. IWSSIP 2017: 1-5 - 2016
- [c36]Thomas Hain, Jeremy Christian, Oscar Saz, Salil Deena, Madina Hasan, Raymond W. M. Ng, Rosanna Milner, Mortaza Doulaty, Yulan Liu:
webASR 2 - Improved Cloud Based Speech Technology. INTERSPEECH 2016: 1613-1617 - [c35]Julia Olcoz, Oscar Saz, Thomas Hain:
Error Correction in Lightly Supervised Alignment of Broadcast Subtitles. INTERSPEECH 2016: 2110-2114 - [c34]Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Automatic Genre and Show Identification of Broadcast Media. INTERSPEECH 2016: 2115-2119 - [c33]Salil Deena, Madina Hasan, Mortaza Doulaty, Oscar Saz, Thomas Hain:
Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition. INTERSPEECH 2016: 2343-2347 - [c32]Raymond W. M. Ng, Mauro Nicolao, Oscar Saz, Madina Hasan, Bhusan Chettri, Mortaza Doulaty, Tan Lee, Thomas Hain:
The Sheffield language recognition system in NIST LRE 2015. Odyssey 2016: 181-187 - [i7]Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Automatic Genre and Show Identification of Broadcast Media. CoRR abs/1606.03333 (2016) - 2015
- [j8]Oscar Saz-Torralba, Yibin Lin, Maxine Eskénazi:
Measuring the impact of translation on the accuracy and fluency of vocabulary acquisition of English. Comput. Speech Lang. 31(1): 49-64 (2015) - [c31]Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation. ASRU 2015: 130-136 - [c30]Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yulan Liu, Thomas Hain:
The 2015 sheffield system for transcription of Multi-Genre Broadcast media. ASRU 2015: 624-631 - [c29]Rosanna Milner, Oscar Saz, Salil Deena, Mortaza Doulaty, Raymond W. M. Ng, Thomas Hain:
The 2015 sheffield system for longitudinal diarisation of broadcast media. ASRU 2015: 632-638 - [c28]Peter Bell, Mark J. F. Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, Philip C. Woodland:
The MGB challenge: Evaluating multi-genre broadcast media recognition. ASRU 2015: 687-693 - [c27]Mortaza Doulaty, Oscar Saz, Thomas Hain:
Data-selective transfer learning for multi-domain speech recognition. INTERSPEECH 2015: 2897-2901 - [c26]Mortaza Doulaty, Oscar Saz, Thomas Hain:
Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition. INTERSPEECH 2015: 3640-3644 - [i6]Mortaza Doulaty, Oscar Saz, Thomas Hain:
Data-selective Transfer Learning for Multi-Domain Speech Recognition. CoRR abs/1509.02409 (2015) - [i5]Mortaza Doulaty, Oscar Saz, Thomas Hain:
Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition. CoRR abs/1509.02412 (2015) - [i4]Raymond W. M. Ng, Mortaza Doulaty, Rama Doddipatla, Wilker Aziz, Kashif Shah, Oscar Saz, Madina Hasan, Ghada AlHarbi, Lucia Specia, Thomas Hain:
The USFD Spoken Language Translation System for IWSLT 2014. CoRR abs/1509.03870 (2015) - [i3]Oscar Saz, Mortaza Doulaty, Thomas Hain:
Background-tracking Acoustic Features for Genre Identification of Broadcast Shows. CoRR abs/1509.04934 (2015) - [i2]Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation. CoRR abs/1511.05076 (2015) - [i1]Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yulan Liu, Thomas Hain:
The 2015 Sheffield System for Transcription of Multi-Genre Broadcast Media. CoRR abs/1512.06643 (2015) - 2014
- [c25]Oscar Saz, Thomas Hain:
Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. ICASSP 2014: 6314-6318 - [c24]Raymond W. M. Ng, Mortaza Doulaty, Rama Doddipatla, Wilker Aziz, Kashif Shah, Oscar Saz, Madina Hasan, Ghada AlHaribi, Lucia Specia, Thomas Hain:
The USFD SLT system for IWSLT 2014. IWSLT (Evaluation Campaign) 2014 - [c23]Oscar Saz, Mortaza Doulaty, Thomas Hain:
Background-tracking acoustic features for genre identification of broadcast shows. SLT 2014: 118-123 - 2013
- [c22]Pierre Lanchantin, Peter Bell, Mark J. F. Gales, Thomas Hain, Xunying Liu, Yanhua Long, Jennifer Quinnell, Steve Renals, Oscar Saz, Matthew Stephen Seigel, Pawel Swietojanski, Philip C. Woodland:
Automatic Transcription of Multi-genre Media Archives. SLAM@INTERSPEECH 2013: 26-31 - [c21]Oscar Saz, Thomas Hain:
Asynchronous factorisation of speaker and background with feature transforms in speech recognition. INTERSPEECH 2013: 1238-1242 - [c20]Elizabeth M. Davis, Oscar Saz, Maxine Eskénazi:
POLLI: a handheld-based aid for non-native student presentations. SLaTE 2013: 43-47 - 2012
- [j7]William Ricardo Rodríguez, Oscar Saz, Eduardo Lleida:
A prelingual tool for the education of altered voices. Speech Commun. 54(5): 583-600 (2012) - [c19]Oscar Saz, Maxine Eskénazi:
Addressing Confusions in Spoken Language in ESL Pronunciation Tutors. INTERSPEECH 2012: 771-774 - 2011
- [j6]Oscar Saz-Torralba, William Ricardo Rodríguez-Dueñas, Eduardo Lleida-Solano:
Development of Voice-Based Tools for Accessibility to Computer Services. Computación y Sistemas 15(1) (2011) - [c18]Oscar Saz, Maxine Eskénazi:
Identifying confusable contexts for automatic generation of activities in second language pronunciation training. SLaTE 2011: 121-124 - 2010
- [j5]Luis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega, Eduardo Lleida:
Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 18(2): 296-309 (2010) - [c17]Oscar Saz, Eduardo Lleida, Carlos Vaquero, William Ricardo Rodríguez:
The Alborada-I3A Corpus of Disordered Speech. LREC 2010
2000 – 2009
- 2009
- [j4]Oscar Saz, Javier Simón, William Ricardo Rodríguez, Eduardo Lleida, Carlos Vaquero:
Analysis of Acoustic Features in Speakers with Cognitive Disorders and Speech Impairments. EURASIP J. Adv. Signal Process. 2009 (2009) - [j3]Oscar Saz, Shou-Chun Yin, Eduardo Lleida, Richard C. Rose, Carlos Vaquero, William Ricardo Rodríguez:
Tools and Technologies for Computer-Aided Speech and Language Therapy. Speech Commun. 51(10): 948-967 (2009) - [c16]Shou-Chun Yin, Richard C. Rose, Oscar Saz, Eduardo Lleida:
A study of pronunciation verification in a speech therapy application. ICASSP 2009: 4609-4612 - [c15]Oscar Saz, Eduardo Lleida, Antonio Miguel:
Combination of acoustic and lexical speaker adaptation for disordered speech recognition. INTERSPEECH 2009: 544-547 - [c14]Oscar Saz, Victor Rodriguez, Eduardo Lleida, William Ricardo Rodríguez, Carlos Vaquero:
An experience with a Spanish second language learning tool in a multilingual environment. SLaTE 2009: 93-96 - [c13]Oscar Saz, William Ricardo Rodríguez, Eduardo Lleida, Carlos Vaquero:
COMUNICA: multilevel tools for Spanish CALL. SLaTE 2009 - [c12]Oscar Saz, Eduardo Lleida, William Ricardo Rodríguez-Dueñas:
Avoiding speaker variability in pronunciation verification of children' disordered speech. WOCCI 2009: 31-35 - 2008
- [j2]Antonio Miguel, Eduardo Lleida, Richard C. Rose, Luis Buera, Oscar Saz, Alfonso Ortega:
Capturing Local Variability for Speaker Normalization in Speech Recognition. IEEE Trans. Speech Audio Process. 16(3): 578-593 (2008) - [c11]Raquel Justo, Oscar Saz, Víctor G. Guijarrubia, Antonio Miguel, M. Inés Torres, Eduardo Lleida:
Improving dialogue systems in a home automation environment. AMBI-SYS 2008: 2 - [c10]Carlos Vaquero, Oscar Saz, Eduardo Lleida, William Ricardo Rodríguez:
E-inclusion technologies for the speech handicapped. ICASSP 2008: 4509-4512 - [c9]Luis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega, Eduardo Lleida:
Feature vector normalization with combined standard and throat microphones for robust ASR. INTERSPEECH 2008: 1289-1292 - [c8]Shou-Chun Yin, Richard C. Rose, Oscar Saz, Eduardo Lleida:
Verifying pronunciation accuracy from speakers with neuromuscular disorders. INTERSPEECH 2008: 2218-2221 - [c7]William Ricardo Rodríguez, Oscar Saz, Eduardo Lleida, Carlos Vaquero, Antonio Escartín:
COMUNICA - tools for speech and language therapy. WOCCI 2008: 12 - [c6]Oscar Saz, William Ricardo Rodríguez, Eduardo Lleida, Carlos Vaquero:
A novel corpus of children2s disordered speech. WOCCI 2008: 13 - 2007
- [j1]Luis Buera, Eduardo Lleida, Antonio Miguel, Alfonso Ortega, Oscar Saz:
Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 15(3): 1098-1113 (2007) - [c5]Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega:
Robust speech recognition with on-line unsupervised acoustic feature compensation. ASRU 2007: 105-110 - [c4]Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega:
On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition. INTERSPEECH 2007: 1046-1049 - [c3]Luis Buera, Antonio Miguel, Oscar Saz, Eduardo Lleida, Alfonso Ortega:
Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of hiwire project database. INTERSPEECH 2007: 2437-2440 - 2006
- [c2]Antonio Miguel, Eduardo Lleida, Alfons Juan, Luis Buera, Alfonso Ortega, Oscar Saz:
Local transformation models for speech recognition. INTERSPEECH 2006 - [c1]Oscar Saz, Antonio Miguel, Eduardo Lleida, Alfonso Ortega, Luis Buera:
Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition. INTERSPEECH 2006
Coauthor Index
aka: Eduardo Lleida-Solano
aka: William Ricardo Rodríguez
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-03 20:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint