default search action
Torbjørn Svendsen
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j9]Femke B. Gelderblom, Tron V. Tronstad, Torbjørn Svendsen, Tor André Myrvoll:
On the Predictive Power of Objective Intelligibility Metrics for the Subjective Performance of Deep Complex Convolutional Recurrent Speech Enhancement Networks. IEEE ACM Trans. Audio Speech Lang. Process. 32: 215-226 (2024) - [c70]Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Tamás Grósz, Xinwei Cao, Torbjørn Svendsen, Giampiero Salvi:
Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages. LREC/COLING 2024: 3529-3537 - [c69]Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen:
Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. MLSP 2024: 1-6 - 2023
- [j8]Yaroslav Getman, Nhan Phan, Ragheb Al-Ghezi, Ekaterina Voskoboinik, Mittul Singh, Tamás Grósz, Mikko Kurimo, Giampiero Salvi, Torbjørn Svendsen, Sofia Strömbergsson, Anna-Riikka Smolander, Sari Ylinen:
Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children. IEEE Access 11: 86025-86037 (2023) - [c68]Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen:
Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. ICASSP 2023: 1-5 - [c67]Janine Rugayan, Giampiero Salvi, Torbjørn Svendsen:
Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation. INTERSPEECH 2023: 2158-2162 - [c66]Xinwei Cao, Zijian Fan, Torbjørn Svendsen, Giampiero Salvi:
An Analysis of Goodness of Pronunciation for Child Speech. INTERSPEECH 2023: 4613-4617 - [c65]Phoebe Parsons, Knut Kvale, Torbjørn Svendsen, Giampiero Salvi:
A character-based analysis of impacts of dialects on end-to-end Norwegian ASR. NoDaLiDa 2023: 467-476 - [c64]Per Erik Solberg, Pablo Ortiz, Phoebe Parsons, Torbjørn Svendsen, Giampiero Salvi:
Improving Generalization of Norwegian ASR with Limited Linguistic Resources. NoDaLiDa 2023: 508-517 - 2022
- [j7]Abdolreza Sabzi Shahrebabaki, Giampiero Salvi, Torbjørn Svendsen, Sabato Marco Siniscalchi:
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE ACM Trans. Audio Speech Lang. Process. 30: 135-147 (2022) - [c63]Janine Rugayan, Torbjørn Svendsen, Giampiero Salvi:
Semantically Meaningful Metrics for Norwegian ASR Systems. INTERSPEECH 2022: 2283-2287 - [c62]Yaroslav Getman, Ragheb Al-Ghezi, Katja Voskoboinik, Tamás Grósz, Mikko Kurimo, Giampiero Salvi, Torbjørn Svendsen, Sofia Strömbergsson:
wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. INTERSPEECH 2022: 3618-3622 - 2021
- [c61]Abdolreza Sabzi Shahrebabaki, Negar Olfati, Ali Shariq Imran, Magne Hallstein Johnsen, Sabato Marco Siniscalchi, Torbjørn Svendsen:
A Two-Stage Deep Modeling Approach to Articulatory Inversion. ICASSP 2021: 6453-6457 - [c60]Abdolreza Sabzi Shahrebabaki, Sabato Marco Siniscalchi, Torbjørn Svendsen:
Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation. Interspeech 2021: 1184-1188 - [c59]Abdolreza Sabzi Shahrebabaki, Sabato Marco Siniscalchi, Giampiero Salvi, Torbjørn Svendsen:
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion. ISCAS 2021: 1-5 - 2020
- [c58]Abdolreza Sabzi Shahrebabaki, Negar Olfati, Sabato Marco Siniscalchi, Giampiero Salvi, Torbjørn Svendsen:
Transfer Learning of Articulatory Information Through Phone Information. INTERSPEECH 2020: 2877-2881 - [c57]Abdolreza Sabzi Shahrebabaki, Sabato Marco Siniscalchi, Giampiero Salvi, Torbjørn Svendsen:
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals. INTERSPEECH 2020: 2882-2886
2010 – 2019
- 2019
- [j6]Abdolreza Sabzi Shahrebabaki, Ali Shariq Imran, Negar Olfati, Torbjørn Svendsen:
A Comparative Study of Deep Learning Techniques on Frame-Level Speech Data Classification. Circuits Syst. Signal Process. 38(8): 3501-3520 (2019) - [c56]Ali Shariq Imran, Zenun Kastrati, Torbjørn Karl Svendsen, Arianit Kurti:
Text-Independent Speaker ID for Automatic Video Lecture Classification Using Deep Learning. ICCAI 2019: 175-180 - [c55]Ali Shariq Imran, Abdolreza Sabzi Shahrebabaki, Negar Olfati, Torbjørn Svendsen:
A Study on the Performance Evaluation of Machine Learning Models for Phoneme Classification. ICMLC 2019: 52-58 - [c54]Ali Shariq Imran, Vetle Haflan, Abdolreza Sabzi Shahrebabaki, Negar Olfati, Torbjørn Karl Svendsen:
Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification. ICMLC 2019: 211-216 - [c53]Ali Shariq Imran, Zenun Kastrati, Torbjørn Karl Svendsen, Arianit Kurti:
Text-Independent Speaker ID Employing 2D-CNN for Automatic Video Lecture Categorization in a MOOC Setting. ICTAI 2019: 273-277 - [c52]Abdolreza Sabzi Shahrebabaki, Negar Olfati, Ali Shariq Imran, Sabato Marco Siniscalchi, Torbjørn Svendsen:
A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion. INTERSPEECH 2019: 3775-3779 - 2018
- [c51]Abdolreza Sabzi Shahrebabaki, Ali Shariq Imran, Negar Olfati, Torbjørn Svendsen:
Acoustic Feature Comparison for Different Speaking Rates. HCI (3) 2018: 176-189 - 2015
- [c50]Torbjørn Svendsen, Jarle Bauck Hamar:
Combining NDHMM and phonetic feature detection for speech recognition. EUSIPCO 2015: 1666-1670 - 2014
- [j5]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
An artificial neural network approach to automatic speech processing. Neurocomputing 140: 326-338 (2014) - 2013
- [j4]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Universal attribute characterization of spoken languages for automatic spoken language recognition. Comput. Speech Lang. 27(1): 209-227 (2013) - [j3]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 21(4): 786-797 (2013) - [c49]D. Rama Sanand, Torbjørn Svendsen:
Synthetic speaker models using VTLN to improve the performance of children in mismatched speaker conditions for ASR. INTERSPEECH 2013: 3361-3365 - [c48]Jarle Bauck Hamar, Doddipatla Rama Sanand, Torbjørn Svendsen, Thippur Sreenivas:
Non-negative durational HMM. MLSP 2013: 1-6 - 2012
- [j2]Sabato Marco Siniscalchi, Dau-Cheng Lyu, Torbjørn Svendsen, Chin-Hui Lee:
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data. IEEE Trans. Speech Audio Process. 20(3): 875-887 (2012) - 2011
- [c47]Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Alberto Abad, Oscar Koller, Isabel Trancoso, Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo, Rahim Saeidi, Mehdi Soufifar, Tomi Kinnunen, Torbjørn Svendsen, Pasi Fränti:
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation. ASRU 2011: 377-382 - [c46]Line Adde, Torbjørn Svendsen:
Pronunciation variation modeling of non-native proper names by discriminative tree search. ICASSP 2011: 4928-4931 - [c45]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines. INTERSPEECH 2011: 901-904 - [c44]Trond Skogstad, Torbjørn Svendsen:
Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients. INTERSPEECH 2011: 2505-2508 - [c43]Mehdi Soufifar, Marcel Kockmann, Lukás Burget, Oldrich Plchot, Ondrej Glembek, Torbjørn Svendsen:
iVector Approach to Phonotactic Language Recognition. INTERSPEECH 2011: 2913-2916 - 2010
- [c42]Dyre Meen, Torbjørn Svendsen:
The NTNU Concatenative Speech Synthesizer. Blizzard Challenge 2010 - [c41]Sabato Marco Siniscalchi, Torbjørn Svendsen, Filippo Sorbello, Chin-Hui Lee:
Experimental studies on continuous speech recognition using neural architectures with "adaptive" hidden activation functions. ICASSP 2010: 4882-4885 - [c40]Trond Skogstad, Torbjørn Svendsen:
Intra-frame variability as a predictor of frame classifiability. INTERSPEECH 2010: 1708-1711 - [c39]Line Adde, Bert Réveil, Jean-Pierre Martens, Torbjørn Svendsen:
A minimum classification error approach to pronunciation variation modeling of non-native proper names. INTERSPEECH 2010: 2282-2285 - [c38]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition. INTERSPEECH 2010: 2718-2721 - [c37]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A survey on recent progress in the ASAT/SIRKUS paradigm. ISCSLP 2010: 465-470 - [c36]Line Adde, Torbjørn Svendsen:
NameDat: A Database of English Proper Names Spoken by Native Norwegians. LREC 2010 - [c35]Rein Ove Sikveland, Anton Öttl, Ingunn Amdal, Mirjam Ernestus, Torbjørn Svendsen, Jens Edlund:
Spontal-N: A Corpus of Interactional Spoken Norwegian. LREC 2010 - [c34]Line Adde, Torbjørn Svendsen:
On the use of discriminative and non-discriminative pronunciation priors in pronunciation variation modeling of non-native proper names. SLT 2010: 229-234
2000 – 2009
- 2009
- [c33]Timo Mertens, Daniel Schneider, Arild Brandrud Næss, Torbjørn Svendsen:
Lexicon adaptation for subword speech recognition. ASRU 2009: 562-567 - [c32]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A phonetic feature based lattice rescoring approach to LVCSR. ICASSP 2009: 3865-3868 - [c31]Sabato Marco Siniscalchi, Jeremy Reed, Torbjørn Svendsen, Chin-Hui Lee:
Exploring universal attribute characterization of spoken languages for spoken language recognition. INTERSPEECH 2009: 168-171 - 2008
- [c30]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
Toward a detector-based universal phone recognizer. ICASSP 2008: 4261-4264 - [c29]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
A penalized logistic regression approach to detection based phone classification. INTERSPEECH 2008: 2390-2393 - [c28]Ingunn Amdal, Ole Morten Strand, Jørn Almberg, Torbjørn Svendsen:
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus. LREC 2008 - 2007
- [c27]Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee:
Towards bottom-up continuous phone recognition. ASRU 2007: 566-569 - 2006
- [c26]Ingunn Amdal, Torbjørn Svendsen:
FonDat1: A Speech Synthesis Corpus for Norwegian. LREC 2006: 2096-2101 - 2005
- [c25]Ingunn Amdal, Torbjørn Svendsen:
Unit selection synthesis database development using utterance verification. INTERSPEECH 2005: 2553-2556 - [c24]Ingmund Bjrkan, Torbjørn Svendsen, Snorre Farner:
Comparing spectral distance measures for join cost optimization in concatenative speech synthesis. INTERSPEECH 2005: 2577-2580 - [c23]Trond Skogstad, Torbjørn Svendsen:
Distributed ASR using speech coder data for efficient feature vector representation. INTERSPEECH 2005: 2861-2864 - 2003
- [c22]Terrence Martin, Torbjørn Svendsen, Sridha Sridharan:
Cross-lingual pronunciation modelling for indonesian speech recognition. INTERSPEECH 2003: 3125-3128 - [c21]Eddie Wong, Terrence Martin, Torbjørn Svendsen, Sridha Sridharan:
Multilingual phone clustering for recognition of spontaneous indonesian speech utilising pronunciation modelling techniques. INTERSPEECH 2003: 3133-3136 - 2002
- [c20]Ingunn Amdal, Torbjørn Svendsen:
Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles. LREC 2002 - 2001
- [c19]Tor André Myrvoll, Kuldip K. Paliwal, Torbjørn Svendsen:
Fast adaptation using constrained affine transformations with hierarchical priors. INTERSPEECH 2001: 1233-1236 - 2000
- [c18]Magne Hallstein Johnsen, Trym Holter, Torbjørn Svendsen, Erik Harborg:
Stochastic modeling of semantic content for use IN a spoken dialogue system. INTERSPEECH 2000: 218-221 - [c17]Trym Holter, Erik Harborg, Magne Hallstein Johnsen, Torbjørn Svendsen:
ASR-based subtitling of live TV-programs for the hearing impaired. INTERSPEECH 2000: 570-573 - [c16]Magne Hallstein Johnsen, Torbjørn Svendsen, Tore Amble, Trym Holter, Erik Harborg:
TABOR - a norwegian spoken dialogue system for bus travel information. INTERSPEECH 2000: 1049-1052
1990 – 1999
- 1999
- [j1]Trym Holter, Torbjørn Svendsen:
Maximum likelihood modelling of pronunciation variation. Speech Commun. 29(2-4): 177-191 (1999) - [c15]Erik Harborg, Trym Holter, Magne Hallstein Johnsen, Torbjørn Svendsen:
On-line captioning of TV-programs for the hearing impaired. EUROSPEECH 1999: 567-570 - 1997
- [c14]Trym Holter, Torbjørn Svendsen:
Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition. EUROSPEECH 1997: 1159-1162 - 1996
- [c13]Trym Holter, Torbjørn Svendsen:
Combined Optimisation of Baseforms and Subword Models for an Hmm Based Speech Recogniser. ISSPA 1996: 321-324 - 1995
- [c12]Torbjørn Svendsen, Frank K. Soong, Heiko Purnhagen:
Optimizing baseforms for HMM-based speech recognition. EUROSPEECH 1995: 783-787 - 1994
- [c11]Torbjørn Svendsen:
Segmental quantization of speech spectral information. ICASSP (1) 1994: 517-520 - 1993
- [c10]Anjan Basu, Torbjørn Svendsen:
A time-frequency segmental neural network for phoneme recognition. ICASSP (1) 1993: 509-512 - [c9]Torbjørn Svendsen:
Efficient quantization of speech spectral information. EUROSPEECH 1993: 1143-1146 - [c8]Andrea Paoloni, Torbjørn Svendsen, Bernhard Kaspar, Denis Johnston, Gunnar Hult:
Cost232: speech recognition over the telephone line. EUROSPEECH 1993: 1845-1848 - 1991
- [c7]P. O. Husoy, Torbjørn Svendsen:
ANN-based speech recognition using a preprocessor for non-linear time compression. EUROSPEECH 1991: 563-566 - 1990
- [c6]Torbjørn Svendsen, Knut Kvale:
Automatic alignment of phonemic labels with continuous speech. ICSLP 1990: 997-1000
1980 – 1989
- 1989
- [c5]Torbjørn Svendsen, Kuldip K. Paliwal, Erik Harborg, P. O. Husoy:
An improved sub-word based speech recognizer. ICASSP 1989: 108-111 - 1987
- [c4]Torbjørn Svendsen, Frank K. Soong:
On the automatic segmentation of speech signals. ICASSP 1987: 77-80 - 1986
- [c3]Torbjørn Svendsen:
Multi-dimensional quantization applied to predictive coding of speech. ICASSP 1986: 3063-3066 - 1985
- [c2]Kuldip K. Paliwal, Torbjørn Svendsen:
A study of three coders (sub-band, RELP and MPE) for speech with additive white noise. ICASSP 1985: 1688-1691 - 1984
- [c1]Torbjørn Svendsen:
Tree encoding of the LPC residual. ICASSP 1984: 424-427
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-30 01:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint