default search action
Giampiero Salvi
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c44]Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Tamás Grósz, Xinwei Cao, Torbjørn Svendsen, Giampiero Salvi:
Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages. LREC/COLING 2024: 3529-3537 - [c43]Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen:
Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. MLSP 2024: 1-6 - [i16]Giampiero Salvi:
Segment Boundary Detection via Class Entropy Measurements in Connectionist Phoneme Recognition. CoRR abs/2401.05717 (2024) - [i15]Giampiero Salvi:
Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints. CoRR abs/2401.06588 (2024) - [i14]Giampiero Salvi:
Developing Acoustic Models for Automatic Speech Recognition in Swedish. CoRR abs/2404.16547 (2024) - 2023
- [j14]Yaroslav Getman, Nhan Phan, Ragheb Al-Ghezi, Ekaterina Voskoboinik, Mittul Singh, Tamás Grósz, Mikko Kurimo, Giampiero Salvi, Torbjørn Svendsen, Sofia Strömbergsson, Anna-Riikka Smolander, Sari Ylinen:
Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children. IEEE Access 11: 86025-86037 (2023) - [j13]Mohammad Adiban, Sabato Marco Siniscalchi, Giampiero Salvi:
A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity. Neurocomputing 537: 296-308 (2023) - [j12]Jérôme Abdelnour, Jean Rouat, Giampiero Salvi:
NAAQA: A Neural Architecture for Acoustic Question Answering. IEEE Trans. Pattern Anal. Mach. Intell. 45(4): 4997-5009 (2023) - [c42]Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen:
Using Modified Adult Speech as Data Augmentation for Child Speech Recognition. ICASSP 2023: 1-5 - [c41]Janine Rugayan, Giampiero Salvi, Torbjørn Svendsen:
Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation. INTERSPEECH 2023: 2158-2162 - [c40]Xinwei Cao, Zijian Fan, Torbjørn Svendsen, Giampiero Salvi:
An Analysis of Goodness of Pronunciation for Child Speech. INTERSPEECH 2023: 4613-4617 - [c39]Phoebe Parsons, Knut Kvale, Torbjørn Svendsen, Giampiero Salvi:
A character-based analysis of impacts of dialects on end-to-end Norwegian ASR. NoDaLiDa 2023: 467-476 - [c38]Per Erik Solberg, Pablo Ortiz, Phoebe Parsons, Torbjørn Svendsen, Giampiero Salvi:
Improving Generalization of Norwegian ASR with Limited Linguistic Resources. NoDaLiDa 2023: 508-517 - [i13]Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. CoRR abs/2307.06701 (2023) - 2022
- [j11]Abdolreza Sabzi Shahrebabaki, Giampiero Salvi, Torbjørn Svendsen, Sabato Marco Siniscalchi:
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models. IEEE ACM Trans. Audio Speech Lang. Process. 30: 135-147 (2022) - [c37]Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation. BMVC 2022: 636 - [c36]Janine Rugayan, Torbjørn Svendsen, Giampiero Salvi:
Semantically Meaningful Metrics for Norwegian ASR Systems. INTERSPEECH 2022: 2283-2287 - [c35]Yaroslav Getman, Ragheb Al-Ghezi, Katja Voskoboinik, Tamás Grósz, Mikko Kurimo, Giampiero Salvi, Torbjørn Svendsen, Sofia Strömbergsson:
wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. INTERSPEECH 2022: 3618-3622 - [d1]Jérôme Abdelnour, Giampiero Salvi, Jean Rouat:
CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning. IEEE DataPort, 2022 - [i12]Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation. CoRR abs/2208.04554 (2022) - 2021
- [c34]Mohammad Adiban, Arash Safari, Giampiero Salvi:
STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security. ICASSP 2021: 2605-2609 - [c33]Abdolreza Sabzi Shahrebabaki, Sabato Marco Siniscalchi, Giampiero Salvi, Torbjørn Svendsen:
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion. ISCAS 2021: 1-5 - [i11]Jérôme Abdelnour, Jean Rouat, Giampiero Salvi:
NAAQA: A Neural Architecture for Acoustic Question Answering. CoRR abs/2106.06147 (2021) - 2020
- [j10]Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi:
Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions. IEEE Trans. Cogn. Dev. Syst. 12(2): 209-221 (2020) - [j9]Kalin Stefanov, Jonas Beskow, Giampiero Salvi:
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition. IEEE Trans. Cogn. Dev. Syst. 12(2): 250-259 (2020) - [c32]Kalin Stefanov, Mohammad Adiban, Giampiero Salvi:
Spatial Bias in Vision-Based Voice Activity Detection. ICPR 2020: 10433-10440 - [c31]Abdolreza Sabzi Shahrebabaki, Negar Olfati, Sabato Marco Siniscalchi, Giampiero Salvi, Torbjørn Svendsen:
Transfer Learning of Articulatory Information Through Phone Information. INTERSPEECH 2020: 2877-2881 - [c30]Abdolreza Sabzi Shahrebabaki, Sabato Marco Siniscalchi, Giampiero Salvi, Torbjørn Svendsen:
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals. INTERSPEECH 2020: 2882-2886 - [i10]Mohammad Adiban, Arash Safari, Giampiero Salvi:
STEP-GAN: A Step-by-Step Training for Multi Generator GANs with application to Cyber Security in Power Systems. CoRR abs/2009.05184 (2020)
2010 – 2019
- 2019
- [j8]Andreas Selamtzis, Antonella Castellana, Giampiero Salvi, Alessio Carullo, Arianna Astolfi:
Effect of vowel context in cepstral and entropy analysis of pathological voices. Biomed. Signal Process. Control. 47: 350-357 (2019) - [j7]Kalin Stefanov, Giampiero Salvi, Dimosthenis Kontogiorgos, Hedvig Kjellström, Jonas Beskow:
Modeling of Human Visual Attention in Multiparty Open-World Dialogues. ACM Trans. Hum. Robot Interact. 8(2): 8:1-8:21 (2019) - [c29]Cheng Zhang, Cengiz Öztireli, Stephan Mandt, Giampiero Salvi:
Active Mini-Batch Sampling Using Repulsive Point Processes. AAAI 2019: 5741-5748 - [i9]Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi:
Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions. CoRR abs/1902.09705 (2019) - [i8]Jérôme Abdelnour, Giampiero Salvi, Jean Rouat:
From Visual to Acoustic Question Answering. CoRR abs/1902.11280 (2019) - 2018
- [i7]Cheng Zhang, Cengiz Öztireli, Stephan Mandt, Giampiero Salvi:
Active Mini-Batch Sampling using Repulsive Point Processes. CoRR abs/1804.02772 (2018) - [i6]Jérôme Abdelnour, Giampiero Salvi, Jean Rouat:
CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning. CoRR abs/1811.10561 (2018) - 2017
- [c28]Antonella Castellana, Andreas Selamtzis, Giampiero Salvi, Alessio Carullo, Arianna Astolfi:
Cepstral and Entropy Analyses in Vowels Excerpted from Continuous Speech of Dysphonic and Control Speakers. INTERSPEECH 2017: 1814-1818 - [i5]Kalin Stefanov, Jonas Beskow, Giampiero Salvi:
Self-Supervised Vision-Based Detection of the Active Speaker as a Prerequisite for Socially-Aware Language Acquisition. CoRR abs/1711.08992 (2017) - [i4]Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi:
Interactive Robot Learning of Gestures, Language and Affordances. CoRR abs/1711.09055 (2017) - [i3]Giampiero Salvi, Luis Montesano, Alexandre Bernardino, José Santos-Victor:
Language Bootstrapping: Learning Word Meanings From Perception-Action Association. CoRR abs/1711.09714 (2017) - 2016
- [p1]Giampiero Salvi:
An Analysis of Shallow and Deep Representations of Speech Based on Unsupervised Classification of Isolated Words. Recent Advances in Nonlinear Speech Processing 2016: 151-157 - [i2]Akash Kumar Dhaka, Giampiero Salvi:
Optimising The Input Window Alignment in CD-DNN Based Phoneme Recognition for Low Latency Processing. CoRR abs/1606.09163 (2016) - [i1]Akash Kumar Dhaka, Giampiero Salvi:
Semi-supervised Learning with Sparse Autoencoders in Phone Classification. CoRR abs/1610.00520 (2016) - 2015
- [c27]José Lopes, Giampiero Salvi, Gabriel Skantze, Alberto Abad, Joakim Gustafson, Fernando Batista, Raveesh Meena, Isabel Trancoso:
Detecting repetitions in spoken dialogue systems using phonetic distances. INTERSPEECH 2015: 1805-1809 - 2014
- [c26]Niklas Vanhainen, Giampiero Salvi:
Pattern discovery in continuous speech using Block Diagonal Infinite HMM. ICASSP 2014: 3719-3723 - [c25]Alessandro Pieropan, Giampiero Salvi, Karl Pauwels, Hedvig Kjellström:
Audio-visual classification and detection of human manipulation actions. IROS 2014: 3045-3052 - [c24]Niklas Vanhainen, Giampiero Salvi:
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish. LREC 2014: 388-392 - [c23]Giampiero Salvi, Niklas Vanhainen:
The WaveSurfer Automatic Speech Recognition Plugin. LREC 2014: 3067-3071 - 2013
- [j6]Daniel Neiberg, Giampiero Salvi, Joakim Gustafson:
Semi-supervised methods for exploring the acoustics of simple productive feedback. Speech Commun. 55(3): 451-469 (2013) - [j5]Christos Koniaris, Giampiero Salvi, Olov Engwall:
On mispronunciation analysis of individual foreign speakers using auditory periphery models. Speech Commun. 55(5): 691-706 (2013) - [c22]Giovanni Saponaro, Giampiero Salvi, Alexandre Bernardino:
Robot anticipation of human intentions through continuous gesture recognition. CTS 2013: 218-225 - [c21]Catharine Oertel, Giampiero Salvi:
A gaze-based method for relating group involvement to individual engagement in multimodal multiparty dialogue. ICMI 2013: 99-106 - 2012
- [j4]Giampiero Salvi, Luis Montesano, Alexandre Bernardino, José Santos-Victor:
Language Bootstrapping: Learning Word Meanings From Perception-Action Association. IEEE Trans. Syst. Man Cybern. Part B 42(3): 660-671 (2012) - [c20]Giampiero Salvi:
Biologically Inspired Methods for Automatic Speech Understanding. BICA 2012: 283-286 - [c19]Niklas Vanhainen, Giampiero Salvi:
Word Discovery with Beta Process Factor Analysis. INTERSPEECH 2012: 799-802 - [c18]Christos Koniaris, Olov Engwall, Giampiero Salvi:
Auditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations. INTERSPEECH 2012: 899-902 - 2011
- [c17]Gopal Ananthakrishnan, Giampiero Salvi:
Using Imitation to Learn Infant-Adult Acoustic Mappings. INTERSPEECH 2011: 765-768 - [e1]Giampiero Salvi, Jonas Beskow, Olov Engwall, Samer Al Moubayed:
Auditory-Visual Speech Processing, AVSP 2011, Volterra, Italy, September 1-2, 2011. ISCA 2011 [contents] - 2010
- [c16]Giampiero Salvi, Fabio Tesser, Enrico Zovato, Piero Cosi:
Cluster analysis of differential spectral envelopes on emotional speech. INTERSPEECH 2010: 322-325
2000 – 2009
- 2009
- [j3]Giampiero Salvi, Jonas Beskow, Samer Al Moubayed, Björn Granström:
SynFace - Speech-Driven Facial Animation for Virtual Speech-Reading Support. EURASIP J. Audio Speech Music. Process. 2009 (2009) - [c15]Jonas Beskow, Giampiero Salvi, Samer Al Moubayed:
Synface - verbal and non-verbal face animation from audio. AVSP 2009: 169 - [c14]Verica Krunic, Giampiero Salvi, Alexandre Bernardino, Luis Montesano, José Santos-Victor:
Affordance based word-to-meaning association. ICRA 2009: 4138-4143 - [c13]Samer Al Moubayed, Jonas Beskow, Anne-Marie Öster, Giampiero Salvi, Björn Granström, Nic van Son, Ellen Ormel:
Virtual speech reading support for hard of hearing in a domestic multi-media setting. INTERSPEECH 2009: 1443-1446 - 2008
- [c12]Jonas Beskow, Björn Granström, Peter Nordqvist, Samer Al Moubayed, Giampiero Salvi, Tobias Herzke, Arne Schulz:
Hearing at home - communication support in home environments for hearing impaired persons. INTERSPEECH 2008: 2203-2206 - 2006
- [b1]Giampiero Salvi:
Mining Speech Sounds: Machine Learning Methods for Automatic Speech Recognition and Analysis. KTH Royal Institute of Technology, Sweden, 2006 - [j2]Giampiero Salvi:
Dynamic behaviour of connectionist speech recognition with strong latency constraints. Speech Commun. 48(7): 802-818 (2006) - [j1]Giampiero Salvi:
Segment boundary detection via class entropy measurements in connectionist phoneme recognition. Speech Commun. 48(12): 1666-1676 (2006) - [c11]Eva Agelfors, Jonas Beskow, Inger Karlsson, Jo Kewley, Giampiero Salvi, Neil Thomas:
User Evaluation of the SYNFACE Talking Head Telephone. ICCHP 2006: 579-586 - 2005
- [c10]Giampiero Salvi:
Ecological language acquisition via incremental model-based clustering. INTERSPEECH 2005: 1181-1184 - [c9]Giampiero Salvi:
Advances in regional accent clustering in Swedish. INTERSPEECH 2005: 2841-2844 - [c8]Giampiero Salvi:
Segment Boundaries in Low Latency Phonetic Recognition. NOLISP 2005: 267-276 - 2004
- [c7]Jonas Beskow, Inger Karlsson, Jo Kewley, Giampiero Salvi:
SYNFACE - A Talking Head Telephone for the Hearing-Impaired. ICCHP 2004: 1178-1185 - 2003
- [c6]Inger Karlsson, Andrew Faulkner, Giampiero Salvi:
SYNFACE - a talking face telephone. INTERSPEECH 2003: 1297-1300 - [c5]Giampiero Salvi:
Using accent information in ASR models for Swedish. INTERSPEECH 2003: 2677-2680 - [c4]Giampiero Salvi:
Truncation error and dynamics in very low latency phonetic recognition. NOLISP 2003: 14 - 2000
- [c3]Børge Lindberg, Finn Tore Johansen, Narada D. Warakagoda, Gunnar Lehtinen, Zdravko Kacic, Andrej Zgank, Kjell Elenius, Giampiero Salvi:
A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II). INTERSPEECH 2000: 370-373 - [c2]Finn Tore Johansen, Narada D. Warakagoda, Børge Lindberg, Gunnar Lehtinen, Zdravko Kacic, Andrej Zgank, Kjell Elenius, Giampiero Salvi:
The COST 249 SpeechDat Multilingual Reference Recogniser. LREC 2000
1990 – 1999
- 1999
- [c1]Eva Agelfors, Jonas Beskow, Björn Granström, Magnus Lundeberg, Giampiero Salvi, Karl-Erik Spens, Tobias Öhman:
Synthetic visual speech driven from auditory speech. AVSP 1999: 21
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-30 01:12 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint