default search action
Mirco Ravanelli
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j9]Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli:
Speech self-supervised representations benchmarking: A case for larger probing heads. Comput. Speech Lang. 89: 101695 (2025) - [j8]Giuseppe Alessio D'Inverno, Simone Brugiapaglia, Mirco Ravanelli:
Generalization limits of Graph Neural Networks in identity effects learning. Neural Networks 181: 106793 (2025) - [j7]Davide Borra, Elisa Magosso, Mirco Ravanelli:
A protocol for trustworthy EEG decoding with neural networks. Neural Networks 182: 106847 (2025) - 2024
- [j6]Davide Borra, Francesco Paissan, Mirco Ravanelli:
SpeechBrain-MOABB: An open-source Python library for benchmarking deep neural networks applied to EEG signals. Comput. Biol. Medicine 182: 109097 (2024) - [j5]Luca Della Libera, Pooneh Mousavi, Salah Zaiem, Cem Subakan, Mirco Ravanelli:
CL-MASR: A Continual Learning Benchmark for Multilingual ASR. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4931-4944 (2024) - [c57]Davide Borra, Mirco Ravanelli:
Explaining Network Decision Provides Insights on the Causal Interaction Between Brain Regions in a Motor Imagery Task. ANNPR 2024: 156-167 - [c56]Davide Borra, Matteo Fraternali, Mirco Ravanelli, Elisa Magosso:
Multi-modal Decoding of Reach-to-Grasping from EEG and EMG via Neural Networks. ANNPR 2024: 168-179 - [c55]Shubham Gupta, Isaac Neri Gomez-Sarmiento, Faez Amjed Mezdari, Mirco Ravanelli, Cem Subakan:
Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming. ANNPR 2024: 269-280 - [c54]Salima Mdhaffar, Fethi Bougares, Renato de Mori, Salah Zaiem, Mirco Ravanelli, Yannick Estève:
TARIC-SLU: A Tunisian Benchmark Dataset for Spoken Language Understanding. LREC/COLING 2024: 15606-15616 - [c53]Firat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çagatay Yildiz:
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? EMNLP 2024: 19834-19843 - [c52]Luca Zampierin, Ghouthi Boukli Hacene, Bac Nguyen, Mirco Ravanelli:
Skill: Similarity-Aware Knowledge Distillation for Speech Self-Supervised Learning. ICASSP Workshops 2024: 675-679 - [c51]Luca Della Libera, Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin:
Resource-Efficient Separation Transformer. ICASSP 2024: 761-765 - [c50]Luca Della Libera, Cem Subakan, Mirco Ravanelli:
Focal Modulation Networks for Interpretable Sound Classification. ICASSP Workshops 2024: 853-857 - [c49]Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, Samuel Maddrell-Mander, Callum McLean, Frederik Wenkel, Luis Müller, Jama Hussein Mohamud, Ali Parviz, Michael Craig, Michal Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Christopher Morris, Mirco Ravanelli, Guy Wolf, Prudencio Tossou, Hadrien Mary, Therence Bois, Andrew W. Fitzgibbon, Blazej Banaszewski, Chad Martin, Dominic Masters:
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets. ICLR 2024 - [c48]Francesco Paissan, Mirco Ravanelli, Cem Subakan:
Listenable Maps for Audio Classifiers. ICML 2024 - [c47]Umberto Cappellazzo, Daniele Falavigna, Alessio Brutti, Mirco Ravanelli:
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers. MLSP 2024: 1-6 - [e2]Ching Yee Suen, Adam Krzyzak, Mirco Ravanelli, Edmondo Trentin, Cem Subakan, Nicola Nobile:
Artificial Neural Networks in Pattern Recognition - 11th IAPR TC3 Workshop, ANNPR 2024, Montreal, QC, Canada, October 10-12, 2024, Proceedings. Lecture Notes in Computer Science 15154, Springer 2024, ISBN 978-3-031-71601-0 [contents] - [i63]Seyed Mahed Mousavi, Gabriel Roccabruna, Simone Alghisi, Massimo Rizzoli, Mirco Ravanelli, Giuseppe Riccardi:
Are LLMs Robust for Spoken Dialogues? CoRR abs/2401.02297 (2024) - [i62]Luca Della Libera, Jacopo Andreoli, Davide Dalle Pezze, Mirco Ravanelli, Gian Antonio Susto:
Bayesian Deep Learning for Remaining Useful Life Estimation via Stein Variational Gradient Descent. CoRR abs/2402.01098 (2024) - [i61]Luca Della Libera, Cem Subakan, Mirco Ravanelli:
Focal Modulation Networks for Interpretable Sound Classification. CoRR abs/2402.02754 (2024) - [i60]Luca Zampierin, Ghouthi Boukli Hacene, Bac Nguyen, Mirco Ravanelli:
SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning. CoRR abs/2402.16830 (2024) - [i59]Francesco Paissan, Mirco Ravanelli, Cem Subakan:
Listenable Maps for Audio Classifiers. CoRR abs/2403.13086 (2024) - [i58]Francesco Paissan, Luca Della Libera, Mirco Ravanelli, Cem Subakan:
Listenable Maps for Zero-Shot Audio Classifiers. CoRR abs/2405.17615 (2024) - [i57]Shubham Gupta, Mirco Ravanelli, Pascal Germain, Cem Subakan:
Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice. CoRR abs/2406.10422 (2024) - [i56]Pooneh Mousavi, Jarod Duret, Salah Zaiem, Luca Della Libera, Artem Ploujnikov, Cem Subakan, Mirco Ravanelli:
How Should We Extract Discrete Audio Tokens from Self-Supervised Models? CoRR abs/2406.10735 (2024) - [i55]Pooneh Mousavi, Luca Della Libera, Jarod Duret, Artem Ploujnikov, Cem Subakan, Mirco Ravanelli:
DASB - Discrete Audio and Speech Benchmark. CoRR abs/2406.14294 (2024) - [i54]Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Pierre Champion, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu, Sangeet Sagar, Jarod Duret, Salima Mdhaffar, Gaëlle Laperrière, Mickael Rouvier, Renato De Mori, Yannick Estève:
Open-Source Conversational AI with SpeechBrain 1.0. CoRR abs/2407.00463 (2024) - [i53]Ada Defne Tur, Adel Moumen, Mirco Ravanelli:
ProGRes: Prompted Generative Rescoring on ASR n-Best. CoRR abs/2409.00217 (2024) - [i52]Eleonora Mancini, Francesco Paissan, Mirco Ravanelli, Cem Subakan:
LMAC-TD: Producing Time Domain Explanations for Audio Classifiers. CoRR abs/2409.08655 (2024) - [i51]Yingzhi Wang, Pooneh Mousavi, Artem Ploujnikov, Mirco Ravanelli:
What Are They Doing? Joint Audio-Speech Co-Reasoning. CoRR abs/2409.14526 (2024) - [i50]Shubham Gupta, Isaac Neri Gomez-Sarmiento, Faez Amjed Mezdari, Mirco Ravanelli, Cem Subakan:
Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming. CoRR abs/2410.05455 (2024) - [i49]Firat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çagatay Yildiz:
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? CoRR abs/2410.05581 (2024) - 2023
- [j4]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin, Mirko Bronzi:
Exploring Self-Attention Mechanisms for Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2169-2180 (2023) - [c46]Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao:
TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch. ASRU 2023: 1-9 - [c45]Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff-Korbayová, Josef van Genabith:
Rescuespeech: A German Corpus for Speech Recognition in Search and Rescue Domain. ASRU 2023: 1-7 - [c44]Yingzhi Wang, Mirco Ravanelli, Alya Yacoubi:
Speech Emotion Diarization: Which Emotion Appears When? ASRU 2023: 1-7 - [c43]AmirMohammad Sarfi, Zahra Karimpour, Muawiz Chaudhary, Nasir Mohammad Khalid, Mirco Ravanelli, Sudhir Mudur, Eugene Belilovsky:
Simulated Annealing in Early Layers Leads to Better Generalization. CVPR 2023: 20205-20214 - [c42]Salah Zaiem, Robin Algayres, Titouan Parcollet, Slim Essid, Mirco Ravanelli:
Fine-Tuning Strategies for Faster Inference Using Speech Self-Supervised Models: A Comparative Study. ICASSP Workshops 2023: 1-5 - [c41]Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli:
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right? INTERSPEECH 2023: 2873-2877 - [e1]Neamat El Gayar, Edmondo Trentin, Mirco Ravanelli, Hazem Abbas:
Artificial Neural Networks in Pattern Recognition - 10th IAPR TC3 Workshop, ANNPR 2022, Dubai, United Arab Emirates, November 24-26, 2022, Proceedings. Lecture Notes in Computer Science 13739, Springer 2023, ISBN 978-3-031-20649-8 [contents] - [i48]Salah Zaiem, Robin Algayres, Titouan Parcollet, Slim Essid, Mirco Ravanelli:
Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study. CoRR abs/2303.06740 (2023) - [i47]Cem Subakan, Francesco Paissan, Mirco Ravanelli:
Posthoc Interpretation via Quantization. CoRR abs/2303.12659 (2023) - [i46]AmirMohammad Sarfi, Zahra Karimpour, Muawiz Chaudhary, Nasir Mohammad Khalid, Mirco Ravanelli, Sudhir Mudur, Eugene Belilovsky:
Simulated Annealing in Early Layers Leads to Better Generalization. CoRR abs/2304.04858 (2023) - [i45]Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli:
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right? CoRR abs/2306.00452 (2023) - [i44]Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff-Korbayová, Josef van Genabith:
RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain. CoRR abs/2306.04054 (2023) - [i43]Yingzhi Wang, Mirco Ravanelli, Alaa Nfissi, Alya Yacoubi:
Speech Emotion Diarization: Which Emotion Appears When? CoRR abs/2306.12991 (2023) - [i42]Giuseppe Alessio D'Inverno, Simone Brugiapaglia, Mirco Ravanelli:
Generalization Limits of Graph Neural Networks in Identity Effects Learning. CoRR abs/2307.00134 (2023) - [i41]Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli:
Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads. CoRR abs/2308.14456 (2023) - [i40]Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, Samuel Maddrell-Mander, Callum McLean, Frederik Wenkel, Luis Müller, Jama Hussein Mohamud, Ali Parviz, Michael Craig, Michal Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Christopher Morris, Ioannis Koutis, Mirco Ravanelli, Guy Wolf, Prudencio Tossou, Hadrien Mary, Therence Bois, Andrew W. Fitzgibbon, Blazej Banaszewski, Chad Martin, Dominic Masters:
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets. CoRR abs/2310.04292 (2023) - [i39]Francesco Paissan, Zhepei Wang, Mirco Ravanelli, Paris Smaragdis, Cem Subakan:
Audio Editing with Non-Rigid Text Prompts. CoRR abs/2310.12858 (2023) - [i38]Luca Della Libera, Pooneh Mousavi, Salah Zaiem, Cem Subakan, Mirco Ravanelli:
CL-MASR: A Continual Learning Benchmark for Multilingual ASR. CoRR abs/2310.16931 (2023) - [i37]Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis:
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch. CoRR abs/2310.17864 (2023) - 2022
- [j3]Zhepei Wang, Cem Subakan, Xilin Jiang, Junkai Wu, Efthymios Tzinis, Mirco Ravanelli, Paris Smaragdis:
Learning Representations for New Sound Classes With Continual Self-Supervised Learning. IEEE Signal Process. Lett. 29: 2607-2611 (2022) - [c40]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin:
Real-M: Towards Speech Separation on Real Mixtures. ICASSP 2022: 6862-6866 - [c39]Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao:
MetricGAN-U: Unsupervised Speech Enhancement/ Dereverberation Based Only on Noisy/ Reverberated Speech. ICASSP 2022: 7412-7416 - [c38]Artem Ploujnikov, Mirco Ravanelli:
SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation. INTERSPEECH 2022: 486-490 - [c37]Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao, Mirco Ravanelli:
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. INTERSPEECH 2022: 981-985 - [i36]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin, Mirko Bronzi:
On Using Transformers for Speech-Separation. CoRR abs/2202.02884 (2022) - [i35]Zhepei Wang, Cem Subakan, Xilin Jiang, Junkai Wu, Efthymios Tzinis, Mirco Ravanelli, Paris Smaragdis:
Learning Representations for New Sound Classes With Continual Self-Supervised Learning. CoRR abs/2205.07390 (2022) - [i34]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin:
Resource-Efficient Separation Transformer. CoRR abs/2206.09507 (2022) - [i33]Artem Ploujnikov, Mirco Ravanelli:
SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation. CoRR abs/2207.13703 (2022) - 2021
- [c36]Juan Manuel Mayor Torres, Mirco Ravanelli, Sara E. Medina-DeVilliers, Matthew D. Lerner, Giuseppe Riccardi:
Interpretable SincNet-based Deep Learning for Emotion Recognition from EEG brain activity. EMBC 2021: 412-415 - [c35]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong:
Attention Is All You Need In Speech Separation. ICASSP 2021: 21-25 - [c34]Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao:
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement. Interspeech 2021: 201-205 - [c33]Nauman Dawalatabad, Mirco Ravanelli, François Grondin, Jenthe Thienpondt, Brecht Desplanques, Hwidong Na:
ECAPA-TDNN Embeddings for Speaker Diarization. Interspeech 2021: 3560-3564 - [c32]Titouan Parcollet, Mirco Ravanelli:
The Energy and Carbon Footprint of Training End-to-End Speech Recognizers. Interspeech 2021: 4583-4587 - [c31]Loren Lugosch, Piyush Papreja, Mirco Ravanelli, Abdelwahab Heba, Titouan Parcollet:
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers. NeurIPS Datasets and Benchmarks 2021 - [i32]Alex Lamb, Di He, Anirudh Goyal, Guolin Ke, Chien-Feng Liao, Mirco Ravanelli, Yoshua Bengio:
Transformers with Competitive Ensembles of Independent Mechanisms. CoRR abs/2103.00336 (2021) - [i31]Loren Lugosch, Piyush Papreja, Mirco Ravanelli, Abdelwahab Heba, Titouan Parcollet:
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers. CoRR abs/2104.01604 (2021) - [i30]Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao:
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement. CoRR abs/2104.03538 (2021) - [i29]Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio:
SpeechBrain: A General-Purpose Speech Toolkit. CoRR abs/2106.04624 (2021) - [i28]Juan Manuel Mayor Torres, Mirco Ravanelli, Sara E. Medina-DeVilliers, Matthew D. Lerner, Giuseppe Riccardi:
Interpretable SincNet-based Deep Learning for Emotion Recognition from EEG brain activity. CoRR abs/2107.10790 (2021) - [i27]Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao:
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech. CoRR abs/2110.05866 (2021) - [i26]Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin:
REAL-M: Towards Speech Separation on Real Mixtures. CoRR abs/2110.10812 (2021) - [i25]Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao, Mirco Ravanelli:
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. CoRR abs/2111.05703 (2021) - 2020
- [c30]Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, João Monteiro, Jan Trmal, Yoshua Bengio:
Multi-Task Self-Supervised Learning for Robust Speech Recognition. ICASSP 2020: 6989-6993 - [c29]Loren Lugosch, Brett H. Meyer, Derek Nowrouzezahrai, Mirco Ravanelli:
Using Speech Synthesis to Train End-To-End Spoken Language Understanding Models. ICASSP 2020: 8499-8503 - [c28]Xinchi Qiu, Titouan Parcollet, Mirco Ravanelli, Nicholas D. Lane, Mohamed Morchid:
Quaternion Neural Networks for Multi-Channel Distant Speech Recognition. INTERSPEECH 2020: 329-333 - [c27]Mirco Ravanelli:
Towards Unsupervised Learning of Speech Representations. Odyssey 2020 - [i24]Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, João Monteiro, Jan Trmal, Yoshua Bengio:
Multi-task self-supervised learning for Robust Speech Recognition. CoRR abs/2001.09239 (2020) - [i23]Xinchi Qiu, Titouan Parcollet, Mirco Ravanelli, Nicholas D. Lane, Mohamed Morchid:
Quaternion Neural Networks for Multi-channel Distant Speech Recognition. CoRR abs/2005.08566 (2020) - [i22]François Grondin, Jean-Samuel Lauzon, Simon Michaud, Mirco Ravanelli, François Michaud:
BIRD: Big Impulse Response Dataset. CoRR abs/2010.09930 (2020) - [i21]Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong:
Attention is All You Need in Speech Separation. CoRR abs/2010.13154 (2020)
2010 – 2019
- 2019
- [c26]Mirco Ravanelli, Titouan Parcollet, Yoshua Bengio:
The Pytorch-kaldi Speech Recognition Toolkit. ICASSP 2019: 6465-6469 - [c25]Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Chiheb Trabelsi, Renato De Mori, Yoshua Bengio:
Quaternion Recurrent Neural Networks. ICLR (Poster) 2019 - [c24]Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio:
Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks. INTERSPEECH 2019: 161-165 - [c23]Loren Lugosch, Mirco Ravanelli, Patrick Ignoto, Vikrant Singh Tomar, Yoshua Bengio:
Speech Model Pre-Training for End-to-End Spoken Language Understanding. INTERSPEECH 2019: 814-818 - [c22]Mirco Ravanelli, Yoshua Bengio:
Learning Speaker Representations with Mutual Information. INTERSPEECH 2019: 1153-1157 - [i20]Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio:
Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks. CoRR abs/1904.03416 (2019) - [i19]Loren Lugosch, Mirco Ravanelli, Patrick Ignoto, Vikrant Singh Tomar, Yoshua Bengio:
Speech Model Pre-training for End-to-End Spoken Language Understanding. CoRR abs/1904.03670 (2019) - 2018
- [j2]Mirco Ravanelli, Maurizio Omologo:
Automatic context window composition for distant speech recognition. Speech Commun. 101: 34-44 (2018) - [j1]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
Light Gated Recurrent Units for Speech Recognition. IEEE Trans. Emerg. Top. Comput. Intell. 2(2): 92-102 (2018) - [c21]Mirco Ravanelli, Dmitriy Serdyuk, Yoshua Bengio:
Twin Regularization for Online Speech Recognition. INTERSPEECH 2018: 3718-3722 - [c20]Mirco Ravanelli, Yoshua Bengio:
Speaker Recognition from Raw Waveform with SincNet. SLT 2018: 1021-1028 - [i18]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
Light Gated Recurrent Units for Speech Recognition. CoRR abs/1803.10225 (2018) - [i17]Mirco Ravanelli, Dmitriy Serdyuk, Yoshua Bengio:
Twin Regularization for online speech recognition. CoRR abs/1804.05374 (2018) - [i16]Mirco Ravanelli, Maurizio Omologo:
Automatic context window composition for distant speech recognition. CoRR abs/1805.10498 (2018) - [i15]Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Chiheb Trabelsi, Renato De Mori, Yoshua Bengio:
Quaternion Recurrent Neural Networks. CoRR abs/1806.04418 (2018) - [i14]Mirco Ravanelli, Yoshua Bengio:
Speaker Recognition from raw waveform with SincNet. CoRR abs/1808.00158 (2018) - [i13]Mirco Ravanelli, Titouan Parcollet, Yoshua Bengio:
The PyTorch-Kaldi Speech Recognition Toolkit. CoRR abs/1811.07453 (2018) - [i12]Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Renato De Mori:
Speech recognition with quaternion neural networks. CoRR abs/1811.09678 (2018) - [i11]Mirco Ravanelli, Yoshua Bengio:
Interpretable Convolutional Filters with SincNet. CoRR abs/1811.09725 (2018) - [i10]Mirco Ravanelli, Yoshua Bengio:
Learning Speaker Representations with Mutual Information. CoRR abs/1812.00271 (2018) - [i9]Mirco Ravanelli, Yoshua Bengio:
Speech and Speaker Recognition from Raw Waveform with SincNet. CoRR abs/1812.05920 (2018) - 2017
- [b1]Mirco Ravanelli:
Deep Learning for Distant Speech Recognition. University of Trento, Italy, 2017 - [c19]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
A network of deep neural networks for Distant Speech Recognition. ICASSP 2017: 4880-4884 - [c18]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
Improving Speech Recognition by Revising Gated Recurrent Units. INTERSPEECH 2017: 1308-1312 - [i8]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
A network of deep neural networks for distant speech recognition. CoRR abs/1703.08002 (2017) - [i7]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
Batch-normalized joint training for DNN-based distant speech recognition. CoRR abs/1703.08471 (2017) - [i6]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
Improving speech recognition by revising gated recurrent units. CoRR abs/1710.00641 (2017) - [i5]Mirco Ravanelli, Maurizio Omologo:
The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments. CoRR abs/1710.02560 (2017) - [i4]Mirco Ravanelli, Maurizio Omologo:
Contaminated speech training methods for robust DNN-HMM distant speech recognition. CoRR abs/1710.03538 (2017) - [i3]Mirco Ravanelli, Benjamin Elizalde, Karl Ni, Gerald Friedland:
Audio Concept Classification with Hierarchical Deep Neural Networks. CoRR abs/1710.04288 (2017) - [i2]Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo:
Realistic multi-microphone data simulation for distant speech recognition. CoRR abs/1711.09470 (2017) - [i1]Mirco Ravanelli:
Deep Learning for Distant Speech Recognition. CoRR abs/1712.06086 (2017) - 2016
- [c17]Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo:
Realistic Multi-Microphone Data Simulation for Distant Speech Recognition. INTERSPEECH 2016: 2786-2790 - [c16]Dayana Ribas, Emmanuel Vincent, John H. L. Hansen, Emma Jokinen, Mirco Ravanelli, Hannes Gamper, Fred Richardson:
Discussion. INTERSPEECH 2016 - [c15]Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio:
Batch-normalized joint training for DNN-based distant speech recognition. SLT 2016: 28-34 - 2015
- [c14]Mirco Ravanelli, Luca Cristoforetti, Roberto Gretter, Marco Pellin, Alessandro Sosi, Maurizio Omologo:
The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments. ASRU 2015: 275-282 - [c13]Erich Zwyssig, Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo:
A multi-channel corpus for distant-speech interaction in presence of known interferences. ICASSP 2015: 4480-4484 - [c12]Mirco Ravanelli, Maurizio Omologo:
Contaminated speech training methods for robust DNN-HMM distant speech recognition. INTERSPEECH 2015: 756-760 - [c11]Mirco Ravanelli, Benjamin Elizalde, Julia Bernd, Gerald Friedland:
Insights into Audio-Based Multimedia Event Classification with Neural Networks. MMCommons@ACM Multimedia 2015: 19-23 - 2014
- [c10]Mirco Ravanelli, Benjamin Elizalde, Karl Ni, Gerald Friedland:
Audio concept classification with Hierarchical Deep Neural Networks. EUSIPCO 2014: 606-610 - [c9]Alessio Brutti, Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo:
A speech event detection and localization task for multiroom environments. HSCMA 2014: 157-161 - [c8]Benjamin Elizalde, Mirco Ravanelli, Karl Ni, Damian Borth, Gerald Friedland:
Audio-concept features and hidden Markov models for multimedia event detection. SLAM@INTERSPEECH 2014: 3-8 - [c7]Mirco Ravanelli, Maurizio Omologo:
On the selection of the impulse responses for distant-speech recognition based on contaminated speech training. INTERSPEECH 2014: 1028-1032 - [c6]Marco Matassoni, Ramón Fernandez Astudillo, Athanasios Katsamanis, Mirco Ravanelli:
The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones. INTERSPEECH 2014: 1613-1617 - [c5]Mirco Ravanelli, Van Hai Do, Adam Janin:
TANDEM-bottleneck feature combination using hierarchical Deep Neural Networks. ISCSLP 2014: 113-117 - [c4]Luca Cristoforetti, Mirco Ravanelli, Maurizio Omologo, Alessandro Sosi, Alberto Abad, Martin Hagmueller, Petros Maragos:
The DIRHA simulated corpus. LREC 2014: 2629-2634 - 2013
- [c3]Benjamin Elizalde, Mirco Ravanelli, Gerald Friedland:
Audio Concept Ranking for Video Event Detection on User-Generated Content. SLAM@INTERSPEECH 2013: 9-14 - [c2]Alessandro Sosi, Fabio Brugnara, Luca Cristoforetti, Marco Matassoni, Mirco Ravanelli, Maurizio Omologo:
Embedding speech recognition to control lights. INTERSPEECH 2013: 759-760 - 2012
- [c1]Mirco Ravanelli, Alessandro Sosi, Piergiorgio Svaizer, Maurizio Omologo:
Impulse response estimation for robust speech recognition in a reverberant environment. EUSIPCO 2012: 1668-1672
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-10 20:52 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint