Matthew Wiesner
2020 – today
- 2024
- [c31] Bismarck Bamfo Odoom, Nathaniel R. Robinson, Elijah Rippeth, Luis Tavarez-Arce, Kenton Murray, Matthew Wiesner, Paul McNamee, Philipp Koehn, Kevin Duh:
Can Synthetic Speech Improve End-to-End Conversational Speech Translation? AMTA (1) 2024: 167-177
- [c30] Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. ICASSP 2024: 11831-11835
- [c29] Amir Hussein, Dorsa Zeinali, Ondrej Klejch, Matthew Wiesner, Brian Yan, Shammur Absar Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur:
Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora. ICASSP 2024: 12006-12010
- [c28] Patrick Foley, Matthew Wiesner, Bismarck Odoom, Leibny Paola García-Perera, Kenton Murray, Philipp Koehn:
Where are you from? Geolocating Speech and Applications to Language Identification. NAACL-HLT 2024: 5114-5126
- [c27] Desh Raj, Matthew Wiesner, Matthew Maciejewski, Paola García, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. Odyssey 2024: 91-98
- [i24] Desh Raj, Matthew Wiesner, Matthew Maciejewski, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. CoRR abs/2401.15676 (2024)
- [i23] Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. CoRR abs/2406.02560 (2024)
- [i22] Samuele Cornell, Taejin Park, Steve Huang, Christoph Böddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola García, Shinji Watanabe:
The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. CoRR abs/2407.16447 (2024)
- [i21] Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization. CoRR abs/2409.03655 (2024)
- [i20] Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
HLTCOE JHU Submission to the Voice Privacy Challenge 2024. CoRR abs/2409.08913 (2024)
- [i19] Alexander Polok, Dominik Klement, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
Target Speaker ASR with Whisper. CoRR abs/2409.09543 (2024)
- 2023
- [c26] Ruizhe Huang, Matthew Wiesner, Leibny Paola García-Perera, Daniel Povey, Jan Trmal, Sanjeev Khudanpur:
Building Keyword Search System from End-To-End Asr Systems. ICASSP 2023: 1-5
- [c25] Brian Yan, Matthew Wiesner, Ondrej Klejch, Preethi Jyothi, Shinji Watanabe:
Towards Zero-Shot Code-Switched Speech Recognition. ICASSP 2023: 1-5
- [c24] Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, Sanjeev Khudanpur:
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts. INTERSPEECH 2023: 924-928
- [c23] Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur:
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation. INTERSPEECH 2023: 4074-4078
- [c22] Amir Hussein, Cihan Xiao, Neha Verma, Thomas Thebaud, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2023 Dialect Speech Translation System Description. IWSLT@ACL 2023: 283-290
- [c21] Henry Li Xinyuan, Neha Verma, Bismarck Bamfo Odoom, Ujvala Pradeep, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2023 Multilingual Speech Translation System Description. IWSLT@ACL 2023: 302-310
- [i18] Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, Sanjeev Khudanpur:
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts. CoRR abs/2306.01031 (2023)
- [i17] Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur:
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation. CoRR abs/2306.11252 (2023)
- [i16] Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola García, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur:
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios. CoRR abs/2306.13734 (2023)
- [i15] Amir Hussein, Dorsa Zeinali, Ondrej Klejch, Matthew Wiesner, Brian Yan, Shammur Absar Chowdhury, Ahmed M. Ali, Shinji Watanabe, Sanjeev Khudanpur:
Speech collage: code-switched audio generation by collaging monolingual corpora. CoRR abs/2309.15674 (2023)
- 2022
- [c20] Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models. ICASSP 2022: 8597-8601
- [c19] Jinyi Yang, Amir Hussein, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2022 Dialect Speech Translation System Description. IWSLT@ACL 2022: 319-326
- [i14] Brian Yan, Matthew Wiesner, Ondrej Klejch, Preethi Jyothi, Shinji Watanabe:
Towards Zero-Shot Code-Switched Speech Recognition. CoRR abs/2211.01458 (2022)
- 2021
- [c18] Matthew Wiesner, Mousmita Sarma, Ashish Arora, Desh Raj, Dongji Gao, Ruizhe Huang, Supreet Preet, Moris Johnson, Zikra Iqbal, Nagendra Goel, Jan Trmal, Leibny Paola García-Perera, Sanjeev Khudanpur:
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition. Interspeech 2021: 2906-2910
- [c17] Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post:
The Multilingual TEDx Corpus for Speech Recognition and Translation. Interspeech 2021: 3655-3659
- [c16] Antonios Anastasopoulos, Ondrej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Miguel Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alex Waibel, Changhan Wang, Matthew Wiesner:
Findings of the IWSLT 2021 Evaluation Campaign. IWSLT 2021: 1-29
- [c15] Motoi Omachi, Yuya Fujita, Shinji Watanabe, Matthew Wiesner:
End-to-end ASR to jointly predict transcriptions and linguistic annotations. NAACL-HLT 2021: 1861-1871
- [i13] Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post:
The Multilingual TEDx Corpus for Speech Recognition and Translation. CoRR abs/2102.01757 (2021)
- [i12] Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models. CoRR abs/2110.04863 (2021)
- 2020
- [c14] Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W. Black, Jason Eisner:
A Corpus for Large-Scale Phonetic Typology. ACL 2020: 4526-4546
- [c13] Oliver Adams, Matthew Wiesner, Jan Trmal, Garrett Nicolai, David Yarowsky:
Induced Inflection-Set Keyword Search in Speech. SIGMORPHON 2020: 210-216
- [i11] Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W. Black, Jason Eisner:
A Corpus for Large-Scale Phonetic Typology. CoRR abs/2005.13962 (2020)
2010 – 2019
- 2019
- [c12] Matthew Wiesner, Oliver Adams, David Yarowsky, Jan Trmal, Sanjeev Khudanpur:
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer. ASRU 2019: 1048-1054
- [c11] Martin Karafiát, Murali Karthick Baskar, Shinji Watanabe, Takaaki Hori, Matthew Wiesner, Jan Cernocký:
Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. INTERSPEECH 2019: 2220-2224
- [c10] Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings. INTERSPEECH 2019: 4375-4379
- [c9] Oliver Adams, Matthew Wiesner, Shinji Watanabe, David Yarowsky:
Massively Multilingual Adversarial Speech Recognition. NAACL-HLT (1) 2019: 96-108
- [i10] Oliver Adams, Matthew Wiesner, Shinji Watanabe, David Yarowsky:
Massively Multilingual Adversarial Speech Recognition. CoRR abs/1904.02210 (2019)
- [i9] Oliver Adams, Matthew Wiesner, Jan Trmal, Garrett Nicolai, David Yarowsky:
Induced Inflection-Set Keyword Search in Speech. CoRR abs/1910.12299 (2019)
- 2018
- [c8] Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH 2018: 2052-2056
- [c7] Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018: 2207-2211
- [c6] Adithya Renduchintala, Shuoyang Ding, Matthew Wiesner, Shinji Watanabe:
Multi-Modal Data Augmentation for End-to-end ASR. INTERSPEECH 2018: 2394-2398
- [c5] Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Mallidi, Nelson Yalta, Martin Karafiát, Shinji Watanabe, Takaaki Hori:
Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. SLT 2018: 521-527
- [c4] Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. SLT 2018: 656-663
- [i8] Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak:
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR abs/1802.08731 (2018)
- [i7] Adithya Renduchintala, Shuoyang Ding, Matthew Wiesner, Shinji Watanabe:
Multi-Modal Data Augmentation for End-to-end ASR. CoRR abs/1803.10299 (2018)
- [i6] Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. CoRR abs/1804.00015 (2018)
- [i5] Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. CoRR abs/1807.06204 (2018)
- [i4] Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Reddy Mallidi, Nelson Yalta, Martin Karafiát, Shinji Watanabe, Takaaki Hori:
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling. CoRR abs/1810.03459 (2018)
- [i3] Martin Karafiát, Murali Karthick Baskar, Shinji Watanabe, Takaaki Hori, Matthew Wiesner, Jan Honza Cernocký:
Analysis of Multilingual Sequence-to-Sequence speech recognition systems. CoRR abs/1811.03451 (2018)
- [i2] Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Low Resource Multi-modal Data Augmentation for End-to-end ASR. CoRR abs/1812.03919 (2018)
- 2017
- [c3] Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech Without ASR. INTERSPEECH 2017: 2501-2505
- [c2] Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur:
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH 2017: 3597-3601
- [i1] Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech without ASR. CoRR abs/1703.07476 (2017)
- 2015
- [c1] Hynek Hermansky, Lukás Burget, Jordan Cohen, Emmanuel Dupoux, Naomi Feldman, John Godfrey, Sanjeev Khudanpur, Matthew Maciejewski, Sri Harish Reddy Mallidi, Anjali Menon, Tetsuji Ogawa, Vijayaditya Peddinti, Richard C. Rose, Richard M. Stern, Matthew Wiesner, Karel Veselý:
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop. ICASSP 2015: 5009-5013
last updated on 2024-11-14 22:05 CET by the dblp team
all metadata released as open data under CC0 1.0 license