More Web Proxy on the site http://driver.im/

default search action

combined dblp search
author search
venue search
publication search

ask others

Ye Jia

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tlt/JiaWSLNHBCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tlt/JiaWSLNHBCL24
Ye Jia, Xiangzhi Eric Wang, Zackary P. T. Sin, Chen Li, Peter H. F. Ng, Xiao Huang, George Baciu, Jiannong Cao, Qing Li:
Knowledge-Graph-Driven Mind Mapping for Immersive Collaborative Learning: A Pilot Study in Edu-Metaverse. IEEE Trans. Learn. Technol. 17: 1834-1848 (2024)
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/metacom/JiaSW0NHD0B0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/metacom/JiaSW0NHD0B0024
Ye Jia, Zackary P. T. Sin, Xiangzhi Eric Wang, Chen Li, Peter H. F. Ng, Xiao Huang, Junnan Dong, Yaowei Wang, George Baciu, Jiannong Cao, Qing Li:
NivTA: Towards a Naturally Interactable Edu-Metaverse Teaching Assistant for CAVE. MetaCom 2024: 57-64
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/vr/SinJLLLN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vr/SinJLLLN24
Zackary P. T. Sin, Ye Jia, Richard Chen Li, Hong Va Leong, Qing Li, Peter H. F. Ng:
illumotion: An Optical-illusion-based VR Locomotion Technique for Long-Distance 3D Movement. VR 2024: 924-934
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02133
Alex Agranovich, Eliya Nachmani, Oleg Rybakov, Yifan Ding, Ye Jia, Nadav Bar, Heiga Zen, Michelle Tadmor Ramanovich:
SimulTron: On-Device Simultaneous Speech to Speech Translation. CoRR abs/2406.02133 (2024)
2023
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tlt/SinJWZLNHBCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tlt/SinJWZLNHBCL23
Zackary P. T. Sin, Ye Jia, Astin C. H. Wu, Isaac Dan Zhao, Richard Chen Li, Peter H. F. Ng, Xiao Huang, George Baciu, Jiannong Cao, Qing Li:
Toward an Edu-Metaverse of Knowledge: Immersive Exploration of University Courses. IEEE Trans. Learn. Technol. 16(6): 1096-1110 (2023)
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiJC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiJC23
Xinjian Li, Ye Jia, Chung-Cheng Chiu:
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation. ICASSP 2023: 1-5
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icwl/NgCSJLBCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icwl/NgCSJLBCL23
Peter H. F. Ng, Peter Q. Chen, Zackary P. T. Sin, Ye Jia, Richard Chen Li, George Baciu, Jiannong Cao, Qing Li:
From Classroom to Metaverse: A Study on Gamified Constructivist Teaching in Higher Education. ICWL 2023: 92-106
[c22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SoltauSWRZJ00M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SoltauSWRZJ00M23
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Abhinav Rastogi, Jeffrey Zhao, Ye Jia, Wei Han, Yuan Cao, Aramys Miranda:
Speech Aware Dialog System Technology Challenge (DSTC11). INTERSPEECH 2023: 4668-4672
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-06718
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-06718
Haotian Zhang, Stuart Dereck Semujju, Zhicheng Wang, Xianwei Lv, Kang Xu, Liang Wu, Ye Jia, Jing Wu, Zhuo Long, Wensheng Liang, Xiaoguang Ma, Ruiyan Zhuang:
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey. CoRR abs/2312.06718 (2023)
2022
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HassidRSWJR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HassidRSWJR22
Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford, Miaosen Wang, Ye Jia, Tal Remez:
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech. CVPR 2022: 10577-10587
[c20]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/JiaRRP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/JiaRRP22
Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz:
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation. ICML 2022: 10120-10134
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiaDBC0CM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaDBC0CM22
Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobu Morioka:
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation. INTERSPEECH 2022: 1721-1725
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ConneauBZMPLCJR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ConneauBZMPLCJR22
Alexis Conneau, Ankur Bapna, Yu Zhang, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson:
XTREME-S: Evaluating Cross-lingual Speech Representations. INTERSPEECH 2022: 3248-3252
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FinkelsteinZCCJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FinkelsteinZCCJ22
Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark:
Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks. INTERSPEECH 2022: 4571-4575
[c16]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/lrec/JiaRWZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lrec/JiaRWZ22
Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen:
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation. LREC 2022: 6691-6703
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-03713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-03713
Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen:
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation. CoRR abs/2201.03713 (2022)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01374
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01374
Ankur Bapna, Colin Cherry, Yu Zhang, Ye Jia, Melvin Johnson, Yong Cheng, Simran Khanuja, Jason Riesa, Alexis Conneau:
mSLAM: Massively multilingual joint pre-training for speech and text. CoRR abs/2202.01374 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-10752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-10752
Alexis Conneau, Ankur Bapna, Yu Zhang, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson:
XTREME-S: Evaluating Cross-lingual Speech Representations. CoRR abs/2203.10752 (2022)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-13339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-13339
Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobuyuki Morioka:
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation. CoRR abs/2203.13339 (2022)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-13183
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-13183
Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark:
Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks. CoRR abs/2208.13183 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00115
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00115
Xinjian Li, Ye Jia, Chung-Cheng Chiu:
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation. CoRR abs/2211.00115 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08704
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08704
Hagen Soltau, Izhak Shafran, Mingqiu Wang, Abhinav Rastogi, Jeffrey Zhao, Ye Jia, Wei Han, Yuan Cao, Aramys Miranda:
Speech Aware Dialog System Technology Challenge (DSTC11). CoRR abs/2212.08704 (2022)
2021
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DingJHW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DingJHW21
Shaojin Ding, Ye Jia, Ke Hu, Quan Wang:
Textual Echo Cancellation. ASRU 2021: 548-555
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/EliasZSZJWW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/EliasZSZJWW21
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss, Yonghui Wu:
Parallel Tacotron: Non-Autoregressive and Controllable TTS. ICASSP 2021: 5709-5713
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EliasZS0JSW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EliasZS0JSW21
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, R. J. Skerry-Ryan, Yonghui Wu:
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. Interspeech 2021: 141-145
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiaZSZW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaZSZW21
Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang, Yonghui Wu:
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS. Interspeech 2021: 151-155
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-14574
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-14574
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, R. J. Skerry-Ryan, Yonghui Wu:
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. CoRR abs/2103.14574 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-15060
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-15060
Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang, Yonghui Wu:
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS. CoRR abs/2103.15060 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-08661
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-08661
Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz:
Translatotron 2: Robust direct speech-to-speech translation. CoRR abs/2107.08661 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10329
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10329
Ankur Bapna, Yu-An Chung, Nan Wu, Anmol Gulati, Ye Jia, Jonathan H. Clark, Melvin Johnson, Jason Riesa, Alexis Conneau, Yu Zhang:
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training. CoRR abs/2110.10329 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-10139
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-10139
Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford, Miaosen Wang, Ye Jia, Tal Remez:
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech. CoRR abs/2111.10139 (2021)
2020
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkZJHCLWL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkZJHCLWL20
Daniel S. Park, Yu Zhang, Ye Jia, Wei Han, Chung-Cheng Chiu, Bo Li, Yonghui Wu, Quoc V. Le:
Improved Noisy Student Training for Automatic Speech Recognition. INTERSPEECH 2020: 2817-2821
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09629
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09629
Daniel S. Park, Yu Zhang, Ye Jia, Wei Han, Chung-Cheng Chiu, Bo Li, Yonghui Wu, Quoc V. Le:
Improved Noisy Student Training for Automatic Speech Recognition. CoRR abs/2005.09629 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06006
Shaojin Ding, Ye Jia, Ke Hu, Quan Wang:
Textual Echo Cancellation. CoRR abs/2008.06006 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-04301
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04301
Jonathan Shen, Ye Jia, Mike Chrzanowski, Yu Zhang, Isaac Elias, Heiga Zen, Yonghui Wu:
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling. CoRR abs/2010.04301 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11439
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss, Yonghui Wu:
Parallel Tacotron: Non-Autoregressive and Controllable TTS. CoRR abs/2010.11439 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/RosenbergZRJMWW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/RosenbergZRJMWW19
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. ASRU 2019: 996-1002
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiaJMWCCALW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiaJMWCCALW19
Ye Jia, Melvin Johnson, Wolfgang Macherey, Ron J. Weiss, Yuan Cao, Chung-Cheng Chiu, Naveen Ari, Stella Laurenzo, Yonghui Wu:
Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation. ICASSP 2019: 7180-7184
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/HsuZWZWWCJCSNP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HsuZWZWWCJCSNP19
Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang:
Hierarchical Generative Modeling for Controllable Speech Synthesis. ICLR (Poster) 2019
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiaWBMJCW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaWBMJCW19
Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu:
Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model. INTERSPEECH 2019: 1123-1127
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZenDCZWJCW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZenDCZWJCW19
Heiga Zen, Viet Dang, Rob Clark, Yu Zhang, Ron J. Weiss, Ye Jia, Zhifeng Chen, Yonghui Wu:
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech. INTERSPEECH 2019: 1526-1530
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangWZWCSJRR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangWZWCSJRR19
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. INTERSPEECH 2019: 2080-2084
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangMWSWHSWJL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangMWSWHSWJL19
Quan Wang, Hannah Muckenhirn, Kevin W. Wilson, Prashant Sridhar, Zelin Wu, John R. Hershey, Rif A. Saurous, Ron J. Weiss, Ye Jia, Ignacio López-Moreno:
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking. INTERSPEECH 2019: 2728-2732
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BiadsyWMKJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BiadsyWMKJ19
Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. INTERSPEECH 2019: 4115-4119
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1902-08295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-08295
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-02882
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-02882
Heiga Zen, Viet Dang, Rob Clark, Yu Zhang, Ron J. Weiss, Ye Jia, Zhifeng Chen, Yonghui Wu:
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech. CoRR abs/1904.02882 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-04169
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04169
Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. CoRR abs/1904.04169 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-06037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-06037
Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu:
Direct speech-to-speech translation with a sequence-to-sequence model. CoRR abs/1904.06037 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-04448
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-04448
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. CoRR abs/1907.04448 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11699
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11699
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. CoRR abs/1909.11699 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1911-01601
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-01601
Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019)
2018
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/WangSZRBSXJRS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangSZRBSXJRS18
Yuxuan Wang, Daisy Stanton, Yu Zhang, R. J. Skerry-Ryan, Eric Battenberg, Joel Shor, Ying Xiao, Ye Jia, Fei Ren, Rif A. Saurous:
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. ICML 2018: 5167-5176
[c1]
- view
- export record
  dblp key:
  - conf/nips/JiaZWWSRCNPLW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiaZWWSRCNPLW18
Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio López-Moreno, Yonghui Wu:
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis. NeurIPS 2018: 4485-4495
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1803-09017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-09017
Yuxuan Wang, Daisy Stanton, Yu Zhang, R. J. Skerry-Ryan, Eric Battenberg, Joel Shor, Ying Xiao, Fei Ren, Ye Jia, Rif A. Saurous:
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. CoRR abs/1803.09017 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-04558
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-04558
Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio López-Moreno, Yonghui Wu:
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis. CoRR abs/1806.04558 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-04826
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-04826
Quan Wang, Hannah Muckenhirn, Kevin W. Wilson, Prashant Sridhar, Zelin Wu, John R. Hershey, Rif A. Saurous, Ron J. Weiss, Ye Jia, Ignacio López-Moreno:
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking. CoRR abs/1810.04826 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-07217
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-07217
Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang:
Hierarchical Generative Modeling for Controllable Speech Synthesis. CoRR abs/1810.07217 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02050
Ye Jia, Melvin Johnson, Wolfgang Macherey, Ron J. Weiss, Yuan Cao, Chung-Cheng Chiu, Naveen Ari, Stella Laurenzo, Yonghui Wu:
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation. CoRR abs/1811.02050 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.