Xi Wang 0016

Person information

affiliation: Microsoft Cloud and AI, Beijing, China

2020 – today

2024
[j1]
persistent URL: https://dblp.org/rec/journals/pami/TanCLCZLWLYHZQSL24
Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Sheng Zhao, Tao Qin, Frank K. Soong, Tie-Yan Liu:
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality. IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4234-4245 (2024)
[c16]
persistent URL: https://dblp.org/rec/conf/icassp/ChenWZ00WM24
Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng:
Stylespeech: Self-Supervised Style Enhancing with VQ-VAE-Based Pre-Training for Expressive Audiobook Speech Synthesis. ICASSP 2024: 12316-12320
[c15]
persistent URL: https://dblp.org/rec/conf/mm/Xiao000ZZ024
Yujia Xiao, Xi Wang, Xu Tan, Lei He, Xinfa Zhu, Sheng Zhao, Tan Lee:
Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis. ACM Multimedia 2024: 2099-2107
[c14]
persistent URL: https://dblp.org/rec/conf/mm/ZhuTWHX00Z024
Xinfa Zhu, Wenjie Tian, Xinsheng Wang, Lei He, Yujia Xiao, Xi Wang, Xu Tan, Sheng Zhao, Lei Xie:
UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis. ACM Multimedia 2024: 7513-7522
2023
[c13]
persistent URL: https://dblp.org/rec/conf/blizzard/XuZ0ZW0Z23
Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao:
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023. Blizzard Challenge 2023
[c12]
persistent URL: https://dblp.org/rec/conf/interspeech/WalshHNWRZ0ZDFW23
Brendan Walsh, Mark Hamilton, Greg Newby, Xi Wang, Serena Ruan, Sheng Zhao, Lei He, Shaofei Zhang, Eric Dettinger, William T. Freeman, Markus Weimer:
Large-Scale Automatic Audiobook Creation. INTERSPEECH 2023: 3675-3676
[c11]
persistent URL: https://dblp.org/rec/conf/interspeech/XiaoZ000ZSL23
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee:
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. INTERSPEECH 2023: 4883-4887
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-00782
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee:
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. CoRR abs/2307.00782 (2023)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-02743
Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao:
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023. CoRR abs/2309.02743 (2023)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-03926
Brendan Walsh, Mark Hamilton, Greg Newby, Xi Wang, Serena Ruan, Sheng Zhao, Lei He, Shaofei Zhang, Eric Dettinger, William T. Freeman, Markus Weimer:
Large-Scale Automatic Audiobook Creation. CoRR abs/2309.03926 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2312-12181
Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng:
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis. CoRR abs/2312.12181 (2023)
2022
[c10]
persistent URL: https://dblp.org/rec/conf/icassp/XiaoWHS22
Yujia Xiao, Xi Wang, Lei He, Frank K. Soong:
Improving Fastspeech TTS with Efficient Self-Attention and Compact Feed-Forward Network. ICASSP 2022: 7472-7476
[c9]
persistent URL: https://dblp.org/rec/conf/icassp/YiHPWX22
Yuanhao Yi, Lei He, Shifeng Pan, Xi Wang, Yujia Xiao:
Prosodyspeech: Towards Advanced Prosody Model for Neural Text-to-Speech. ICASSP 2022: 7582-7586
[c8]
persistent URL: https://dblp.org/rec/conf/interspeech/YiHPWZ22
Yuanhao Yi, Lei He, Shifeng Pan, Xi Wang, Yuchao Zhang:
SoftSpeech: Unsupervised Duration Model in FastSpeech 2. INTERSPEECH 2022: 1606-1610
[c7]
persistent URL: https://dblp.org/rec/conf/interspeech/WuWZHSN22
Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie:
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis. INTERSPEECH 2022: 5503-5507
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-04421
Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Frank K. Soong, Tao Qin, Sheng Zhao, Tie-Yan Liu:
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality. CoRR abs/2205.04421 (2022)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-12559
Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie:
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis. CoRR abs/2206.12559 (2022)
2021
[c6]
persistent URL: https://dblp.org/rec/conf/icassp/ChenDWSH21
Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He:
Speech Bert Embedding for Improving Prosody in Neural TTS. ICASSP 2021: 6563-6567
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-04312
Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He:
Speech BERT Embedding For Improving Prosody in Neural TTS. CoRR abs/2106.04312 (2021)
2020
[c5]
persistent URL: https://dblp.org/rec/conf/apsipa/HwangSSWKK20
Min-Jae Hwang, Frank K. Soong, Eunwoo Song, Xi Wang, Hyeonjoo Kang, Hong-Goo Kang:
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis. APSIPA 2020: 810-814
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/CuiWHS20
Yang Cui, Xi Wang, Lei He, Frank K. Soong:
An Efficient Subband Linear Prediction for LPCNet-Based Neural Synthesis. INTERSPEECH 2020: 3555-3559
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-08480
Xi Wang, Huaiping Ming, Lei He, Frank K. Soong:
s-Transformer: Segment-Transformer for Robust Neural Speech Synthesis. CoRR abs/2011.08480 (2020)

2010 – 2019

2019
[c3]
persistent URL: https://dblp.org/rec/conf/interspeech/ZhengWHPSWT19
Yibin Zheng, Xi Wang, Lei He, Shifeng Pan, Frank K. Soong, Zhengqi Wen, Jianhua Tao:
Forward-Backward Decoding for Regularizing End-to-End TTS. INTERSPEECH 2019: 1283-1287
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1907-09006
Yibin Zheng, Xi Wang, Lei He, Shifeng Pan, Frank K. Soong, Zhengqi Wen, Jianhua Tao:
Forward-Backward Decoding for Regularizing End-to-End TTS. CoRR abs/1907.09006 (2019)
2018
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/CuiWHS18
Yang Cui, Xi Wang, Lei He, Frank K. Soong:
A New Glottal Neural Vocoder for Speech Synthesis. INTERSPEECH 2018: 2017-2021
[c1]
persistent URL: https://dblp.org/rec/conf/iscslp/XieSWHL18
Feng-Long Xie, Frank K. Soong, Xi Wang, Lei He, Haifeng Li:
Frame Selection in SI-DNN Phonetic Space with WaveNet Vocoder for Voice Conversion without Parallel Training Data. ISCSLP 2018: 56-60
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-11913
Min-Jae Hwang, Frank K. Soong, Feng-Long Xie, Xi Wang, Hong-Goo Kang:
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis. CoRR abs/1811.11913 (2018)
