default search action
Ziyue Jiang 0001
Person information
- unicode name: 江子越
- affiliation: Zhejiang University, Hangzhou, China
Other persons with the same name
- Ziyue Jiang 0002 — TuSimple, China
- Ziyue Jiang 0003 — Southern Medical University, Third Affiliated Hospital, Department of Orthopedics, Guangzhou, China
- Ziyue Jiang 0004 — Southeast University, MOE Key Laboratory of Computer Network and Information Integration, Nanjing, China
- Ziyue Jiang 0005 — South China University of Technology, Guangzhou, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c15]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. ACL (1) 2024: 1979-1998 - [c14]Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners. ACL (1) 2024: 10929-10942 - [c13]Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. ACL (1) 2024: 13588-13600 - [c12]Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. EMNLP 2024: 1960-1975 - [c11]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis. ICLR 2024 - [c10]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. ICLR 2024 - [c9]Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024 - [c8]Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639 - [i24]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. CoRR abs/2401.08503 (2024) - [i23]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. CoRR abs/2402.07729 (2024) - [i22]Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialong Zuo, Shulei Wang, Zhou Zhao:
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models. CoRR abs/2402.12208 (2024) - [i21]Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao:
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. CoRR abs/2406.01205 (2024) - [i20]Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai:
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis. CoRR abs/2407.14006 (2024) - [i19]Jiawei Huang, Chen Zhang, Yi Ren, Ziyue Jiang, Zhenhui Ye, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency. CoRR abs/2408.04708 (2024) - [i18]Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024) - [i17]Yang Chen, Yuhang Jia, Shiwan Zhao, Ziyue Jiang, Haoran Li, Jiarong Kang, Yong Qin:
DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency. CoRR abs/2409.12992 (2024) - [i16]Yu Zhang, Changhao Pan, Wenxiang Guo, Ruiqi Li, Zhiyuan Zhu, Jialei Wang, Wenhao Xu, Jingyu Lu, Zhiqing Hong, Chuxin Wang, LiChao Zhang, Jinzheng He, Ziyue Jiang, Yuxin Chen, Chen Yang, Jiecheng Zhou, Xinyu Cheng, Zhou Zhao:
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks. CoRR abs/2409.13832 (2024) - [i15]Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. CoRR abs/2409.15977 (2024) - [i14]Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li:
FluentEditor+: Text-based Speech Editing by Modeling Local Hierarchical Acoustic Smoothness and Global Prosody Consistency. CoRR abs/2410.03719 (2024) - [i13]Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Jiang, Jiawei Huang, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang, Zehan Wang, Xize Chen, Xiang Yin, Zhou Zhao:
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes. CoRR abs/2410.06734 (2024) - 2023
- [c7]Rongjie Huang, Yi Ren, Ziyue Jiang, Chenye Cui, Jinglin Liu, Zhou Zhao:
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis. ACL (Findings) 2023: 6994-7009 - [c6]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. ACL (1) 2023: 9317-9331 - [c5]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings) 2023: 11655-11671 - [c4]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. ICLR 2023 - [i12]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. CoRR abs/2301.13430 (2023) - [i11]Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023) - [i10]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023) - [i9]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. CoRR abs/2305.13612 (2023) - [i8]Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Unified Voice Synthesis With Discrete Representation. CoRR abs/2305.19269 (2023) - [i7]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis. CoRR abs/2306.03504 (2023) - [i6]Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. CoRR abs/2306.03509 (2023) - [i5]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. CoRR abs/2307.07218 (2023) - [i4]Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models. CoRR abs/2308.14430 (2023) - [i3]Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li:
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency. CoRR abs/2309.11725 (2023) - 2022
- [c3]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. NeurIPS 2022 - [i2]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. CoRR abs/2206.02147 (2022) - 2021
- [c2]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. IJCAI 2021: 3829-3835 - [i1]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. CoRR abs/2110.07216 (2021) - 2020
- [c1]Ziyue Jiang, Hongcheng Zhu, Li Peng, Wenbing Ding, Yanzhen Ren:
Self-Supervised Spoofing Audio Detection Scheme. INTERSPEECH 2020: 4223-4227
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-28 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint