default search action
Gaofeng Cheng
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j14]Sanli Tian, Zehan Li, Zhaobiao Lyv, Gaofeng Cheng, Qing Xiao, Ta Li, Qingwei Zhao:
Factorized and progressive knowledge distillation for CTC-based ASR models. Speech Commun. 160: 103071 (2024) - [j13]Lingxuan Ye, Changfeng Gao, Gaofeng Cheng, Liuping Luo, Qingwei Zhao:
ASQ: An Ultra-Low Bit Rate ASR-Oriented Speech Quantization Method. IEEE Signal Process. Lett. 31: 221-225 (2024) - [j12]Haitian Lu, Gaofeng Cheng, Yonghong Yan:
Conversational Short-Phrase Speaker Diarization via Self-Adjusting Speech Segmentation and Embedding Extraction. IEEE Signal Process. Lett. 31: 2340-2344 (2024) - [j11]Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan:
Boosting Cross-Domain Speech Recognition With Self-Supervision. IEEE ACM Trans. Audio Speech Lang. Process. 32: 471-485 (2024) - [j10]Yifan Chen, Gaofeng Cheng, Runyan Yang, Pengyuan Zhang, Yonghong Yan:
Interrelate Training and Clustering for Online Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1352-1364 (2024) - 2023
- [j9]Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan:
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3320-3330 (2023) - [i16]Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Speech Corpora Divergence Based Unsupervised Data Selection for ASR. CoRR abs/2302.13222 (2023) - [i15]Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan:
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition. CoRR abs/2308.06547 (2023) - 2022
- [j8]Gaofeng Cheng, Huan Chen, Pingzhi Fan, Li Li, Li Hao:
A layered grouping random access scheme based on dynamic preamble selection for massive machine type communications. Sci. China Inf. Sci. 65(7): 1-2 (2022) - [j7]Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
An E2E-ASR-Based Iteratively-Trained Timestamp Estimator. IEEE Signal Process. Lett. 29: 1654-1658 (2022) - [j6]Keqi Deng, Gaofeng Cheng, Runyan Yang, Yonghong Yan:
Alleviating ASR Long-Tailed Problem by Decoupling the Learning of Representation and Classification. IEEE ACM Trans. Audio Speech Lang. Process. 30: 340-354 (2022) - [j5]Gaofeng Cheng, Haoran Miao, Runyan Yang, Keqi Deng, Yonghong Yan:
ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1360-1373 (2022) - [j4]Changfeng Gao, Gaofeng Cheng, Ta Li, Pengyuan Zhang, Yonghong Yan:
Self-Supervised Pre-Training for Attention-Based Encoder-Decoder ASR Model. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1763-1774 (2022) - [c24]Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang:
Improving CTC-Based Speech Recognition Via Knowledge Transferring from Pre-Trained Language Models. ICASSP 2022: 8517-8521 - [c23]Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang:
Improving Non-Autoregressive End-to-End Speech Recognition with Pre-Trained Acoustic and Language Models. ICASSP 2022: 8522-8526 - [c22]Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization. INTERSPEECH 2022: 1456-1460 - [c21]Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan:
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies. INTERSPEECH 2022: 1671-1675 - [c20]Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan:
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset. INTERSPEECH 2022: 1736-1740 - [c19]Han Zhu, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Decoupled Federated Learning for ASR with Non-IID Data. INTERSPEECH 2022: 2628-2632 - [c18]Sanli Tian, Keqi Deng, Zehan Li, Lingxuan Ye, Gaofeng Cheng, Ta Li, Yonghong Yan:
Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning. INTERSPEECH 2022: 2633-2637 - [c17]Lingxuan Ye, Gaofeng Cheng, Runyan Yang, Zehui Yang, Sanli Tian, Pengyuan Zhang, Yonghong Yan:
Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods. INTERSPEECH 2022: 3163-3167 - [c16]Han Zhu, Li Wang, Gaofeng Cheng, Jindong Wang, Pengyuan Zhang, Yonghong Yan:
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR. INTERSPEECH 2022: 4870-4874 - [c15]Qingxuan Li, Han Zhu, Liuping Luo, Gaofeng Cheng, Pengyuan Zhang, Jiasong Sun, Yonghong Yan:
Sequence Distribution Matching for Unsupervised Domain Adaptation in ASR. ISCSLP 2022: 21-25 - [c14]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. ISCSLP 2022: 488-492 - [c13]Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Summary On The ISCSLP 2022 Chinese-English Code-Switching ASR Challenge. ISCSLP 2022: 527-531 - [i14]Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang:
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models. CoRR abs/2201.10103 (2022) - [i13]Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang:
Improving CTC-based speech recognition via knowledge transferring from pre-trained language models. CoRR abs/2203.03582 (2022) - [i12]Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan:
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset. CoRR abs/2203.16844 (2022) - [i11]Han Zhu, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Decoupled Federated Learning for ASR with Non-IID Data. CoRR abs/2206.09102 (2022) - [i10]Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan:
Boosting Cross-Domain Speech Recognition with Self-Supervision. CoRR abs/2206.09783 (2022) - [i9]Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization. CoRR abs/2206.13760 (2022) - [i8]Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan:
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies. CoRR abs/2207.02495 (2022) - [i7]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022) - [i6]Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge. CoRR abs/2210.06091 (2022) - 2021
- [j3]Runyan Yang, Gaofeng Cheng, Haoran Miao, Ta Li, Pengyuan Zhang, Yonghong Yan:
Keyword Search Using Attention-Based End-to-End ASR and Frame-Synchronous Phoneme Alignments. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3202-3215 (2021) - [c12]Yifan Guo, Yifan Chen, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Far-Field Speech Recognition Based on Complex-Valued Neural Networks and Inter-Frame Similarity Difference Method. ASRU 2021: 1003-1010 - [c11]Keqi Deng, Gaofeng Cheng, Haoran Miao, Pengyuan Zhang, Yonghong Yan:
History Utterance Embedding Transformer LM for Speech Recognition. ICASSP 2021: 5914-5918 - [c10]Changfeng Gao, Gaofeng Cheng, Runyan Yang, Han Zhu, Pengyuan Zhang, Yonghong Yan:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data. ICASSP 2021: 6543-6547 - [c9]Changfeng Gao, Gaofeng Cheng, Jun Zhou, Pengyuan Zhang, Yonghong Yan:
Non-autoregressive Deliberation-Attention based End-to-End ASR. ISCSLP 2021: 1-5 - [i5]Gaofeng Cheng, Huan Chen, Pingzhi Fan, Li Li, Li Hao:
A Layered Grouping Random Access Scheme Based on Dynamic Preamble Selection for Massive Machine Type Communications. CoRR abs/2102.12672 (2021) - [i4]Han Zhu, Li Wang, Ying Hou, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Wav2vec-S: Semi-Supervised Pre-Training for Speech Recognition. CoRR abs/2110.04484 (2021) - [i3]Changfeng Gao, Gaofeng Cheng, Yifan Guo, Qingwei Zhao, Pengyuan Zhang:
Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition. CoRR abs/2112.12522 (2021) - 2020
- [j2]Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1452-1465 (2020) - [c8]Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan:
Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture. ICASSP 2020: 6084-6088 - [i2]Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan:
Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture. CoRR abs/2001.08290 (2020)
2010 – 2019
- 2019
- [j1]Gaofeng Cheng, Pengyuan Zhang, Ji Xu:
Automatic Speech Recognition System with Output-Gate Projected Gated Recurrent Unit. IEICE Trans. Inf. Syst. 102-D(2): 355-363 (2019) - [c7]Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun:
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation. APSIPA 2019: 1256-1261 - [c6]Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Ta Li, Yonghong Yan:
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. INTERSPEECH 2019: 2623-2627 - [i1]Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun:
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation. CoRR abs/1912.11613 (2019) - 2018
- [c5]Gaofeng Cheng, Daniel Povey, Lu Huang, Ji Xu, Sanjeev Khudanpur, Yonghong Yan:
Output-Gate Projected Gated Recurrent Unit for Speech Recognition. INTERSPEECH 2018: 1793-1797 - [c4]Wenjie Li, Gaofeng Cheng, Fengpei Ge, Pengyuan Zhang, Yonghong Yan:
Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR. INTERSPEECH 2018: 2888-2892 - [c3]Daniel Povey, Gaofeng Cheng, Yiming Wang, Ke Li, Hainan Xu, Mahsa Yarmohammadi, Sanjeev Khudanpur:
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks. INTERSPEECH 2018: 3743-3747 - [c2]Gaofeng Cheng, Lu Huang, Jiasong Sun, Yonghong Yan:
Bidirectional LSTM with Extended Input Context. ISCSLP 2018: 364-368 - 2017
- [c1]Gaofeng Cheng, Vijayaditya Peddinti, Daniel Povey, Vimal Manohar, Sanjeev Khudanpur, Yonghong Yan:
An Exploration of Dropout with LSTMs. INTERSPEECH 2017: 1586-1590
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-04 20:04 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint