He Bai 0002
Person information
- affiliation: University of Waterloo, School of Computer Science, Canada
- affiliation (former): Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China
Other persons with the same name
- He Bai 0001 — Oklahoma State University, School of Mechanical and Aerospace Engineering, Stillwater, OK, USA (and 3 more)
- He Bai 0003 — City Cloud Technology (Hangzhou) Co., Ltd., China (and 1 more)
- He Bai 0004 — Dalian University of Technology, School of Management, China
- He Bai 0005 — University of Science and Technology Beijing, China
- He Bai 0007 — Beijing University of Posts and Telecommunications, School of Information and Communication Engineering, China
- He Bai 0008 — Shanghai Normal University, China
- He Bai 0009 — National University of Defense Technology, College of Computer Science and Technology, Changsha, China
- He Bai 0010 — Xidian University, School of Electronic Engineering, Xi'an, China
- He Bai 0011 — Peking University, School of Electronic and Computer Engineering, Shenzhen, China
- He Bai 0012 — China Academy of Space Technology, National Key Laboratory of Science and Technology on Space Microwave, Xi'an, China
- He Bai 0013 — Carnegie Mellon University, Pittsburgh, PA, USA
2020 – today
- 2024
- [c12] Pratyush Maini, Skyler Seto, Richard He Bai, David Grangier, Yizhe Zhang, Navdeep Jaitly:
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling. ACL (1) 2024: 14044-14072
- [c11] Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theodoros Rekatsinas, Benjamin Han, Yunyao Li, Jeffrey Pound, Joshua M. Susskind, Natalie Schluter, Ihab F. Ilyas, Navdeep Jaitly:
Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation. LREC/COLING 2024: 3782-3803
- [c10] Zhuofeng Wu, Richard He Bai, Aonan Zhang, Jiatao Gu, V. G. Vinod Vydiswaran, Navdeep Jaitly, Yizhe Zhang:
Divide-or-Conquer? Which Part Should You Distill Your LLM? EMNLP (Findings) 2024: 2572-2585
- [i17] Pratyush Maini, Skyler Seto, He Bai, David Grangier, Yizhe Zhang, Navdeep Jaitly:
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling. CoRR abs/2401.16380 (2024)
- [i16] Zhuofeng Wu, He Bai, Aonan Zhang, Jiatao Gu, V. G. Vinod Vydiswaran, Navdeep Jaitly, Yizhe Zhang:
Divide-or-Conquer? Which Part Should You Distill Your LLM? CoRR abs/2402.15000 (2024)
- [i15] Yizhe Zhang, He Bai, Ruixiang Zhang, Jiatao Gu, Shuangfei Zhai, Josh M. Susskind, Navdeep Jaitly:
How Far Are We from Intelligent Visual Deductive Reasoning? CoRR abs/2403.04732 (2024)
- [i14] Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly:
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition. CoRR abs/2405.15216 (2024)
- [i13] He Bai, Tatiana Likhomanenko, Ruixiang Zhang, Zijin Gu, Zakaria Aldeneh, Navdeep Jaitly:
dMel: Speech Tokenization made Simple. CoRR abs/2407.15835 (2024)
- 2023
- [i12] Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theodoros Rekatsinas, Benjamin Han, Yunyao Li, Jeffrey Pound, Josh M. Susskind, Natalie Schluter, Ihab F. Ilyas, Navdeep Jaitly:
Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation. CoRR abs/2309.11669 (2023)
- [i11] Shangshang Zheng, He Bai, Yizhe Zhang, Yi Su, Xiaochuan Niu, Navdeep Jaitly:
KGLens: A Parameterized Knowledge Graph Solution to Assess What an LLM Does and Doesn't Know. CoRR abs/2312.11539 (2023)
- 2022
- [c9] He Bai, Tong Wang, Alessandro Sordoni, Peng Shi:
Better Language Model with Hypernym Class Prediction. ACL (1) 2022: 1352-1362
- [c8] Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. EMNLP (Findings) 2022: 5248-5259
- [c7] Peng Shi, Linfeng Song, Lifeng Jin, Haitao Mi, He Bai, Jimmy Lin, Dong Yu:
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup. EMNLP (Findings) 2022: 5296-5306
- [c6] He Bai, Renjie Zheng, Junkun Chen, Mingbo Ma, Xintong Li, Liang Huang:
A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing. ICML 2022: 1399-1411
- [i10] He Bai, Renjie Zheng, Junkun Chen, Xintong Li, Mingbo Ma, Liang Huang:
A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing. CoRR abs/2203.09690 (2022)
- [i9] He Bai, Tong Wang, Alessandro Sordoni, Peng Shi:
Better Language Model with Hypernym Class Prediction. CoRR abs/2203.10692 (2022)
- [i8] Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. CoRR abs/2210.13693 (2022)
- [i7] Xiaoran Fan, Chao Pang, Tian Yuan, He Bai, Renjie Zheng, Pengfei Zhu, Shuohuan Wang, Junkun Chen, Zeyu Chen, Liang Huang, Yu Sun, Hua Wu:
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech. CoRR abs/2211.03545 (2022)
- 2021
- [c5] He Bai, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li:
Segatron: Segment-Aware Transformer for Language Modeling and Understanding. AAAI 2021: 12526-12534
- [c4] He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li:
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2. ACL (student) 2021: 148-162
- [i6] Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
Cross-Lingual Training with Dense Retrieval for Document Retrieval. CoRR abs/2109.01628 (2021)
- 2020
- [c3] Peng Shi, He Bai, Jimmy Lin:
Cross-Lingual Training of Neural Models for Document Ranking. EMNLP (Findings) 2020: 2768-2773
- [i5] He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li:
Semantics of the Unwritten. CoRR abs/2004.02251 (2020)
- [i4] He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Ming Li:
SegaBERT: Pre-training of Segment-aware BERT for Language Understanding. CoRR abs/2004.14996 (2020)
- [i3] Minghan Li, He Bai, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin:
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures. CoRR abs/2010.11351 (2020)
2010 – 2019
- 2019
- [c2] He Bai, Yu Zhou, Jiajun Zhang, Chengqing Zong:
Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference. ACL (1) 2019: 5448-5453
- [i2] He Bai, Yu Zhou, Jiajun Zhang, Chengqing Zong:
Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference. CoRR abs/1906.01788 (2019)
- 2018
- [c1] He Bai, Yu Zhou, Jiajun Zhang, Liang Zhao, Mei-Yuh Hwang, Chengqing Zong:
Source Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language. COLING 2018: 3597-3607
- [i1] He Bai, Yu Zhou, Jiajun Zhang, Liang Zhao, Mei-Yuh Hwang, Chengqing Zong:
Source-Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language. CoRR abs/1808.06167 (2018)
last updated on 2024-12-02 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license