Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL).
Name | Description |
---|---|
MacBERT | Chinese pre-trained MacBERT models (MacBERT-base, MacBERT-large) |
CharBERT | English pre-trained CharBERT models |
Chinese-ELECTRA | Chinese pre-trained ELECTRA models (ELECTRA-base, ELECTRA-small) with code supports for six tasks: CMRC 2018, DRCD, XNLI, ChnSentiCorp, LCQMC, BQCorpus |
Chinese-XLNet | Chinese pre-trained XLNet models: XLNet-mid, XLNet-base |
Chinese-BERT-wwm | Chinese BERT with Whole Word Masking (wwm), including BERT-wwm, BERT-wwm-ext, RoBERTa-wwm-ext, RoBERTa-wwm-ext-large, RBT3, RBTL3 |
Name | Type | Paper |
---|---|---|
CMRC 2019 | Reading Comprehension | Cui et al., 2020 |
CJRC | Judiciary Reading Comprehension | Duan et al., 2019 |
CMRC 2018 | Reading Comprehension | Cui et al., 2019 |
CMRC 2017 | Reading Comprehension | Cui et al., 2018 |
PD&CFT | Reading Comprehension | Cui et al., 2016 |
Name | Description | Paper |
---|---|---|
TextBrewer | Knowledge Distillation for NLP | Yang et al., 2020 |
Name | Description | Paper |
---|---|---|
iFLYChecker | A Chinese Grammar Checking System | - |
IFlyLegal | A Chinese Legal System for Consultation & Law Searching | Wang et al., 2019 |
Name | Description |
---|---|
CAIL 2020 | Judiciary Reading Comprehension |
CMRC 2019 | Sentence Cloze Reading Comprehension |
CAIL 2019 | Judiciary Reading Comprehension |
CMRC 2018 | Span-Extraction Reading Comprehension |
CMRC 2017 | Cloze-style Reading Comprehension |
Paper | Authors | Venue | Note |
---|---|---|---|
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension | Yiming Cui, Ting Liu, Ziqing Yang, Zhipeng Chen, Wentao Ma, Wanxiang Che, Shijin Wang, Guoping Hu | COLING 2020 | GitHub |
CharBERT: Character-aware Pre-trained Language Model | Wentao Ma, Yiming Cui, Chenglei Si, Ting Liu, Shijin Wang, Guoping Hu | COLING 2020 | GitHub |
Revisiting Pre-Trained Models for Chinese Natural Language Processing | Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang, Guoping Hu | Findings of EMNLP | GitHub |
Is Graph Structure Necessary for Multi-hop Question Answering? | Nan Shao, Yiming Cui, Ting Liu, Shijin Wang, Guoping Hu | EMNLP 2020 | - |
Benchmarking Robustness of Machine Reading Comprehension Models | Chenglei Si, Ziqing Yang, Yiming Cui, Wentao Ma, Ting Liu, Shijin Wang | - | - |
TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing | Ziqing Yang, Yiming Cui, Zhipeng Chen, Wanxiang Che, Ting Liu, Shijin Wang, Guoping Hu | ACL 2020 Demo | GitHub |
Conversational Word Embedding for Retrieval-based Dialog System | Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang, Guoping Hu | ACL 2020 | GitHub |
Discriminative Sentence Modeling for Story Ending Prediction | Yiming Cui, Wanxiang Che, Wei-Nan Zhang, Ting Liu, Shijin Wang, Guoping Hu | AAAI 2020 | - |
Cross-Lingual Machine Reading Comprehension | Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang, Guoping Hu | EMNLP 2019 | GitHub |
A Span-Extraction Dataset for Chinese Machine Reading Comprehension | Yiming Cui, Ting Liu, Wanxiang Che, Li Xiao, Zhipeng Chen, Wentao Ma, Shijin Wang, Guoping Hu | EMNLP 2019 | GitHub |
IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis | Ziyue Wang, Baoxin Wang, Xingyi Duan, Dayong Wu, Shijin Wang, Guoping Hu, Ting Liu | EMNLP 2019 Demo | - |
TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots | Wentao Ma, Yiming Cui, Nan Shao, Su He, Wei-Nan Zhang, Ting Liu, Shijin Wang, Guoping Hu | CoNLL 2019 | GitHub |
Pre-Training with Whole Word Masking for Chinese BERT | Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Ziqing Yang, Shijin Wang, Guoping Hu | - | GitHub1, GitHub2 |
Improving Machine Reading Comprehension via Adversarial Training | Ziqing Yang, Yiming Cui, Wanxiang Che, Ting Liu, Shijin Wang, Guoping Hu | - | - |
Contextual Recurrent Units for Cloze-style Reading Comprehension | Yiming Cui, Wei-Nan Zhang, Wanxiang Che, Ting Liu, Zhipeng Chen, Shijin Wang, Guoping Hu | - | - |
CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension | Xingyi Duan, Baoxin Wang, Ziyue Wang, Wentao Ma, Yiming Cui, Dayong Wu, Shijin Wang, Ting Liu, Tianxiang Huo, Zhen Hu, Heng Wang, Zhiyuan Liu | CCL 2019 | GitHub |
Convolutional Spatial Attention Model for Reading Comprehension with Multiple-Choice Questions | Zhipeng Chen, Yiming Cui, Wentao Ma, Shijin Wang, Guoping Hu | AAAI 2019 | - |
Disconnected Recurrent Neural Networks for Text Categorization | Baoxin Wang | ACL 2018 | - |
HFL-RC System at SemEval-2018 Task 11: Hybrid Multi-Aspects Model for Commonsense Reading Comprehension | Zhipeng Chen, Yiming Cui*, Wentao Ma, Shijin Wang, Ting Liu, Guoping Hu | - | - |
Dataset for the First Evaluation on Chinese Machine Reading Comprehension | Yiming Cui, Ting Liu, Zhipeng Chen, Wentao Ma, Shijin Wang, Guoping Hu | LREC 2018 | GitHub |
Chinese Grammatical Error Diagnosis using Statistical and Prior Knowledge driven Features with Probabilistic Ensemble Enhancement | Ruiji Fu, Zhengqi Pei, Jiefu Gong, Wei Song, Dechuan Teng, Wanxiang Che, Shijin Wang, Guoping Hu, Ting Liu | NLP-TEA@ACL 2018 | - |
面向作文自动评分的优美句识别 | 付瑞吉,王栋,王士进,胡国平,刘挺 | 中文信息学报 | - |
Attention-over-Attention Neural Networks for Reading Comprehension | Yiming Cui, Zhipeng Chen, Si Wei, Shijin Wang, Ting Liu, Guoping Hu | ACL 2017 | - |
Generating and Exploiting Large-scale Pseudo Training Data for Zero Pronoun Resolution | Ting Liu, Yiming Cui, Qingyu Yin, Wei-Nan Zhang, Shijin Wang, Guoping Hu | ACL 2017 | - |
Consensus Attention-based Neural Networks for Chinese Reading Comprehension | Yiming Cui, Ting Liu, Zhipeng Chen, Shijin Wang, Guoping Hu | COLING 2016 | GitHub |
LSTM Neural Reordering Feature for Statistical Machine Translation | Yiming Cui, Shijin Wang, Jianfeng Li | NAACL 2016 | - |
Follow our official WeChat account to keep updated with our latest technologies!