Transfer Deep Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network

Jingjing Xu^18,19,
Shuming Ma^18,19,
Yi Zhang^18,19,
Bingzhen Wei^18,19,
Xiaoyan Cai²⁰ &
…
Xu Sun^18,19

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10619))

Included in the following conference series:

National CCF Conference on Natural Language Processing and Chinese Computing

Abstract

Recent studies have shown effectiveness in using neural networks for Chinese word segmentation. However, these models rely on large-scale data and are less effective for low-resource datasets because of insufficient training data. We propose a transfer learning method to improve low-resource word segmentation by leveraging high-resource corpora. First, we train a teacher model on high-resource corpora and then use the learned knowledge to initialize a student model. Second, a weighted data similarity method is proposed to train the student model on low-resource data. Experiment results show that our work significantly improves the performance on low-resource datasets: 2.3% and 1.5% F-score on PKU and CTB datasets. Furthermore, this paper achieves state-of-the-art results: 96.1%, and 96.2% F-score on PKU and CTB datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Neural Chinese Word Segmentation with Dictionary Knowledge

Addressing Domain Adaptation for Chinese Word Segmentation with Instances-Based Transfer Learning

Construction and Evaluation of Chinese Word Segmentation Datasets in Malay Archipelago

References

Cai, D., Zhao, H.: Neural word segmentation learning for Chinese. In: Meeting of the Association for Computational Linguistics (2016)
Google Scholar
Chen, X., Qiu, X., Zhu, C., Huang, X.: Gated recursive neural network for Chinese word segmentation. In: ACL (1), pp. 1744–1753. The Association for Computer Linguistics (2015)
Google Scholar
Chen, X., Qiu, X., Zhu, C., Liu, P., Huang, X.: Long short-term memory neural networks for Chinese word segmentation. In: EMNLP, pp. 1197–1206. The Association for Computational Linguistics (2015)
Google Scholar
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)
MATH Google Scholar
Emerson, T.: The second international Chinese word segmentation bakeoff. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, pp. 123–133 (2005)
Google Scholar
Kingma, D., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014)
Google Scholar
Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the Eighteenth International Conference on Machine Learning, Number 8 in ICML 2001, pp. 282–289 (2001)
Google Scholar
Liu, Y., Zhang, Y., Che, W., Liu, T., Wu, F.: Domain adaptation for CRF-based Chinese word segmentation using free annotations. In: Moschitti, A., Pang, B., Daelemans, W. (eds.) EMNLP, pp. 864–874. ACL (2014)
Google Scholar
Ma, J., Hinrichs, E.W.: Accurate linear-time Chinese word segmentation via embedding matching. In: ACL (1), pp. 1733–1743 (2015)
Google Scholar
Pei, W., Ge, T., Chang, B.: Max-margin tensor neural network for Chinese word segmentation. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, Maryland, Long Papers, vol. 1, pp. 293–303. Association for Computational Linguistics (2014)
Google Scholar
Peng, F., Feng, F., McCallum, A.: Chinese segmentation and new word detection using conditional random fields. In: Proceedings of the 20th International Conference on Computational Linguistics, Stroudsburg, PA, USA, COLING 2004. Association for Computational Linguistics (2004)
Google Scholar
Recht, B., Ré, C., Wright, S.J., Niu, F.: HOGWILD: a lock-free approach to parallelizing stochastic gradient descent. In: NIPS, pp. 693–701 (2011)
Google Scholar
Sun, W., Xu, J.: Enhancing Chinese word segmentation using unlabeled data. In: Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, 27–31 July 2011, John Mcintyre Conference Centre, Edinburgh, UK, A Meeting of SIGDAT, A Special Interest Group of the ACL, pp. 970–979 (2011)
Google Scholar
Sun, X.: Structure regularization for structured prediction. In: Advances in Neural Information Processing Systems 27, pp. 2402–2410 (2014)
Google Scholar
Sun, X.: Asynchronous parallel learning for neural networks and structured models with dense features. In: COLING (2016)
Google Scholar
Sun, X., Li, W., Wang, H., Qin, L.: Feature-frequency-adaptive on-line training for fast and accurate natural language processing. Comput. Linguist. 40(3), 563–586 (2014)
Article MathSciNet Google Scholar
Sun, X., Wang, H., Li, W.: Fast online training with frequency-adaptive learning rates for Chinese word segmentation and new word detection. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju Island, Korea, Long Papers, vol. 1, pp. 253–262. Association for Computational Linguistics (2012)
Google Scholar
Tseng, H.: A conditional random field word segmenter. In: Fourth SIGHAN Workshop on Chinese Language Processing (2005)
Google Scholar
Xue, N., Shen, L.: Chinese word segmentation as LMR tagging. In: Proceedings of the 2nd SIGHAN Workshop on Chinese Language Processing (2003)
Google Scholar
Zhang, M., Zhang, Y., Che, W., Liu, T.: Type-supervised domain adaptation for joint segmentation and POS-tagging. In: EACL, pp. 588–597 (2014)
Google Scholar
Zhang, M., Zhang, Y., Fu, G.: Transition-based neural word segmentation. In: Meeting of the Association for Computational Linguistics, pp. 421–431 (2016)
Google Scholar
Zhang, R., Kikui, G., Sumita, E.: Subword-based tagging by conditional random fields for Chinese word segmentation. In: Proceedings of the Human Language Technology Conference of the NAACL, Stroudsburg, PA, USA, NAACL-Short 2006, Companion Volume, Short Papers, pp. 193–196. Association for Computational Linguistics (2006)
Google Scholar
Zhang, Y., Clark, S.: Chinese segmentation with a word-based perceptron algorithm. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Prague, Czech Republic, pp. 840–847. Association for Computational Linguistics (2007)
Google Scholar
Zhao, H., Huang, C., Li, M., Lu, B.-L.: A unified character-based tagging framework for Chinese word segmentation. ACM Trans. Asian Lang. Inf. Process. 9(2), 5 (2010)
Article Google Scholar
Zhao, K., Huang, L.: Minibatch and parallelization for online large margin structured learning. In: HLT-NAACL, pp. 370–379. The Association for Computational Linguistics (2013)
Google Scholar
Zheng, X., Chen, H., Xu, T.: Deep learning for Chinese word segmentation and POS tagging. In: EMNLP, pp. 647–657. ACL (2013)
Google Scholar

Download references

Acknowledgments

We thank the anonymous reviewers for their valuable comments. This work was supported in part by National High Technology Research and Development Program of China (863 Program, No. 2015AA015404), National Natural Science Foundation of China (No. 61673028).

Author information

Authors and Affiliations

MOE Key Laboratory of Computational Linguistics, Peking University, Beijing, China
Jingjing Xu, Shuming Ma, Yi Zhang, Bingzhen Wei & Xu Sun
School of Electronics Engineering and Computer Science, Peking University, Beijing, China
Jingjing Xu, Shuming Ma, Yi Zhang, Bingzhen Wei & Xu Sun
School of Automation, Northwestern Polytechnical University, Xi’an, China
Xiaoyan Cai

Authors

Jingjing Xu
View author publications
You can also search for this author in PubMed Google Scholar
Shuming Ma
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Bingzhen Wei
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyan Cai
View author publications
You can also search for this author in PubMed Google Scholar
Xu Sun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jingjing Xu .

Editor information

Editors and Affiliations

Fudan University, Shanghai, China
Xuanjing Huang
Singapore Management University, Singapore, Singapore
Jing Jiang
Peking University, Beijing, China
Dongyan Zhao
Peking University, Beijing, China
Yansong Feng
Soochow University, Suzhou, China
Yu Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, J., Ma, S., Zhang, Y., Wei, B., Cai, X., Sun, X. (2018). Transfer Deep Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_62

Download citation

DOI: https://doi.org/10.1007/978-3-319-73618-1_62
Published: 05 January 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Transfer Deep Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Neural Chinese Word Segmentation with Dictionary Knowledge

Addressing Domain Adaptation for Chinese Word Segmentation with Instances-Based Transfer Learning

Construction and Evaluation of Chinese Word Segmentation Datasets in Malay Archipelago

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Transfer Deep Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Neural Chinese Word Segmentation with Dictionary Knowledge

Addressing Domain Adaptation for Chinese Word Segmentation with Instances-Based Transfer Learning

Construction and Evaluation of Chinese Word Segmentation Datasets in Malay Archipelago

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation