Abstract
Anaphora resolution plays an important role in information mining from Chinese micro-blogs. Based on the linguistic features of personal pronouns in Chinese micro-blog texts, this paper proposes a multi-strategy method for resolving personal pronoun anaphora. First, personal pronouns and their candidate antecedents are extracted from the texts using part-of-speech tagging and named entity recognition, and rules are established for judging whether a personal pronoun and a candidate antecedent are consistent in grammar, semantics, gender, and number (singular or plural). Candidates that are inconsistent with the personal pronoun in these four aspects are filtered out in a preliminary pass, yielding Candidate Set 1. Then, an SVM classifies the antecedents in Candidate Set 1, and those judged to have an anaphoric relation with the current personal pronoun form Candidate Set 2. Finally, the best antecedent is selected from Candidate Set 2 by a priority selection policy that combines four linguistic characteristics: grammatical role, co-occurrence relation, reference distance, and appositive dependency. In addition, an antecedent-extension strategy is provided for cases in which the above method cannot find the pronoun's antecedent. The method is validated on the NLPCC2013 micro-blog corpus, and the experimental results show that it achieves an F value of 91.7% on Chinese micro-blog texts.
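The three-stage flow described in the abstract (rule-based filtering, classification, priority selection) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `Mention` fields, the `compatible` rule, the `ROLE_RANK` ordering, and the `stub_classifier` stand-in for the trained SVM are all hypothetical. Only gender, number, and a crude semantic check are shown in stage 1, and only grammatical role and reference distance in stage 3; the grammatical check, co-occurrence relation, appositive dependency, and the antecedent-extension strategy are omitted.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Mention:
    """A pronoun or candidate antecedent; every field is an illustrative assumption."""
    text: str
    gender: str      # "male", "female", or "unknown"
    number: str      # "singular", "plural", or "unknown"
    is_person: bool  # crude stand-in for the semantic consistency check
    role: str        # "subject", "object", or "other"
    distance: int    # hypothetical distance (e.g. in clauses) from the pronoun


def compatible(a: str, b: str) -> bool:
    """Two attribute values agree if they match or either one is unknown."""
    return a == b or "unknown" in (a, b)


def consistency_filter(pronoun: Mention, candidates: List[Mention]) -> List[Mention]:
    """Stage 1: drop candidates that clash with the pronoun (Candidate Set 1).

    Assumption: a clash on any single checked attribute removes the candidate.
    """
    return [
        c for c in candidates
        if c.is_person
        and compatible(c.gender, pronoun.gender)
        and compatible(c.number, pronoun.number)
    ]


# Assumed priority order for grammatical roles; a lower rank is preferred.
ROLE_RANK = {"subject": 0, "object": 1, "other": 2}


def priority_select(candidates: List[Mention]) -> Optional[Mention]:
    """Stage 3: pick the best candidate by role rank, then by shorter distance."""
    if not candidates:
        return None
    return min(candidates, key=lambda c: (ROLE_RANK.get(c.role, 2), c.distance))


def resolve(
    pronoun: Mention,
    candidates: List[Mention],
    is_anaphoric: Callable[[Mention, Mention], bool],
) -> Optional[Mention]:
    """Run the three stages; None marks the case left to an extension strategy."""
    set1 = consistency_filter(pronoun, candidates)
    # Stage 2: keep candidates the classifier accepts (Candidate Set 2).
    set2 = [c for c in set1 if is_anaphoric(pronoun, c)]
    return priority_select(set2)


def stub_classifier(pronoun: Mention, candidate: Mention) -> bool:
    """Placeholder for a trained SVM: accepts any candidate within 3 units."""
    return candidate.distance <= 3


pronoun = Mention("she", "female", "singular", True, "subject", 0)
candidates = [
    Mention("Xiao Ming", "male", "singular", True, "subject", 1),
    Mention("Xiao Hong", "female", "singular", True, "object", 1),
    Mention("Beijing", "unknown", "singular", False, "other", 2),
    Mention("Teacher Li", "unknown", "singular", True, "other", 3),
]

best = resolve(pronoun, candidates, stub_classifier)
print(best.text if best else "no antecedent found")
```

In this toy input, stage 1 removes the male candidate and the non-person entity, the stand-in classifier keeps the remaining two, and the priority rule prefers the nearer candidate with the higher-ranked role. In practice the classifier argument would wrap a model trained on annotated pronoun and antecedent pairs.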
Acknowledgements
This work was supported by grants from the National Natural Science Foundation of China (No. 61772081) and the Science and Technology Development Project of Beijing Municipal Education Commission (No. KM201711232014, No. KM201711232022).
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Peng, Y., Zhang, Y., Huang, S., Chen, R., You, J. (2018). Resolution of Personal Pronoun Anaphora in Chinese Micro-blog. In: Hong, JF., Su, Q., Wu, JS. (eds) Chinese Lexical Semantics. CLSW 2018. Lecture Notes in Computer Science, vol 11173. Springer, Cham. https://doi.org/10.1007/978-3-030-04015-4_51
DOI: https://doi.org/10.1007/978-3-030-04015-4_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04014-7
Online ISBN: 978-3-030-04015-4
eBook Packages: Computer Science, Computer Science (R0)