Abstract
As for ontology construction from Chinese text, the non-hierarchical relation extraction is harder than the concept extraction and its extraction effect is still not satisfactory. In this paper, we put forward a scalable corpus model, which uses Tongyici Cilin and word2vec to calculate terms’ similarity and add the qualified candidate terms to the corpora. In this way we can expand the scalable corpus while extracting non-hierarchical relations. In turn, the scalable corpus that has been expanded with the new terms will facilitate the non-hierarchical relation extraction further. We carry out the experiment with Chinese texts in the domain of Computer, whose results show that with expansion of the corpus, the extraction effect will be better and better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
In the paper, the concept refers to some concept or the instance of some concept.
- 2.
References
Maedche, A., Staab, S.: Ontology learning for the semantic web. IEEE Intell. Syst. 16(2), 72–79 (2001)
Jia, X., Wen, D.: A survey of ontology learning from text. Comput. Sci. 34(2), 181–185 (2007)
Navigli, R., Velardi, P.: Learning domain ontologies from document warehouses and dedicated web sites. Comput. Linguist. 30(2), 151–179 (2004)
He, H., Shanhong, Z., et al.: Research on domain ontology the concept extraction based on association rule and semantic rules. J. Jilin Univ. (Info. Sci. Edt.) 32(06), 657–663 (2014)
Hearst, M.A.: Automatic acquisition of hyponyms on large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics, pp. 539–545, Nantes, France (1992)
Buitelaar, P., Daniel, O., et al.: A Protege plug-in for ontology extraction from text based on linguistic analysis. In: Proceedings of the 1st European Semantic Web Symposium (2004)
Gu, J., Yan, M., et al.: Research on ontology relation acquisition based on improved association rule. Info. Stud. Theo. Appl. 34(12), 121–125 (2011)
Yu, F., Cheng, H., et al.: Non-hierarchical relations extraction of chinese texts based on grammar rules and improved association rules. Lib. Info. Ser. 57(22), 126–131 (2013)
Zhang, Y., Yang, F., et al.: Study on context based domain ontology the concept extraction and the relation extraction. Appl. Res. Comput. 27(1), 74–76 (2010)
Tian, J., Zhao, W.: The method of word similarity calculation based on synonym word lin. J. Jilin. Univ. 28(6), 602–608 (2010)
Agrawal, R., Ramakrishnan, S.: Fast algorithms for mining association rule in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499. VLDB (1994)
Zhang, Y., Yang, F.: Study on context based domain ontology the concept extraction and the relation extraction. Appl. Res. Comput. 27(1), 74–76 (2010)
Mikolov, T., Chen et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Acknowledgments
Hai Wan’s research was in part supported by the National Natural Science Foundation of China under grant 61573386, Natural Science Foundation of Guangdong Province under grant 2016A030313292, Guangdong Province Science and Technology Plan projects under grant 2016B030305007, and Sun Yat-sen University Young Teachers Cultivation Project under grant 16lgpy40.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Su, X., Wan, H., Chen, R., Liu, Q., Zhang, W., Du, J. (2016). Non-hierarchical Relation Extraction of Chinese Text Based on Scalable Corpus. In: Li, YF., et al. Semantic Technology. JIST 2016. Lecture Notes in Computer Science(), vol 10055. Springer, Cham. https://doi.org/10.1007/978-3-319-50112-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-50112-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50111-6
Online ISBN: 978-3-319-50112-3
eBook Packages: Computer ScienceComputer Science (R0)