Non-hierarchical Relation Extraction of Chinese Text Based on Scalable Corpus

Xiaoheng Su²⁰,
Hai Wan²⁰,
Ruibin Chen²⁰,
Qi Liu²⁰,
Wenxuan Zhang²⁰ &
…
Jianfeng Du²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10055))

Included in the following conference series:

Joint International Semantic Technology Conference

829 Accesses

Abstract

As for ontology construction from Chinese text, the non-hierarchical relation extraction is harder than the concept extraction and its extraction effect is still not satisfactory. In this paper, we put forward a scalable corpus model, which uses Tongyici Cilin and word2vec to calculate terms’ similarity and add the qualified candidate terms to the corpora. In this way we can expand the scalable corpus while extracting non-hierarchical relations. In turn, the scalable corpus that has been expanded with the new terms will facilitate the non-hierarchical relation extraction further. We carry out the experiment with Chinese texts in the domain of Computer, whose results show that with expansion of the corpus, the extraction effect will be better and better.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automatic Extraction of Semantic Relations from Text Documents

The Study of Indian Domain Ontology Building Based on the Framework of HNC

A Construction Method for the Semantic Relation Corpus of Traditional Chinese Medicine

Notes

1.
In the paper, the concept refers to some concept or the instance of some concept.
2.
https://github.com/fxsjy/jieba.

References

Maedche, A., Staab, S.: Ontology learning for the semantic web. IEEE Intell. Syst. 16(2), 72–79 (2001)
Article Google Scholar
Jia, X., Wen, D.: A survey of ontology learning from text. Comput. Sci. 34(2), 181–185 (2007)
MathSciNet Google Scholar
Navigli, R., Velardi, P.: Learning domain ontologies from document warehouses and dedicated web sites. Comput. Linguist. 30(2), 151–179 (2004)
Article MATH Google Scholar
He, H., Shanhong, Z., et al.: Research on domain ontology the concept extraction based on association rule and semantic rules. J. Jilin Univ. (Info. Sci. Edt.) 32(06), 657–663 (2014)
Google Scholar
Hearst, M.A.: Automatic acquisition of hyponyms on large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics, pp. 539–545, Nantes, France (1992)
Google Scholar
Buitelaar, P., Daniel, O., et al.: A Protege plug-in for ontology extraction from text based on linguistic analysis. In: Proceedings of the 1st European Semantic Web Symposium (2004)
Google Scholar
Gu, J., Yan, M., et al.: Research on ontology relation acquisition based on improved association rule. Info. Stud. Theo. Appl. 34(12), 121–125 (2011)
Google Scholar
Yu, F., Cheng, H., et al.: Non-hierarchical relations extraction of chinese texts based on grammar rules and improved association rules. Lib. Info. Ser. 57(22), 126–131 (2013)
Google Scholar
Zhang, Y., Yang, F., et al.: Study on context based domain ontology the concept extraction and the relation extraction. Appl. Res. Comput. 27(1), 74–76 (2010)
Google Scholar
Tian, J., Zhao, W.: The method of word similarity calculation based on synonym word lin. J. Jilin. Univ. 28(6), 602–608 (2010)
Google Scholar
Agrawal, R., Ramakrishnan, S.: Fast algorithms for mining association rule in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499. VLDB (1994)
Google Scholar
Zhang, Y., Yang, F.: Study on context based domain ontology the concept extraction and the relation extraction. Appl. Res. Comput. 27(1), 74–76 (2010)
Google Scholar
Mikolov, T., Chen et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

Download references

Acknowledgments

Hai Wan’s research was in part supported by the National Natural Science Foundation of China under grant 61573386, Natural Science Foundation of Guangdong Province under grant 2016A030313292, Guangdong Province Science and Technology Plan projects under grant 2016B030305007, and Sun Yat-sen University Young Teachers Cultivation Project under grant 16lgpy40.

Author information

Authors and Affiliations

School of Data and Computer Science, Sun Yat-sen University, Guangzhou, 510006, China
Xiaoheng Su, Hai Wan, Ruibin Chen, Qi Liu & Wenxuan Zhang
Guangdong University of Foreign Studies, Guangzhou, 510006, China
Jianfeng Du

Authors

Xiaoheng Su
View author publications
You can also search for this author in PubMed Google Scholar
Hai Wan
View author publications
You can also search for this author in PubMed Google Scholar
Ruibin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Qi Liu
View author publications
You can also search for this author in PubMed Google Scholar
Wenxuan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jianfeng Du
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hai Wan .

Editor information

Editors and Affiliations

Information Technology, Monash University, Melbourne, Victoria, Australia
Yuan-Fang Li
Computer Science and Technology, Nanjing University, Nanjing, China
Wei Hu
Computer Science, National University of Singapore, Singapore, Singapore
Jin Song Dong
University of Huddersfield, Huddersfield, United Kingdom
Grigoris Antoniou
Information and Communication Technology, Griffith University, Brisbane, Queensland, Australia
Zhe Wang
ISTD, Singapore University of Technology and Design, Singapore, Singapore
Jun Sun
Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
Yang Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Su, X., Wan, H., Chen, R., Liu, Q., Zhang, W., Du, J. (2016). Non-hierarchical Relation Extraction of Chinese Text Based on Scalable Corpus. In: Li, YF., et al. Semantic Technology. JIST 2016. Lecture Notes in Computer Science(), vol 10055. Springer, Cham. https://doi.org/10.1007/978-3-319-50112-3_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-50112-3_17
Published: 27 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50111-6
Online ISBN: 978-3-319-50112-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics