[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Non-hierarchical Relation Extraction of Chinese Text Based on Scalable Corpus

  • Conference paper
  • First Online:
Semantic Technology (JIST 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10055))

Included in the following conference series:

  • 822 Accesses

Abstract

As for ontology construction from Chinese text, the non-hierarchical relation extraction is harder than the concept extraction and its extraction effect is still not satisfactory. In this paper, we put forward a scalable corpus model, which uses Tongyici Cilin and word2vec to calculate terms’ similarity and add the qualified candidate terms to the corpora. In this way we can expand the scalable corpus while extracting non-hierarchical relations. In turn, the scalable corpus that has been expanded with the new terms will facilitate the non-hierarchical relation extraction further. We carry out the experiment with Chinese texts in the domain of Computer, whose results show that with expansion of the corpus, the extraction effect will be better and better.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 35.99
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 44.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    In the paper, the concept refers to some concept or the instance of some concept.

  2. 2.

    https://github.com/fxsjy/jieba.

References

  1. Maedche, A., Staab, S.: Ontology learning for the semantic web. IEEE Intell. Syst. 16(2), 72–79 (2001)

    Article  Google Scholar 

  2. Jia, X., Wen, D.: A survey of ontology learning from text. Comput. Sci. 34(2), 181–185 (2007)

    MathSciNet  Google Scholar 

  3. Navigli, R., Velardi, P.: Learning domain ontologies from document warehouses and dedicated web sites. Comput. Linguist. 30(2), 151–179 (2004)

    Article  MATH  Google Scholar 

  4. He, H., Shanhong, Z., et al.: Research on domain ontology the concept extraction based on association rule and semantic rules. J. Jilin Univ. (Info. Sci. Edt.) 32(06), 657–663 (2014)

    Google Scholar 

  5. Hearst, M.A.: Automatic acquisition of hyponyms on large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics, pp. 539–545, Nantes, France (1992)

    Google Scholar 

  6. Buitelaar, P., Daniel, O., et al.: A Protege plug-in for ontology extraction from text based on linguistic analysis. In: Proceedings of the 1st European Semantic Web Symposium (2004)

    Google Scholar 

  7. Gu, J., Yan, M., et al.: Research on ontology relation acquisition based on improved association rule. Info. Stud. Theo. Appl. 34(12), 121–125 (2011)

    Google Scholar 

  8. Yu, F., Cheng, H., et al.: Non-hierarchical relations extraction of chinese texts based on grammar rules and improved association rules. Lib. Info. Ser. 57(22), 126–131 (2013)

    Google Scholar 

  9. Zhang, Y., Yang, F., et al.: Study on context based domain ontology the concept extraction and the relation extraction. Appl. Res. Comput. 27(1), 74–76 (2010)

    Google Scholar 

  10. Tian, J., Zhao, W.: The method of word similarity calculation based on synonym word lin. J. Jilin. Univ. 28(6), 602–608 (2010)

    Google Scholar 

  11. Agrawal, R., Ramakrishnan, S.: Fast algorithms for mining association rule in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499. VLDB (1994)

    Google Scholar 

  12. Zhang, Y., Yang, F.: Study on context based domain ontology the concept extraction and the relation extraction. Appl. Res. Comput. 27(1), 74–76 (2010)

    Google Scholar 

  13. Mikolov, T., Chen et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

Download references

Acknowledgments

Hai Wan’s research was in part supported by the National Natural Science Foundation of China under grant 61573386, Natural Science Foundation of Guangdong Province under grant 2016A030313292, Guangdong Province Science and Technology Plan projects under grant 2016B030305007, and Sun Yat-sen University Young Teachers Cultivation Project under grant 16lgpy40.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hai Wan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Su, X., Wan, H., Chen, R., Liu, Q., Zhang, W., Du, J. (2016). Non-hierarchical Relation Extraction of Chinese Text Based on Scalable Corpus. In: Li, YF., et al. Semantic Technology. JIST 2016. Lecture Notes in Computer Science(), vol 10055. Springer, Cham. https://doi.org/10.1007/978-3-319-50112-3_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50112-3_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50111-6

  • Online ISBN: 978-3-319-50112-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics