[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3056662.3056683acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

Comparative study on book similarity measurement method based on characters in the body of a book written in Korean

Published: 26 February 2017 Publication History

Abstract

This paper presents a comparative study on similarity measurement method based on characters in the body of a book written in Korean. We deduce using ratio of characters in the body of a book for measuring similarity and compare results of book similarity using three different similarity measurement methods through the experiment. And we measure similarity using SMC (Simple Matching Coefficient), Jaccard coefficient and cosine similarity based on extracted characters in six books which are history category. The proposed method can measure book similarity more detail than existing research and result of measurement using cosine similarity is more accurate than other two methods because cosine similarity considers not only presence of characters also frequency of characters in the body of a book. If measuring book similarity based on characters in the body of books, we expect quick and detail performance at books which are categorized history, biography, etc.

References

[1]
Raymond J. Mooney and R. Loriene. 1999. Content-based book recommending using learning for text categorization. In Proceeding of the SIGIR-99 workshop on recommender system: Algorithm and Evaluation (Berkeley, CA, August 1999).
[2]
Orphee D. Clercq, Michael S, and Simone P. Ponzetto, 2014. Veronique Hoste. Exploiting framenet for content-based book recommendation. CBRecSys 2014 (October 6, 2014, Silicon Valley, CA, USA).
[3]
Basilico, Justin, and Thomas Hofmann. 2004. Unifying collaborative and content-based filtering. Proceeding of the twenty-first international conference on Machine Learning.
[4]
M. J. Pazzani and D. Billsus. 2007. Content-based recommendation systems. The adaptive web, volume 4321 of the series lecture notes in computer science, 325--341.
[5]
Donald A. Jackson, Keith M. Somers, and Harold H. Harvey. 1989. Similarity coefficients: Measures of co-occurrence and association or simply measures of occurrence. The University of Chicago Press for the American Society of Naturalists. Vol. 133, No. 3 (Mar., 1989). 436--453.
[6]
Anna Huang. 2008. Similarity measures for test document clustering. NZCSRSC 2008 (April 2008, Christchurch, New Zealand)
[7]
Gun-Hee Choi, Hee-Jeong Ahn, Jin-Soo Park, Seung-Hoon Kim. 2014. A Study on Extraction of Keywords in the Body of a Book. Korea Intelligent Information Systems Society 2014 Fall Conference. Nov. 2014, 191--193.
[8]
Hee-Jeong Ahn, Gun-Hee Choi, Seung-Hoon Kim. 2015. Thematic Word Extraction from Book based on Keyword Weighting Method. Korea Computer Information Society 2015 Winter Conference. Vol. 23, No. 1, Jan. 2015. 19--22.
[9]
Kyung-hee Lee, Ju-ho Lee, Myung-seok Choi, and Gil-chang Kim. 2000. Study on named entity recognition in Korean text. 12th Korean Language Information Conference 2000. 292--299.
[10]
Kyoung-man Bae, Sung-hyun Kim, Young-joong Ko, and Jong-hoon Kim. 2014. An efficient named entity and topic word recognition method based on named entity pattern in a natural language interface. Journal of Korea Information Technology, Vol. 12, No. 1. 121--129.
[11]
Seo-Hee Kim, Tae-Keun Park, and Seung-Hoon Kim. 2016. A Recognition Method for Main Characters Name in Korean Novels. Journal of the Korea Institute of Information & Electronic Communication Technology (JKIIECT). Vol. 9, No. 1. 75--81.

Index Terms

  1. Comparative study on book similarity measurement method based on characters in the body of a book written in Korean

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICSCA '17: Proceedings of the 6th International Conference on Software and Computer Applications
      February 2017
      339 pages
      ISBN:9781450348577
      DOI:10.1145/3056662
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 26 February 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. book recommendation
      2. character identification
      3. cosine similarity
      4. jaccard coefficient
      5. similarity measurement
      6. simple matching coefficient

      Qualifiers

      • Research-article

      Conference

      ICSCA 2017

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 63
        Total Downloads
      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 06 Jan 2025

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media