
Integrating automatic genre analysis into digital libraries

Published: 01 January 2001

Abstract

With the number and types of documents in digital library systems increasing, tools for automatically organizing and presenting the content have to be found. While many approaches focus on topic-based organization and structuring, hardly any system incorporates automatic structural analysis and representation. Yet genre information (unconsciously) forms one of the most distinguishing features in conventional libraries and in information searches. In this paper we present an approach to automatically analyze the structure of documents and to integrate this information into an automatically created content-based organization. In the resulting visualization, documents on similar topics, yet representing different genres, are depicted as books in differing colors. This representation intuitively supports users in locating relevant information presented in a relevant form.
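As a rough illustration of the idea in the abstract, the sketch below derives a few surface-level style features from a text and turns them into a book color. It is only a toy under stated assumptions: the abstract does not list the paper's actual features or its color mapping, so the three features, the min-max scaling to RGB, and the names `style_features` and `to_colors` are all hypothetical stand-ins.

```python
import re

def style_features(text):
    """Return three simple surface features (assumed, not from the paper)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        return (0.0, 0.0, 0.0)
    avg_sentence_len = len(words) / len(sentences)           # words per sentence
    avg_word_len = sum(len(w) for w in words) / len(words)   # letters per word
    punct_ratio = sum(text.count(c) for c in ",;:") / len(words)
    return (avg_sentence_len, avg_word_len, punct_ratio)

def to_colors(feature_rows):
    """Min-max scale each feature column to 0..255 and read each row as RGB."""
    columns = list(zip(*feature_rows))
    colors = []
    for row in feature_rows:
        rgb = []
        for value, col in zip(row, columns):
            lo, hi = min(col), max(col)
            # A constant column carries no information, so map it to 0.
            scaled = 0 if hi == lo else round(255 * (value - lo) / (hi - lo))
            rgb.append(scaled)
        colors.append(tuple(rgb))
    return colors

# Two made-up documents on the same topic but in different styles.
docs = {
    "report": "The system, which was evaluated extensively, performed well; results follow.",
    "chat": "Hi! How are you? Fine. See you.",
}
features = [style_features(t) for t in docs.values()]
colors = dict(zip(docs, to_colors(features)))
```

With only two documents, every feature column scales to its extremes, so the two texts end up with maximally different colors. A larger collection would spread over intermediate shades.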


Published In

JCDL '01: Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
January 2001
481 pages
ISBN: 1581133456
DOI: 10.1145/379437
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from ACM.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. SOMLib
  2. document clustering
  3. genre analysis
  4. metaphor graphics
  5. self-organizing map (SOM)
  6. visualization

Qualifiers

  • Article

Conference

JCDL01

Acceptance Rates

JCDL '01 Paper Acceptance Rate 76 of 250 submissions, 30%;
Overall Acceptance Rate 415 of 1,482 submissions, 28%

Cited By

  • (2018) An ensemble scheme based on language function analysis and feature engineering for text genre classification. Journal of Information Science, 44:1 (28-47). DOI: 10.1177/0165551516677911. Online publication date: 1-Feb-2018
  • (2017) Fine-grained opinion mining of product review using sentiment and semantic orientation. International Journal of Business Information Systems, 25:1 (1-17). DOI: 10.1504/IJBIS.2017.083274. Online publication date: 1-Jan-2017
  • (2016) A research study of sentiment analysis and various techniques of sentiment classification. International Journal of Data Analysis Techniques and Strategies, 8:2 (122-142). DOI: 10.1504/IJDATS.2016.077485. Online publication date: 1-Jan-2016
  • (2014) Evolving Digital Communication. Technological Advancements and the Impact of Actor-Network Theory (222-237). DOI: 10.4018/978-1-4666-6126-4.ch013. Online publication date: 2014
  • (2012) Innovation in Communication. International Journal of Actor-Network Theory and Technological Innovation, 4:1 (39-51). DOI: 10.4018/jantti.2012010104. Online publication date: 1-Jan-2012
  • (2012) Determining Dimensions of Social Websites. Proceedings of the 2012 45th Hawaii International Conference on System Sciences (1728-1736). DOI: 10.1109/HICSS.2012.205. Online publication date: 4-Jan-2012
  • (2012) Genre identification for office document search and browsing. International Journal on Document Analysis and Recognition, 15:3 (167-182). DOI: 10.1007/s10032-011-0163-7. Online publication date: 1-Sep-2012
  • (2010) Marrying Relevance and Genre Rankings: An Exploratory Study. Genres on the Web (191-208). DOI: 10.1007/978-90-481-9178-9_9. Online publication date: 16-Aug-2010
  • (2010) Web Genre Analysis: Use Cases, Retrieval Models, and Implementation Issues. Genres on the Web (167-189). DOI: 10.1007/978-90-481-9178-9_8. Online publication date: 16-Aug-2010
  • (2010) Formulating Representative Features with Respect to Genre Classification. Genres on the Web (129-147). DOI: 10.1007/978-90-481-9178-9_6. Online publication date: 16-Aug-2010
