[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1164820.1164829acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiixConference Proceedingsconference-collections
Article

Towards genre classification for IR in the workplace

Published: 18 October 2006 Publication History

Abstract

Use of document genre in information retrieval systems has the potential to improve the task-appropriateness of results. However, genre classification remains a challenging problem. We describe a case study of genre classification in a software engineering workplace domain, which includes the development of a genre taxonomy and experiments in automatic genre classification using supervised machine learning. We present results based on evaluation using real-life enterprise data from this work domain.

References

[1]
Hawking, D.: Challenges in Enterprise Search, presented at the Australasian Database Conference, Dunedin, New Zealand (2004)
[2]
Fagin, R. Kumar, K. S. McCurley, J. Novak, D. Sivakumar, Tomlin, J. A., Williamson D. P.: Searching the Workplace Web. presented at WWW '03 International World Web Conference (2003)
[3]
Broder, A., Ciccolo, A. C.: Towards the Next Generation of Enterprise Search Technology. IBM Systems Journal 43 (2004) 451--454
[4]
Jarvelin, K., Ingwersen, P.: Information Seeking Research Needs Extensions towards Tasks and Technology. Information Research 10 (2004)
[5]
Anderson, C. J., Glassman, M, McAfee R. B., Pinelli, T.: An Investigation of Factors Afffecting how Engineers and Scientists Seek Information. Journal of Engineering and Technology Management 18 (2001) 131--155
[6]
Hertzum, M.: The Importance of Trust in Software Engineers' Assessment and Choice of Information Sources. Information and Organization 12 (2002) 1--18
[7]
Fidel, R., Green, M.: The Many Faces of Accessibility: Engineers' Perception of Information Sources. Information Processing & Management 40 (2004) 563--581
[8]
Freund, L., Toms, E. G., Clarke, C. L. A.: Modeling Task-Genre Relationships for IR in the Workplace. Annual International ACM SIGIR Conference, Salvador, Brazil (2005)
[9]
Freund, L., Toms, E. G., Waterhouse, J.: Modeling the Information Behaviour of Software Engineers using a Work -Task Framework. 68th Annual Meeting of the American Society for Information Science and Technology, Charlotte, NC (2005).
[10]
Vakkari, P.: Task-Based Information Searching. Annual Review of Information Science and Technology 37 (2003) 413--463
[11]
Bystrom, K., Hansen, P.: Conceptual Framework for Tasks in Information Studies. Journal of the American Society for Information Science and Technology 56 (2005) 1050--1061
[12]
Bystrom, K.: Information and Information Sources in Tasks of Varying Complexity. Journal of the American Society for Information Science 53 (2002) 581--591
[13]
Orlikowski, W. J., Yates, J.: Genre Repertoire: the Structuring of Communicative Practices in Organizations. Administrative Science Quarterly 39 (1994) 541--574
[14]
Toms, E. G.: Recognizing Digital Genre. Bulletin of the American Society for information Science and Technology 27 (2001) 20--22
[15]
Yoshioka, T., Herman, G., Yates, J., Orlikowski, W. J.: Genre Taxonomy: a Knowledge Repository of Communicative Actions. ACM Transactions on Information Systems 19 (2001) 431--456
[16]
Karlgren, J.: Non-Topical Factors in Information Access. Webnet '99, Honolulu, 1999.
[17]
Roussinov, D. G., Crowston, K., Nilan, M., Kwasnik, B., Cai, J., Liu, X.: Genre Based Navigation on the Web. presented at Hawai'i International Conference on Systems Sciences, Maui, Hawai'i (2001)
[18]
Muresan, G., Smith, C. L., Cole, M. Liu, L. Belkin, N. J.: Detecting Document Genre for Personalization in Information Retrieval. Proceedings of the Hawaii International Conference on System Sciences, Kauai, Hawai'I (2006)
[19]
Bretan, I., Dewe, J., Hallberg, A., Wolkert, N., Karlgren, J.: Web-Specific Genre Visualization. presented at WebNet '98, Orlando Florida (1998)
[20]
Glover, E. J., Lawrence, S., Gordon, M. D., Birmingham, W. P., Giles, C. L.: Web Search -- Your Way. Communications of the ACM 44 (2001) 97--102
[21]
Dewdney, N., VanEss-Dykema, C., MacMillan, R.: The Form is the Substance: Classification of Genres in Text. Proceedings of ACL Workshop on Human Language Technology and Knowledge Management (2001)
[22]
Meyer zu Eissen, S., Stein, B.: Genre Classification of Web Pages. Proceedings of the 27th German Conference on Artificial Intelligence, Ulm, Germany (2004)
[23]
Lee, Y.-B., Myaeng, S. H.: Text Genre Classification with Genre Revealing and Subject-Revealing Features. Proceedings of the 25th Annual International ACM SIGIR Conference (2002)
[24]
Stamatatos, E., Fakotakis, N., Kokkinakis, G.: Text Genre Detection using Common Word Frequencies. Proceedings of the 18th International Conference on Computational Linguistics (2000)
[25]
Finn, A., Kushmerick, N.: Learning to Classify Documents according to Genre. presented at IJCAI Workshop on Computational Approaches to Style Analysis and Synthesis (2003)
[26]
Shepherd, M., Watters, C., Kennedy, A.: Cybergenre: Automatic Identification of Home Pages on the Web. Journal of Web Engineering 3 (2004) 236--251
[27]
Joachims, T.: Learning to Classify Text using Support Vector Machines: Methods, Theory and Algorithms, Kluwer, Amsterdam (2002)
[28]
Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. Proceedings of the European Conference on Machine Learning (1998)

Cited By

View all
  • (2024)Using genres for laboratory work, report writing and design projects: comparing second and fourth year undergraduate engineersInformation and Learning Sciences10.1108/ILS-03-2024-0031Online publication date: 17-Sep-2024
  • (2024)Mapping the relationship between genres and tasks: A study of undergraduate engineersJournal of the Association for Information Science and Technology10.1002/asi.24897Online publication date: 20-Apr-2024
  • (2023)Automatic genre identification: a surveyLanguage Resources and Evaluation10.1007/s10579-023-09695-8Online publication date: 16-Nov-2023
  • Show More Cited By

Index Terms

  1. Towards genre classification for IR in the workplace

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    IIiX: Proceedings of the 1st international conference on Information interaction in context
    October 2006
    187 pages
    ISBN:1595934820
    DOI:10.1145/1164820
    • Program Chair:
    • Ian Ruthven
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 October 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. contextual information retrieval
    2. enterprise search
    3. genre classification
    4. genre-dependent applications

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate 21 of 45 submissions, 47%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Using genres for laboratory work, report writing and design projects: comparing second and fourth year undergraduate engineersInformation and Learning Sciences10.1108/ILS-03-2024-0031Online publication date: 17-Sep-2024
    • (2024)Mapping the relationship between genres and tasks: A study of undergraduate engineersJournal of the Association for Information Science and Technology10.1002/asi.24897Online publication date: 20-Apr-2024
    • (2023)Automatic genre identification: a surveyLanguage Resources and Evaluation10.1007/s10579-023-09695-8Online publication date: 16-Nov-2023
    • (2023)Tracing Information Use Over Time: A Comparative Study of Undergraduate EngineersProceedings of the Association for Information Science and Technology10.1002/pra2.90460:1(938-940)Online publication date: 22-Oct-2023
    • (2021)Registerial Adaptation vs. Innovation Across Situational Contexts: 18th Century Women in TransitionFrontiers in Artificial Intelligence10.3389/frai.2021.6099704Online publication date: 1-Jun-2021
    • (2019)Cross-lingual genre classification using linguistic groupingsJournal of Computing Sciences in Colleges10.5555/3306465.330647934:3(91-96)Online publication date: 1-Jan-2019
    • (2019)Register in computational language researchRegister Studies10.1075/rs.18015.arg1:1(100-135)Online publication date: 26-Apr-2019
    • (2018)Argumentation MiningSynthesis Lectures on Human Language Technologies10.2200/S00883ED1V01Y201811HLT04011:2(1-191)Online publication date: 20-Dec-2018
    • (2018)Classification of Textual Genres Using Discourse InformationComputational Linguistics and Intelligent Text Processing10.1007/978-3-319-75477-2_46(636-647)Online publication date: 21-Mar-2018
    • (2017)Classifying news versus opinions in newspapers: Linguistic features for domain independenceNatural Language Engineering10.1017/S135132491700004323:05(687-707)Online publication date: 21-Feb-2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media