[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2134254.2134276acmotherconferencesArticle/Chapter ViewAbstractPublication PagesisecConference Proceedingsconference-collections
research-article

Automatic categorization of bug reports using latent Dirichlet allocation

Published: 22 February 2012 Publication History

Abstract

Software developers, particularly in open-source projects, rely on bug repositories to organize their work. On a bug report, the component field is used to indicate to which team of developers a bug should be routed. Researchers have shown that incorrect categorization of newly received bug reports to components can cause potential delays in the resolution of bug reports. Approaches have been developed that consider the use of machine learning approaches, specifically Support Vector Machines (svm), to automatically categorize bug reports into the appropriate component to help streamline the process of solving a bug. One drawback of an SVM-based approach is that the results of categorization can be uneven across various components in the system if some components receive less reports than others. In this paper, we consider broadening the consistency of the recommendations produced by an automatic approach by investigating three approaches to automating bug report categorization: an approach similar to previous ones based on an SVM classifier and Term Frequency Inverse Document Frequency(svm-tf-idf), an approach using Latent Dirichlet Allocation (LDA) with SVM (svm-lda) and an approach using LDA and Kullback Leibler divergence (lda-kl). We found that lda-kl produced recalls similar to those found previously but with better consistency across all components for which bugs must be categorized.

References

[1]
J. Anvik. Assisting bug report triage through recommendation. PhD thesis, University of British Columbia, 2007.
[2]
J. Anvik, L. Hiew, and G. C. Murphy. Who should fix this bug? In Proc. of the 28th Int'l Conf. on Soft. Eng., pages 361--370. ACM, 2006.
[3]
J. Anvik and G. C. Murphy. Reducing the effort of bug report triage: Recommenders for development-oriented decisions. ACM Trans. on Soft. Eng. and Methodology, 20(3), 2011.
[4]
D.M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[5]
G. Canfora and L. Cerulo. Supporting change request assignment in open source development. In Proc. of the 2006 ACM Symp. on Applied Computing (SAC), pages 1767--1772, 2006.
[6]
B. Carpenter and B. Baldwin. Text Analysis with LingPipe 4. Lingpipe publishing, 2010.
[7]
D. Cubranic and G. C. Murphy. Automatic bug triage using text categorization. In Proc. of the 16th Int'l Conf. on Soft. Eng. & Knowledge Eng. (SEKE), pages 92--97, 2004.
[8]
M. Di Penta G. A. Di Lucca and S. Gradara. An approach to classify software maintenance requests. In Proc. of the Int'l Conf. on Soft. Maint. (ICSM'02), pages 93--102. IEEE Computer Society, 2002.
[9]
P.J. Guo, T. Zimmermann, N. Nagappan, and B. Murphy. "Not my bug!" and other reasons for software bug report reassignments. In Proc. of the ACM 2011 Conf. on Computer Supported Cooperative Work, CSCW '11, pages 395--404. ACM, 2011.
[10]
C.E. Veni Madhavan R. Arun, V. Suresh and M. Narasimha Murty. On finding the natural number of topics with latent dirichlet allocation: Some observations. Advances in Knowledge Discovery and Data Mining, pages 291--402, 2010.
[11]
T.L.Griffiths and M. Steyvers. Finding scientific topics. In Proc. of the National Academy of Sciences, 101 (suppl. 1), pages 5228--5235, 2004.

Cited By

View all
  • (2023)Predicting the Change Impact of Resolving Defects by Leveraging the Topics of Issue Reports in Open Source Software SystemsACM Transactions on Software Engineering and Methodology10.1145/359380232:6(1-34)Online publication date: 30-Sep-2023
  • (2023)Still Confusing for Bug-Component Triaging? Deep Feature Learning and Ensemble Setting to Rescue2023 IEEE/ACM 31st International Conference on Program Comprehension (ICPC)10.1109/ICPC58990.2023.00046(316-327)Online publication date: May-2023
  • (2023)A multi-model framework for semantically enhancing detection of quality-related bug report descriptionsEmpirical Software Engineering10.1007/s10664-022-10280-w28:2Online publication date: 11-Feb-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ISEC '12: Proceedings of the 5th India Software Engineering Conference
February 2012
174 pages
ISBN:9781450311427
DOI:10.1145/2134254
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • IITK: Indian Institute of Technology Kanpur

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 February 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. component recommendation
  2. recommendation system
  3. software bug triage

Qualifiers

  • Research-article

Conference

ISEC '12
Sponsor:
  • IITK
ISEC '12: India Software Engineering Conference 2012
February 22 - 25, 2012
Kanpur, India

Acceptance Rates

ISEC '12 Paper Acceptance Rate 26 of 107 submissions, 24%;
Overall Acceptance Rate 76 of 315 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Predicting the Change Impact of Resolving Defects by Leveraging the Topics of Issue Reports in Open Source Software SystemsACM Transactions on Software Engineering and Methodology10.1145/359380232:6(1-34)Online publication date: 30-Sep-2023
  • (2023)Still Confusing for Bug-Component Triaging? Deep Feature Learning and Ensemble Setting to Rescue2023 IEEE/ACM 31st International Conference on Program Comprehension (ICPC)10.1109/ICPC58990.2023.00046(316-327)Online publication date: May-2023
  • (2023)A multi-model framework for semantically enhancing detection of quality-related bug report descriptionsEmpirical Software Engineering10.1007/s10664-022-10280-w28:2Online publication date: 11-Feb-2023
  • (2022)Text Mining Approach to Improve Mobile Role Playing Games Using Users’ ReviewsApplied Sciences10.3390/app1212624312:12(6243)Online publication date: 19-Jun-2022
  • (2022)Automatic Bug Triaging via Deep Reinforcement LearningApplied Sciences10.3390/app1207356512:7(3565)Online publication date: 31-Mar-2022
  • (2022)Automatic Bug Triaging Analysis using Machine Learning Techniques: A Review2022 3rd International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT)10.1109/ICICT55121.2022.10064589(1-8)Online publication date: 11-Nov-2022
  • (2022)SPANKnowledge-Based Systems10.1016/j.knosys.2021.107711236:COnline publication date: 25-Jan-2022
  • (2022)An empirical study of COVID-19 related posts on Stack OverflowJournal of Systems and Software10.1016/j.jss.2021.111089182:COnline publication date: 22-Apr-2022
  • (2022)Improving software maintenance with improved bug triagingJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2021.10.01134:10(8757-8764)Online publication date: Nov-2022
  • (2022)Taxonomy of bug tracking process smellsInformation and Software Technology10.1016/j.infsof.2022.106972150:COnline publication date: 4-Aug-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media