[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3590837.3590918acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimmiConference Proceedingsconference-collections
research-article

A Review on Software Defect Prediction Using Machine Learning

Published: 30 May 2023 Publication History

Abstract

Software plays an important role in many of the systems and devices that make up our modern societies. In order to provide their customers with software of a higher quality in a shorter amount of time, numerous software companies are developing software systems of varying sizes for various purposes. It is too challenging to produce high-quality software in a shorter amount of time due to the constraints of software development and the growing size of software data. Therefore, prior to delivering the software product, defect prediction can significantly contribute to a project's success in terms of; cost and quality to evaluate the quality of their software. The goal of the literature review is to investigate about the current trends of software defect prediction approaches. Conclusion of the literature review introduce that many machine learning algorithms are implemented named with Random forest, Logistic regression, Naïve Bayes and Artificial neutral Network etc. with different software metrics like CK metrics, Source code metric etc. The performance measurement of the model done by various methods like accuracy, precision etc.

References

[1]
Erturk, E., & Sezer, E. A. (2015). A comparison of some soft computing methods for software fault prediction. Expert systems with applications, 42(4), 1872-1879.
[2]
Caglayan, B., Tosun Misirli, A., Bener, A. B., & Miranskyy, A. (2015). Predicting defective modules in different test phases. Software Quality Journal, 23(2), 205-227.
[3]
Couto, C., Pires, P., Valente, M. T., Bigonha, R. S., & Anquetil, N. (2014). Predicting software defects with causality tests. Journal of Systems and Software, 93, 24-41.
[4]
Malhotra, R., & Jain, A. (2012). Fault prediction using statistical and machine learning methods for improving software quality. Journal of Information Processing Systems, 8(2), 241-262.
[5]
Park, M., & Hong, E. (2014). Software fault prediction model using clustering algorithms determining the number of clusters automatically. International Journal of Software Engineering and Its Applications, 8(7), 199-204.
[6]
Dejaeger, K., Verbraken, T., & Baesens, B. (2012). Toward comprehensible software fault prediction models using bayesian network classifiers. IEEE Transactions on Software Engineering, 39(2), 237-257.
[7]
Peters, F., Menzies, T., & Marcus, A. (2013, May). Better cross company defect prediction. In 2013 10th Working Conference on Mining Software Repositories (MSR) (pp. 409-418). IEEE.
[8]
Li, M., Zhang, H., Wu, R., & Zhou, Z. H. (2012). Sample-based software defect prediction with active and semi-supervised learning. Automated Software Engineering, 19(2), 201-230.
[9]
Ma, Y., Luo, G., Zeng, X., & Chen, A. (2012). Transfer learning for cross-company software defect prediction. Information and Software Technology, 54(3), 248-256.
[10]
Chatterjee, S., Nigam, S., Singh, J. B., & Upadhyaya, L. N. (2012). Software faultprediction using Nonlinear Autoregressive with exogenous Inputs (NARX) network. Applied Intelligence, 37(1), 121-129.
[11]
Shin, Y., Meneely, A., Williams, L., & Osborne, J. A. (2010). Evaluating complexity, code churn, and developer activity metrics as indicators of software vulnerabilities. IEEE transactions on software engineering, 37(6), 772-787.
[12]
Pandey, A. K., & Goyal, N. K. (2010). Predicting fault-prone software module using data mining technique and fuzzy logic. International Journal of Computer and Communication Technology, 2(2), 56-63.
[13]
Catal, C., & Diri, B. (2007, July). Software fault prediction with object-oriented metrics based artificial immune recognition system. In International Conference on Product Focused Software Process Improvement (pp. 300-314). Springer, Berlin, Heidelberg.
[14]
Seliya, N., & Khoshgoftaar, T. M. (2007). Software quality estimation with limited fault data: a semi-supervised learning perspective. Software Quality Journal, 15(3), 327- 344.
[15]
Jiang, Y., Cukic, B., & Ma, Y. (2008). Techniques for evaluating fault prediction models. Empirical Software Engineering, 13(5), 561-595.
[16]
Kanmani, S., Uthariaraj, V. R., Sankaranarayanan, V., & Thambidurai, P. (2007). Object-oriented software fault prediction using neural networks. Information and software technology, 49(5), 483-492.
[17]
Challagulla, V. U. B., Bastani, F. B., Yen, I. L., & Paul, R. A. (2008). Empirical assessment of machine learning based software defect prediction techniques. International Journal on Artificial Intelligence Tools, 17(02), 389-400.
[18]
Challagulla, V. U., Bastani, F. B., & Yen, I. L. (2006, November). A unified framework for defect data analysis using the mbr technique. In 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06) (pp. 39-46). IEEE.
[19]
Ohlsson N, Zhao M, Helander M (1998) Application of multivariate analysis for software fault prediction. Softw Qual J 7(1):51–66.
[20]
Vandecruys, O., Martens, D., Baesens, B., Mues, C., De Backer, M., & Haesen, R. (2008). Mining software repositories for comprehensible software fault prediction models. Journal of Systems and software, 81(5), 823-839.
[21]
Turhan, B., & Bener, A. (2009). Analysis of Naive Bayes’ assumptions on software fault data: An empirical study. Data & Knowledge Engineering, 68(2), 278-290.
[22]
Menzies, T., Greenwald, J., & Frank, A. (2006). Data mining static code attributes to learn defect predictors. IEEE transactions on software engineering, 33(1), 2-13.
[23]
Rathore, S. S., & Kumar, S. (2016). A decision tree regression based approach for the number of software faults prediction. ACM SIGSOFT Software Engineering Notes, 41(1), 1-6.
[24]
Yadav, H. B., & Yadav, D. K. (2015). A fuzzy logic based approach for phase-wise software defects prediction using software metrics. Information and Software Technology, 63, 44-57.
[25]
Ostrand, T. J., Weyuker, E. J., & Bell, R. M. (2010, September). Programmer-based fault prediction. In Proceedings of the 6th International Conference on Predictive Models in Software Engineering (pp. 1-10).
[26]
Cruz, A. E. C., & Ochimizu, K. (2009, October). Towards logistic regression models for predicting fault-prone code across software projects. In 2009 3rd international symposium on empirical software engineering and measurement (pp. 460- 463). IEEE.
[27]
Janes, A., Scotto, M., Pedrycz, W., Russo, B., Stefanovic, M., & Succi, G. (2006). Identification of defect-prone classes in telecommunication software systems using design metrics. Information sciences, 176(24), 3711-3734.
[28]
Pan, C., Lu, M., Xu, B., & Gao, H. (2019). An improved CNN model for withinproject software defect prediction. Applied Sciences, 9(10), 2138.
[29]
Manjula, C., & Florence, L. (2019). Deep neural network based hybrid approach for software defect prediction using software metrics. Cluster Computing, 22(4), 9847-9863.
[30]
Özakıncı, R., & Tarhan, A. (2018). Early software defect prediction: A systematic map and review. Journal of Systems and Software, 144, 216-239.
[31]
Gao, K., Khoshgoftaar, T. M., Wang, H., & Seliya, N. (2011). Choosing software metrics for defect prediction: an investigation on feature selection techniques. Software: Practice and Experience, 41(5), 579-606.
[32]
Shippey, T., Bowes, D., & Hall, T. (2019). Automatically identifying code features for software defect prediction: Using AST N-grams. Information and Software Technology, 106, 142-160

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIMMI '22: Proceedings of the 4th International Conference on Information Management & Machine Intelligence
December 2022
749 pages
ISBN:9781450399937
DOI:10.1145/3590837
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 May 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Datasets
  2. Machine Learning
  3. Software Defect Prediction
  4. Software Metrics
  5. Statement Level

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICIMMI 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 89
    Total Downloads
  • Downloads (Last 12 months)47
  • Downloads (Last 6 weeks)2
Reflects downloads up to 24 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media