Computer Science > Software Engineering

arXiv:2309.04797 (cs)

[Submitted on 9 Sep 2023]

Title:A Full-fledged Commit Message Quality Checker Based on Machine Learning

Authors:David Faragó, Michael Färber, Christian Petrov

View PDF

Abstract:Commit messages (CMs) are an essential part of version control. By providing important context in regard to what has changed and why, they strongly support software maintenance and evolution. But writing good CMs is difficult and often neglected by developers. So far, there is no tool suitable for practice that automatically assesses how well a CM is written, including its meaning and context. Since this task is challenging, we ask the research question: how well can the CM quality, including semantics and context, be measured with machine learning methods? By considering all rules from the most popular CM quality guideline, creating datasets for those rules, and training and evaluating state-of-the-art machine learning models to check those rules, we can answer the research question with: sufficiently well for practice, with the lowest F$_1$ score of 82.9\%, for the most challenging task. We develop a full-fledged open-source framework that checks all these CM quality rules. It is useful for research, e.g., automatic CM generation, but most importantly for software practitioners to raise the quality of CMs and thus the maintainability and evolution speed of their software.

Comments:	published at COMPSAC'23
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2309.04797 [cs.SE]
	(or arXiv:2309.04797v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2309.04797

Submission history

From: Michael Färber [view email]
[v1] Sat, 9 Sep 2023 13:43:43 UTC (417 KB)

Computer Science > Software Engineering

Title:A Full-fledged Commit Message Quality Checker Based on Machine Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:A Full-fledged Commit Message Quality Checker Based on Machine Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators