Computer Science > Software Engineering

arXiv:2406.15676 (cs)

[Submitted on 21 Jun 2024]

Title:Inferring Pluggable Types with Machine Learning

Authors:Kazi Amanul Islam Siddiqui, Martin Kellogg

Abstract:Pluggable type systems allow programmers to extend the type system of a programming language to enforce semantic properties defined by the programmer. Pluggable type systems are difficult to deploy in legacy codebases because they require programmers to write type annotations manually. This paper investigates how to use machine learning to infer type qualifiers automatically. We propose a novel representation, NaP-AST, that encodes minimal dataflow hints for the effective inference of type qualifiers. We evaluate several model architectures for inferring type qualifiers, including a Graph Transformer Network (GTN), a Graph Convolutional Network, and a Large Language Model. We further validate these models by applying them to 12 open-source programs from a prior evaluation of the NullAway pluggable typechecker, reducing the number of warnings in all but one unannotated project. We find that the GTN performs best, with a recall of 0.89 and a precision of 0.6. Furthermore, we conduct a study to estimate the number of Java classes needed for good performance of the trained model. In our feasibility study, performance improved at around 16k classes and deteriorated, due to overfitting, at around 22k classes.

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.15676 [cs.SE]
	(or arXiv:2406.15676v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2406.15676

Submission history

From: Kazi Amanul Islam Siddiqui
[v1] Fri, 21 Jun 2024 22:32:42 UTC (181 KB)
