Computer Science > Machine Learning

arXiv:2303.01433 (cs)

[Submitted on 2 Mar 2023 (v1), last revised 6 Jun 2023 (this version, v2)]

Title: Do Machine Learning Models Learn Statistical Rules Inferred from Data?

Authors: Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

Abstract: Machine learning models can make critical errors that are easily hidden within vast amounts of data. Such errors often run counter to rules based on human intuition. However, rules based on human knowledge are challenging to scale or to even formalize. We thereby seek to infer statistical rules from the data and quantify the extent to which a model has learned them. We propose a framework SQRL that integrates logic-based methods with statistical inference to derive these rules from a model's training data without supervision. We further show how to adapt models at test time to reduce rule violations and produce more coherent predictions. SQRL generates up to 300K rules over datasets from vision, tabular, and language settings. We uncover up to 158K violations of those rules by state-of-the-art models for classification, object detection, and data imputation. Test-time adaptation reduces these violations by up to 68.7% with relative performance improvement up to 32%. SQRL is available at this https URL.
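
To make the idea of "inferring a statistical rule from data and counting a model's violations of it" concrete, here is a minimal toy sketch. It is not SQRL and does not reproduce the paper's rule language or inference procedure. It assumes a hypothetical setting in which each training example is a (label, feature value) pair, and a "rule" is simply the observed range of that feature per label. The function names `infer_range_rules` and `count_violations`, and all data values, are invented for illustration.

```python
def infer_range_rules(train_rows):
    """Map each label to the (min, max) of the feature seen in training.

    Hypothetical stand-in for rule inference: one range rule per label.
    """
    rules = {}
    for label, value in train_rows:
        lo, hi = rules.get(label, (value, value))
        rules[label] = (min(lo, value), max(hi, value))
    return rules


def count_violations(rules, predictions):
    """Count predictions whose feature falls outside the inferred range."""
    violations = 0
    for label, value in predictions:
        if label in rules:
            lo, hi = rules[label]
            if not lo <= value <= hi:
                violations += 1
    return violations


# Made-up data: (label, bounding-box aspect ratio).
train = [("person", 0.3), ("person", 0.5), ("car", 1.8), ("car", 2.4)]
preds = [("person", 0.4), ("person", 2.0), ("car", 2.0), ("car", 0.2)]

rules = infer_range_rules(train)
n_violations = count_violations(rules, preds)
print(rules)
print(n_violations)
```

In this toy setup, a predicted "person" box with an aspect ratio far outside anything seen in training counts as a violation, which mirrors the abstract's notion of predictions that contradict regularities in the training data.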

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2303.01433 [cs.LG]
	(or arXiv:2303.01433v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.01433

Submission history

From: Aaditya Naik
[v1] Thu, 2 Mar 2023 17:47:02 UTC (6,348 KB)
[v2] Tue, 6 Jun 2023 23:22:16 UTC (7,345 KB)
