arXiv:2310.09926 (cs)

[Submitted on 15 Oct 2023 (v1), last revised 26 Nov 2023 (this version, v2)]

Title: Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data

Authors: Shiladitya Dutta, Hongbo Wei, Lars van der Laan, Ahmed M. Alaa

Abstract: Foundation models are trained on vast amounts of data using self-supervised learning, enabling adaptation to a wide range of downstream tasks. At test time, these models exhibit zero-shot capabilities through which they can classify previously unseen (user-specified) categories. In this paper, we address the problem of quantifying uncertainty in these zero-shot predictions. We propose a heuristic approach for uncertainty estimation in zero-shot settings using conformal prediction with web data. Given a set of classes at test time, we conduct zero-shot classification with CLIP-style models using a prompt template, e.g., "an image of a <category>", and use the same template as a search query to source calibration data from the open web. Given a web-based calibration set, we apply conformal prediction with a novel conformity score that accounts for potential errors in retrieved web data. We evaluate the utility of our proposed method on biomedical foundation models; our preliminary results show that web-based conformal prediction sets achieve the target coverage with satisfactory efficiency on a variety of biomedical datasets.
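
To make the pipeline in the abstract concrete, here is a minimal sketch of split conformal prediction applied to zero-shot class probabilities. It is an illustration under stated assumptions, not the authors' method: it uses the standard conformity score (one minus the probability of the labeled class) rather than the paper's novel score for noisy web data, and the function names `build_prompts`, `conformal_threshold`, and `prediction_sets` are hypothetical. The class probabilities are assumed to come from a CLIP-style model, and the calibration labels are assumed to be the categories whose prompts were used as search queries.

```python
import numpy as np

# Template quoted in the abstract; used both as the zero-shot prompt and as the
# web search query for calibration images.
PROMPT_TEMPLATE = "an image of a {}"


def build_prompts(categories):
    """Fill the prompt template for each user-specified category (hypothetical helper)."""
    return [PROMPT_TEMPLATE.format(c) for c in categories]


def conformal_threshold(cal_probs, cal_labels, alpha):
    """Return the score threshold for a target coverage of 1 - alpha.

    cal_probs:  (n, K) array of class probabilities on the calibration set.
    cal_labels: (n,) integer array of calibration labels.
    Uses the standard score 1 - p(label), NOT the paper's noise-aware score.
    """
    n = len(cal_labels)
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    # Finite-sample corrected rank: the ceil((n + 1)(1 - alpha))-th smallest score.
    k = int(np.ceil((n + 1) * (1 - alpha)))
    if k > n:
        # Too few calibration points for this alpha: every class must be included.
        return np.inf
    return np.sort(scores)[k - 1]


def prediction_sets(test_probs, qhat):
    """Return, for each test row, the indices of classes whose score is within qhat."""
    return [np.flatnonzero(1.0 - p <= qhat) for p in test_probs]


# Toy usage with made-up probabilities for two categories.
categories = ["chest x-ray", "skin lesion"]
prompts = build_prompts(categories)
cal_probs = np.array([[0.9, 0.1], [0.8, 0.2], [0.3, 0.7], [0.6, 0.4]])
cal_labels = np.array([0, 0, 1, 0])
qhat = conformal_threshold(cal_probs, cal_labels, alpha=0.5)
sets = prediction_sets(np.array([[0.75, 0.25], [0.5, 0.5]]), qhat)
```

With clean, exchangeable calibration data this construction gives marginal coverage of at least 1 - alpha. Web-retrieved images may be mislabeled or drawn from a different distribution, which is the gap the paper's modified conformity score is meant to address.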

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2310.09926 [cs.AI]
	(or arXiv:2310.09926v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2310.09926

Submission history

From: Shiladitya Dutta
[v1] Sun, 15 Oct 2023 19:24:52 UTC (4,553 KB)
[v2] Sun, 26 Nov 2023 05:54:48 UTC (4,558 KB)
