Out of Distribution Detection for Medical Images

Oliver Zhang²⁰,
Jean-Benoit Delbrouck²⁰ &
Daniel L. Rubin²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12959))

Included in the following conference series:

International Workshop on Uncertainty for Safe Utilization of Machine Learning in Medical Imaging
International Workshop on Preterm, Perinatal and Paediatric Image Analysis

2090 Accesses
5 Citations

Abstract

Neural network architectures behave in unpredictable ways when testing on inputs which do not resemble their training data. It is valuable to detect any out-of-distribution (OOD) inputs to make any overseers aware of the limitations of the model’s output. To address this need, a large number of methods for detecting OOD inputs have been proposed and tested on small datasets such as CIFAR10, SVHN, or LSUN. The purpose of this study is to determine the effectiveness of different methods for OOD detection on the domain of medical images. We investigate three common OOD detection methods (Maximum Softmax Probability, Confidence Branch, and Outlier Exposure) and report their effectiveness on widely used medical image datasets. We find that OOD detection metrics are volatile and can have large changes in performance in a short amount of training steps. Moreover, we also observe that OOD detection is sensitive to the choice of hyperparameters. Our code is reproducible at this link (https://github.com/oliverzhang42/ood_medical_images).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 43.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 54.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Calibrated Resampling for Imbalanced and Long-Tails in Deep Learning

AUAAC: Area Under Accuracy-Accuracy Curve for Evaluating Out-of-Distribution Detection

Abstract: MOOD 2020

References

Bendale, A., Boult, T.: Towards open set deep networks. arXiv:1511.06233 [cs], November 2015. http://arxiv.org/abs/1511.06233
California Healthcare Foundation, EyePACS: Diabetic Retinopathy Detection (2015). https://www.kaggle.com/c/diabetic-retinopathy-detection/overview
Cao, T., Huang, C.W., Hui, D.Y.T., Cohen, J.P.: A benchmark of medical out of distribution detection (2020)
Google Scholar
DeVries, T., Taylor, G.W.: Learning confidence for out-of-distribution detection in neural networks. arXiv preprint arXiv:1802.04865 (2018)
Gal, Y., Ghahramani, Z.: Dropout as a bayesian approximation: representing model uncertainty in deep learning, October 2016. https://arxiv.org/abs/1506.02142
Gao, L., Wu, S.: Response score of deep learning for out-of-distribution sample detection of medical images. J. Biomed. Inform. 107, 103442 (2020). https://doi.org/10.1016/j.jbi.2020.103442
Article Google Scholar
Halabi, S.S., et al.: The RSNA pediatric bone age machine learning challenge. Radiology 290(2), 498–503 (2019)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 (2015)
Hendrycks, D., Gimpel, K.: A baseline for detecting misclassified and out-of-distribution examples in neural networks. In: International Conference on Learning Representations (2016)
Google Scholar
Hendrycks, D., Mazeika, M., Dietterich, T.: Deep anomaly detection with outlier exposure. In: International Conference on Learning Representations (2018)
Google Scholar
Henriksson, J., Berger, C., Borg, M., Tornberg, L., Raman Sathyamoorthy, S., Englund, C.: Performance analysis of out-of-distribution detection on trained neural networks. Inform. Softw. Technol. 130, 106409 (2021). https://doi.org/10.1016/j.infsof.2020.106409
Article Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167 [cs], March 2015. http://arxiv.org/abs/1502.03167
Johnson, A.E., et al.: MIMIC-CXR-JPG, a large publicly available database of labeled chest radiographs. arXiv preprint arXiv:1901.07042 (2019)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization, January 2017. https://arxiv.org/abs/1412.6980
Krizhevsky, A.: Learning multiple layers of features from tiny images. University of Toronto (2009)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
Google Scholar
Lakshminarayanan, B., Pritzel, A., Blundell, C.: Simple and scalable predictive uncertainty estimation using deep ensembles, November 2017. https://arxiv.org/abs/1612.01474
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998). https://doi.org/10.1109/5.726791
Article Google Scholar
Lee, K., Lee, H., Lee, K., Shin, J.: Training confidence-calibrated classifiers for detecting out-of-distribution samples. In: International Conference on Learning Representations (2018)
Google Scholar
Li, X., Lu, Y., Desrosiers, C., Liu, X.: Out-of-distribution detection for skin lesion images with deep isolation forest. In: Liu, M., Yan, P., Lian, C., Cao, X. (eds.) MLMI 2020. LNCS, vol. 12436, pp. 91–100. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59861-7_10
Chapter Google Scholar
Liang, S., Li, Y., Srikant, R.: Enhancing the reliability of out-of-distribution image detection in neural networks. In: International Conference on Learning Representations (2018)
Google Scholar
Linmans, J., van der Laak, J., Litjens, G.: Efficient out-of-distribution detection in digital pathology using multi-head convolutional neural networks. In: Arbel, T., Ben Ayed, I., de Bruijne, M., Descoteaux, M., Lombaert, H., Pal, C. (eds.) Proceedings of the Third Conference on Medical Imaging with Deep Learning. Proceedings of Machine Learning Research, vol. 121, pp. 465–478. PMLR, 06–08 July 2020. http://proceedings.mlr.press/v121/linmans20a.html
Nguyen, A., Yosinski, J., Clune, J.: Deep neural networks are easily fooled: high confidence predictions for unrecognizable images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 427–436 (2015)
Google Scholar
Rajpurkar, P., et al.: MURA: large dataset for abnormality detection in musculoskeletal radiographs. arXiv preprint arXiv:1712.06957 (2017)
Roady, R., Hayes, T.L., Kemker, R., Gonzales, A., Kanan, C.: Are open set classification methods effective on large-scale datasets? Plos One 15(9) (2020). https://doi.org/10.1371/journal.pone.0238302
Wang, N., Chen, C., Xie, Y., Ma, L.: Brain tumor anomaly detection via latent regularized adversarial network. CoRR abs/2007.04734 (2020). https://arxiv.org/abs/2007.04734
Wu, J., Zhang, Q., Xu, G.: Tiny ImageNet challenge. Technical report, Stanford University (2017)
Google Scholar
Yu, F., Seff, A., Zhang, Y., Song, S., Funkhouser, T., Xiao, J.: LSUN: construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365 (2015)

Download references

Author information

Authors and Affiliations

Stanford University, Stanford, CA, USA
Oliver Zhang, Jean-Benoit Delbrouck & Daniel L. Rubin

Authors

Oliver Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Benoit Delbrouck
View author publications
You can also search for this author in PubMed Google Scholar
Daniel L. Rubin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Oliver Zhang .

Editor information

Editors and Affiliations

University College London/King's College London, London, UK
Carole H. Sudre
Medical University of Vienna and TU Wien, Vienna, Austria
Roxane Licandro
University of Tübingen, Tübingen, Germany
Christian Baumgartner
King's College London, London, UK
Andrew Melbourne
Massachusetts General Hospital, Harvard Medical School, MIT, Cambridge, MA, USA
Adrian Dalca
King's College London, London, UK
Jana Hutter
Microsoft Research/University College London, London, UK
Ryutaro Tanno
Boston Children's Hospital, Boston, MA, USA
Esra Abaci Turk
Technical University Denmark, Kongens Lyngby, Denmark
Koen Van Leemput
Hewlett Packard, Barcelona, Spain
Jordina Torrents Barrena
Harvard Medical School/Brigham and Women's Hospital, Boston, MA, USA
William M. Wells
The Hospital For Sick Children, University of Toronto, Toronto, ON, Canada
Christopher Macgowan

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 298 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, O., Delbrouck, JB., Rubin, D.L. (2021). Out of Distribution Detection for Medical Images. In: Sudre, C.H., et al. Uncertainty for Safe Utilization of Machine Learning in Medical Imaging, and Perinatal Imaging, Placental and Preterm Image Analysis. UNSURE PIPPI 2021 2021. Lecture Notes in Computer Science(), vol 12959. Springer, Cham. https://doi.org/10.1007/978-3-030-87735-4_10

Download citation

DOI: https://doi.org/10.1007/978-3-030-87735-4_10
Published: 25 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87734-7
Online ISBN: 978-3-030-87735-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Out of Distribution Detection for Medical Images

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Calibrated Resampling for Imbalanced and Long-Tails in Deep Learning

AUAAC: Area Under Accuracy-Accuracy Curve for Evaluating Out-of-Distribution Detection

Abstract: MOOD 2020

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 298 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

Out of Distribution Detection for Medical Images

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Calibrated Resampling for Imbalanced and Long-Tails in Deep Learning

AUAAC: Area Under Accuracy-Accuracy Curve for Evaluating Out-of-Distribution Detection

Abstract: MOOD 2020

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 298 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation