More Web Proxy on the site http://driver.im/

research-article

VR-DiagNet: Medical Volumetric and Radiomic Diagnosis Networks with Interpretable Clinician-like Optimizing Visual Inspection

Authors:

Nengjun ZhuAuthors Info & Claims

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia

Pages 10459 - 10467

https://doi.org/10.1145/3664647.3680863

Published: 28 October 2024 Publication History

Abstract

Interpretable and robust medical diagnoses are essential traits for practicing clinicians. Most computer-augmented diagnostic systems suffer from three major problems: non-interpretability, limited modality analysis, and narrow focus. Existing frameworks can either deal with multimodality to some extent but suffer from non-interpretability or partially interpretable but provide a limited modality and multifaceted capabilities. Our work aims to integrate all these aspects in one complete framework to fully utilize the full spectrum of information offered by multiple modalities and facets. We propose our solution via our novel architecture VR-DiagNet, consisting of a planner and a classifier, optimized iteratively and cohesively. VR-DiagNet simulates the perceptual process of clinicians via the use of volumetric imaging information integrated with radiomic features modality; at the same time, it recreates human thought processes via a customized Monte Carlo Tree Search (MCTS) which constructs a volume-tailored experience tree to identify slices of interest (SoIs) in our multi-slice perception space. We conducted extensive experiments across two diagnostic tasks comprising six public medical volumetric benchmark datasets. Our findings showcase superior performance, as evidenced by heightened accuracy and area under the curve (AUC) metrics, reduced computational overhead, and expedited convergence while conclusively illustrating the immense value of integrating volumetric and radiomic modalities for our current problem setup.

References

[1]

Julián N. Acosta, Guido J. Falcone, Pranav Rajpurkar, and Eric J. Topol. 2022. Multimodal biomedical AI. Nature Medicine, Vol. 28, 9 (01 Sep 2022), 1773--1784.

[2]

Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer. 2002. Finite-time analysis of the multiarmed bandit problem. Machine learning, Vol. 47 (2002), 235--256.

Digital Library

[3]

Subrato Bharati, M. Rubaiyat Hossain Mondal, and Prajoy Podder. 2024. A Review on Explainable Artificial Intelligence for Healthcare: Why, How, and When? IEEE Transactions on Artificial Intelligence, Vol. 5, 4 (April 2024), 1429--1442. https://doi.org/10.1109/tai.2023.3266418

[4]

Shouyu Chen, Xin Guo, Jianping Zhu, and Yin Wang. 2023. GSDG: Exploring a Global Semantic-Guided Dual-Stream Graph Model for Automated Volume Differential Diagnosis and Prognosis. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 462--471.

Digital Library

[5]

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248--255.

[6]

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, et al. 2020. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations.

[7]

Matthias Feurer, Aaron Klein, Katharina Eggensperger, Jost Springenberg, Manuel Blum, and Frank Hutter. 2015. Efficient and robust automated machine learning. Advances in neural information processing systems, Vol. 28 (2015).

[8]

Xin-Yue Ge, Zhong-Kai Lan, Qiao-Qing Lan, Hua-Shan Lin, Guo-Dong Wang, and Jing Chen. 2023. Diagnostic accuracy of ultrasound-based multimodal radiomics modeling for fibrosis detection in chronic kidney disease. European Radiology, Vol. 33, 4 (2023), 2386--2398.

[9]

Dan Guo, Kun Li, Bin Hu, Yan Zhang, and Meng Wang. 2024. Benchmarking Micro-action Recognition: Dataset, Method, and Application. IEEE Transactions on Circuits and Systems for Video Technology (2024).

[10]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]

Siqi Hong, Hankz Hankui Zhuo, Kebing Jin, Guang Shao, and Zhanwen Zhou. 2023. Retrosynthetic planning with experience-guided Monte Carlo tree search. Communications Chemistry, Vol. 6, 1 (2023), 120.

[12]

Jinseong Jang and Dosik Hwang. 2022. M3T: three-dimensional Medical image classifier using Multi-plane and Multi-slice Transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 20718--20729.

[13]

Haifeng Jin, Qingquan Song, and Xia Hu. 2019. Auto-keras: An efficient neural architecture search system. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 1946--1956.

Digital Library

[14]

Philippe Lambin, Ralph TH Leijenaar, Timo M Deist, Jurgen Peerlings, Evelyn EC De Jong, Janita Van Timmeren, Sebastian Sanduleanu, Ruben THM Larue, Aniek JG Even, Arthur Jochems, et al. 2017. Radiomics: the bridge between medical imaging and personalized medicine. Nature reviews Clinical oncology, Vol. 14, 12 (2017), 749--762.

[15]

Chen Liu, Jinze Cui, Dailin Gan, and Guosheng Yin. 2021. Beyond covid-19 diagnosis: Prognosis with hierarchical graph representation learning. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 283--292.

Digital Library

[16]

Shiyi Liu, Ruofei Ma, Chuyi Zhao, Zhenbang Li, Jianpeng Xiao, and Quan Li. 2023. BPCoach: Exploring Hero Drafting in Professional MOBA Tournaments via Visual Analytics. arXiv preprint arXiv:2311.05912 (2023).

[17]

Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017).

[18]

Dwarikanath Mahapatra, Zongyuan Ge, and Mauricio Reyes. 2022. Self-supervised generalized zero shot learning for medical image classification using novel interpretable saliency maps. IEEE Transactions on Medical Imaging, Vol. 41, 9 (2022), 2443--2456.

[19]

Dwarikanath Mahapatra, Alexander Poellinger, and Mauricio Reyes. 2022. Interpretability-guided inductive bias for deep learning based medical image. Medical image analysis, Vol. 81 (2022), 102551.

[20]

Andriy Marusyk, Vanessa Almendro, and Kornelia Polyak. 2012. Intra-tumour heterogeneity: a looking glass for cancer? Nature reviews cancer, Vol. 12, 5 (2012), 323--334.

[21]

Zhenyuan Ning, Jiaxiu Luo, Qing Xiao, Longmei Cai, Yuting Chen, Xiaohui Yu, Jian Wang, and Yu Zhang. 2021. Multi-modal magnetic resonance imaging-based grading analysis for gliomas by integrating radiomics and deep features. Annals of Translational Medicine, Vol. 9, 4 (2021).

[22]

Zohaib Salahuddin, Henry C Woodruff, Avishek Chatterjee, and Philippe Lambin. 2022. Transparency of deep neural networks for medical image analysis: A review of interpretability methods. Computers in biology and medicine, Vol. 140 (2022), 105111.

[23]

Claude Elwood Shannon. 1948. A mathematical theory of communication. The Bell system technical journal, Vol. 27, 3 (1948), 379--423.

[24]

David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature, Vol. 529, 7587 (2016), 484--489.

[25]

Maciej Świechowski, Konrad Godlewski, Bartosz Sawicki, and Jacek Mańdziuk. 2023. Monte Carlo tree search: A review of recent modifications and applications. Artificial Intelligence Review, Vol. 56, 3 (2023), 2497--2562.

Digital Library

[26]

Aiham Taleb, Winfried Loetzsch, Noel Danz, Julius Severin, Thomas Gaertner, Benjamin Bergner, and Christoph Lippert. 2020. 3d self-supervised methods for medical imaging. Advances in Neural Information Processing Systems, Vol. 33 (2020), 18158--18172.

[27]

Mahbubunnabi Tamal, Maha Alshammari, Meernah Alabdullah, Rana Hourani, Hossain Abu Alola, and Tarek M Hegazi. 2021. An integrated framework with machine learning and radiomics for accurate and rapid early diagnosis of COVID-19 from Chest X-ray. Expert systems with applications, Vol. 180 (2021), 115152.

[28]

Shohei Tanaka, Noriyuki Kadoya, Yuto Sugai, Mariko Umeda, Miyu Ishizawa, Yoshiyuki Katsuta, Kengo Ito, Ken Takeda, and Keiichi Jingu. 2022. A deep learning-based radiomics approach to predict head and neck tumor regression for adaptive radiotherapy. Scientific Reports, Vol. 12, 1 (2022), 8899.

[29]

Joost JM Van Griethuysen, Andriy Fedorov, Chintan Parmar, Ahmed Hosny, Nicole Aucoin, Vivek Narayan, Regina GH Beets-Tan, Jean-Christophe Fillion-Robin, Steve Pieper, and Hugo JWL Aerts. 2017. Computational radiomics system to decode the radiographic phenotype. Cancer research, Vol. 77, 21 (2017), e104--e107.

[30]

Rami S Vanguri, Jia Luo, Andrew T Aukerman, Jacklynn V Egger, Christopher J Fong, Natally Horvat, Andrew Pagano, Jose de Arimateia Batista Araujo-Filho, Luke Geneslaw, Hira Rizvi, et al. 2022. Multimodal integration of radiology, pathology and genomics for prediction of response to PD-(L) 1 blockade in patients with non-small cell lung cancer. Nature cancer, Vol. 3, 10 (2022), 1151--1164.

[31]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems, Vol. 30 (2017).

[32]

Xudong Wang, Shizhong Han, Yunqiang Chen, Dashan Gao, and Nuno Vasconcelos. 2019. Volumetric attention for 3D medical image segmentation and detection. In International conference on medical image computing and computer-assisted intervention. Springer, 175--184.

Digital Library

[33]

Xue Wang, Zhanshan Li, Yongping Huang, and Yingying Jiao. 2022. Multimodal medical image segmentation using multi-scale context-aware network. Neurocomputing, Vol. 486 (2022), 135--146.

Digital Library

[34]

Jinyu Wen, Feiwei Qin, Jiao Du, Meie Fang, Xinhua Wei, CL Philip Chen, and Ping Li. 2023. Msgfusion: Medical semantic guided two-branch network for multimodal brain image fusion. IEEE Transactions on Multimedia (2023).

[35]

Jiancheng Yang, Xiaoyang Huang, Yi He, Jingwei Xu, Canqian Yang, Guozheng Xu, and Bingbing Ni. 2021. Reinventing 2d convolutions for 3d images. IEEE Journal of Biomedical and Health Informatics, Vol. 25, 8 (2021), 3009--3018.

[36]

Jiancheng Yang, Rui Shi, Donglai Wei, Zequan Liu, Lin Zhao, Bilian Ke, Hanspeter Pfister, and Bingbing Ni. 2023. MedMNIST v2-A large-scale lightweight benchmark for 2D and 3D biomedical image classification. Scientific Data, Vol. 10, 1 (2023), 41.

[37]

Jianpeng Zhang, Yutong Xie, Qi Wu, and Yong Xia. 2019. Medical image classification using synergic deep learning. Medical image analysis, Vol. 54 (2019), 10--19.

[38]

Yichi Zhang, Qingcheng Liao, Le Ding, and Jicong Zhang. 2022. Bridging 2D and 3D segmentation networks for computation-efficient volumetric medical image segmentation: An empirical study of 2.5 D solutions. Computerized Medical Imaging and Graphics, Vol. 99 (2022), 102088.

[39]

Xueyi Zheng, Zhao Yao, Yini Huang, Yanyan Yu, Yun Wang, Yubo Liu, Rushuang Mao, Fei Li, Yang Xiao, Yuanyuan Wang, et al. 2020. Deep learning radiomics can predict axillary lymph node status in early-stage breast cancer. Nature communications, Vol. 11, 1 (2020), 1236.

[40]

Zhuoran Zheng and Xiuyi Jia. 2023. Complex Mixer for MedMNIST Classification Decathlon. arXiv preprint arXiv:2304.10054 (2023).

[41]

Alexander Ziller, Alp Güvenir, Ayhan Can Erdur, Tamara T Mueller, Philip Müller, Friederike Jungmann, Johannes Brandt, Jan Peeken, Rickmer Braren, Daniel Rueckert, et al. 2023. Explainable 2D Vision Models for 3D Medical Data. arXiv preprint arXiv:2307.06614 (2023).

[42]

Alex Zwanenburg, Martin Vallières, Mahmoud A Abdalah, Hugo JWL Aerts, Vincent Andrearczyk, Aditya Apte, Saeed Ashrafinia, Spyridon Bakas, Roelof J Beukinga, Ronald Boellaard, et al. 2020. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology, Vol. 295, 2 (2020), 328--338.

Index Terms

VR-DiagNet: Medical Volumetric and Radiomic Diagnosis Networks with Interpretable Clinician-like Optimizing Visual Inspection
1. Computing methodologies
  1. Artificial intelligence
    1. Planning and scheduling
      1. Planning under uncertainty

Recommendations

Multi-region radiomics for artificially intelligent diagnosis of breast cancer using multimodal ultrasound
Abstract Purpose
The ultrasound (US) diagnosis of breast cancer is usually based on a single-region of a whole breast tumor from a single ultrasonic modality, which limits the diagnostic performance. Multiple regions on multimodal US ...
Highlights
- We propose an AI-based diagnosis system with multi-region multimodal radiomics to diagnose breast cancer.
Breast cancer classification through multivariate radiomic time series analysis in DCE-MRI sequences
Abstract
Breast cancer is the most prevalent disease that poses a significant threat to women’s health. Despite the Dynamic Contrast-Enhanced MRI (DCE-MRI) has been widely used for breast cancer classification, its diagnostic performance is still ...
Highlights
- Modeling of breast cancer classification in DCE-MRI through time series algorithms.
- Radiomics enables the extraction of intelligible features also in small-dataset.
- The intelligibility of radiomic features enables accurate and ...
Application of artificial intelligence radiomics in the diagnosis, treatment, and prognosis of hepatocellular carcinoma
Abstract
Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer, with an increasing incidence and poor prognosis. In the past decade, artificial intelligence (AI) technology has undergone rapid development in the field of clinical ...
Graphical abstract

Display Omitted
Highlights
- Artificial intelligence radiomics is useful in the diagnosis, prediction of prognosis, and optimization of individualized treatment for hepatocellular carcinoma.
- Artificial intelligence radiomics is a promising non-invasive tool for ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia

October 2024

11719 pages

ISBN:9798400706868

DOI:10.1145/3664647

General Chairs:
Jianfei Cai
Monash University, Australia
,
Mohan Kankanhalli
NUS, Singapore
,
Balakrishnan Prabhakaran
UT Dallas, USA
,
Susanne Boll
University of Oldenburg, Germany
,
Program Chairs:
Ramanathan Subramanian
University of Canberra & IIT Ropar, Australia
,
Liang Zheng
Australian National University, Australia
,
Vivek K. Singh
Rutgers University, USA
,
Pablo Cesar
Centrum Wiskunde & Informatica, Netherlands
,
Lexing Xie
Australian National University, Australia
,
Dong Xu
University of Hong Kong, Hong Kong

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

MM '24

Sponsor:

SIGMM

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne VIC, Australia

Acceptance Rates

MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
38
Total Downloads

Downloads (Last 12 months)38
Downloads (Last 6 weeks)38

Reflects downloads up to 11 Dec 2024

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents