Quantitative Biology > Quantitative Methods

arXiv:2204.01593 (q-bio)

[Submitted on 31 Mar 2022 (v1), last revised 24 Apr 2022 (this version, v2)]

Title: Optimize Deep Learning Models for Prediction of Gene Mutations Using Unsupervised Clustering

Authors: Zihan Chen, Xingyu Li, Miaomiao Yang, Hong Zhang, Xu Steven Xu

Abstract: Deep learning has become the mainstream methodological choice for analyzing and interpreting whole-slide digital pathology images (WSIs). It is commonly assumed that tumor regions carry most of the predictive information. In this paper, we proposed an unsupervised clustering-based multiple-instance learning method and applied it to develop deep-learning models for predicting gene mutations from WSIs of three cancer types in The Cancer Genome Atlas (TCGA) studies (CRC, LUAD, and HNSCC). We showed that unsupervised clustering of image patches could help identify predictive patches, exclude patches that lack predictive information, and thereby improve prediction of gene mutations in all three cancer types, compared with a WSI-based method without patch selection and with models based only on tumor regions. Additionally, our proposed algorithm outperformed two recently published baseline algorithms that leverage unsupervised clustering to assist model prediction. The unsupervised-clustering-based approach to mutation prediction allows identification of the spatial regions related to mutation of a specific gene via the resolved probability scores, highlighting the heterogeneity of a predicted genotype in the tumor microenvironment. Finally, our study also demonstrated that selecting the tumor regions of WSIs is not always the best way to identify patches for prediction of gene mutations, and that other tissue types in the tumor microenvironment may predict gene mutations better than tumor tissue.
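
The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of clustering-based patch selection, not the authors' pipeline. It assumes each patch is already summarized as a small feature vector and has a patch-level mutation probability from some model; the functions `kmeans` and `slide_score`, the farthest-first initialization, and the rule "average the probabilities of patches in the kept clusters" are all illustrative assumptions.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=50):
    """Plain Lloyd's k-means (hypothetical helper, not from the paper).

    Uses deterministic farthest-first initialization and returns one
    cluster label per point.
    """
    # Farthest-first init: start at the first point, then repeatedly add
    # the point that is farthest from all centers chosen so far.
    centers = [list(points[0])]
    while len(centers) < k:
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        centers.append(list(far))

    labels = None
    for _ in range(iters):
        # Assignment step: each patch goes to its nearest center.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def slide_score(patch_probs, labels, keep_clusters):
    """Average patch-level probabilities over patches in the kept clusters.

    `keep_clusters` is assumed to have been chosen beforehand, for example
    on held-out data; falls back to all patches if none are kept.
    """
    kept = [p for p, l in zip(patch_probs, labels) if l in keep_clusters]
    if not kept:
        kept = list(patch_probs)
    return sum(kept) / len(kept)


# Toy data: four patches whose features form two well-separated groups.
features = [[0.0, 0.1], [0.1, 0.0], [5.0, 5.1], [5.1, 5.0]]
patch_probs = [0.2, 0.4, 0.8, 0.9]  # made-up patch-level probabilities

labels = kmeans(features, k=2)
score = slide_score(patch_probs, labels, keep_clusters={1})
print(labels, round(score, 2))
```

In this toy setup, restricting the slide-level score to one cluster changes the result relative to averaging over all patches, which is the effect patch selection is meant to exploit; how clusters are actually scored and selected is a modelling choice that the abstract does not specify.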

Subjects:	Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2204.01593 [q-bio.QM]
	(or arXiv:2204.01593v2 [q-bio.QM] for this version)
	https://doi.org/10.48550/arXiv.2204.01593

Submission history

From: Xingyu Li
[v1] Thu, 31 Mar 2022 11:48:21 UTC (3,765 KB)
[v2] Sun, 24 Apr 2022 15:01:53 UTC (3,682 KB)
