[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

IPMM: Cancer Subtype Clustering Model Based on Multiomics Data and Pathway and Motif Information

  • Conference paper
  • First Online:
Advanced Data Mining and Applications (ADMA 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12447))

Included in the following conference series:

Abstract

Multiomics compiles data from different genome levels to study the effects of interactions between various omics molecules on disease processes. Integrated analysis of different omics data can more comprehensively evaluate their role in human health and complex diseases. Previous studies have used SNF and SNF-CC for multiomics integration. Although the effect of multiomics integrative algorithm is significantly increased, these methods did not consider the effects of a biologically significant correlation within and between omics. A large body of evidence has shown that cancer occurs due to interactions and synergistic effects of multiple genes. The correlation relationships between genes can be reflected through gene pathway and motif information. In this paper, we define the IPMM(Integration Pathway and Motif information Model), which combines pathway and motif information with multiomics data to study their effects on cancer subtype classification. To facilitate the use of gene association information, we employ the Isomap method for dimensionality reduction analysis of expression data from the genomes in a pathway and motif. Selection of K values in Isomap dimensionality reduction is used to maximize the presentation of the relationship of genes in pathway and motif data with dimensionality reduced to one. SNF and SNF-CC are used for integrative analysis of gene-expression data, methylation data, miRNA data, and pathway and motif data after dimensionality reduction in two cancer datasets. Results show that clustering effects display varying increases in different methods after pathway and motif information are integrated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 71.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 89.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Yugi, K., Kubota, H., Hatano, A., Kuroda, S.: Trans-omics: how to reconstruct biochemical networks across multiple’omic’ layers. Trends Biotechnol. 34, 276–290 (2016)

    Article  Google Scholar 

  2. Lin, E., Lane, H.Y.: Machine learning and systems genomics approaches for multi-omics data. Biomark. Res. 5, 2 (2017)

    Article  Google Scholar 

  3. Ritchie, M.D., Holzinger, E.R., Li, R., Pendergrass, S.A., Kim, D.: Methods of integrating data to uncover genotype-phenotype interactions. Nat. Rev. Genet. 16, 85–97 (2015)

    Article  Google Scholar 

  4. Guo, Y., Liu, S.: BCDForest: a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data. BMC Bioinf. 19, 118 (2018)

    Article  Google Scholar 

  5. Hasin, Y., Seldin, M.: Multi-omics approaches to disease. Genome Biol. 18, 1–5 (2017)

    Article  Google Scholar 

  6. Torshizi, A.D., Petzold, L.R.: Graph-based semi-supervised learning with genomic data integration using condition-responsive genes applied to phenotype classification. J. Am. Med. Inform. Assoc. 25, 99–108 (2018)

    Article  Google Scholar 

  7. Zhao, J., Cheng, F., Jia, P., Cox, N., Denny, J.C., Zhao, Z.: An integrative functional genomics framework for effective identification of novel regulatory variants in genome-phenome studies. Genome Med. 10, 7 (2018)

    Article  Google Scholar 

  8. Romanowska, J.: From genotype to phenotype: through chromatin. Genes 10(2), 76 (2019)

    Article  Google Scholar 

  9. Chu, S.H., Huang, Y.T.: Integrated genomic analysis of biological gene sets with applications in lung cancer prognosis. BMC Bioinf. 18, 336 (2017)

    Article  Google Scholar 

  10. Yuan, L., Huang, D.S.: A network-guided association mapping approach from DNA methylation to disease. Sci. Rep. 9, 5601 (2019)

    Article  Google Scholar 

  11. Wilk, G., Braun, R.: Integrative analysis reveals disrupted pathways regulated by microRNAs in cancer. Nucleic Acids Res. 46, 1089–1101 (2018)

    Article  Google Scholar 

  12. Jung, K.: Multidimensional Scaling I. In: Wright, J.D. (ed.) International Encyclopedia of the Social & Behavioral Sciences, 2nd edn, pp. 34–39. Elsevier, Oxford (2015)

    Chapter  Google Scholar 

  13. Tenenbaum, J.B.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000)

    Article  Google Scholar 

  14. Shi, J., Luo, Z.: Nonlinear dimensionality reduction of gene expression data for visualization and clustering analysis of cancer tissue samples. Comput. Biol. Med. 40(8), 723–732 https://doi.org/10.1016/j.compbiomed.2010.06.007

  15. Sebastiani, P.: Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data. Mach. Learn. 52(1–2), 91–118 (2003)

    Google Scholar 

  16. Wilkerson, M.D., Hayes, D.N.: ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics 26, 1572–1573 (2010)

    Article  Google Scholar 

  17. Tenenbaum, J.B., De Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000)

    Article  Google Scholar 

  18. Silhouettes, R.P.J.: A graphical aid to the interpretation and validation of cluster analysis, J. Comput. Appl. 20, 53–65 (1987)

    Google Scholar 

  19. Hosmer Jr, D.W., Lemeshow, S.: Applied survival analysis: regression modeling of time to event data. J. Am. Stat. Assoc. (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xuequn Shang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Guo, X., Lu, Y., Yin, Z., Shang, X. (2020). IPMM: Cancer Subtype Clustering Model Based on Multiomics Data and Pathway and Motif Information. In: Yang, X., Wang, CD., Islam, M.S., Zhang, Z. (eds) Advanced Data Mining and Applications. ADMA 2020. Lecture Notes in Computer Science(), vol 12447. Springer, Cham. https://doi.org/10.1007/978-3-030-65390-3_42

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-65390-3_42

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-65389-7

  • Online ISBN: 978-3-030-65390-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics