Abstract
While Information Systems (IS) principles have been successfully applied to the design, implementation and management of a diverse set of domains, the Bioinformatics domain in general and the Genomic one in particular, often lacks a rigorous IS background, based on elaborating a precise Conceptual Model where the relevant concepts of the domain were properly defined. On the contrary, current genomic data repositories focus on the solution space in the form of diverse, ad-hoc databases that use to be hard to manage, evolve and intercommunicate. Conceptual Modeling as a central strategy is then far from the current biological data source ontologies that are heterogeneous, imprecise and too often even inconsistent when compared among them. To solve this problem, a concrete Conceptual Schema for the Human Genome (CSHG) is introduced in its latest version on this chapter. With a holistic perspective, the CSHG focuses on the different genomic views that must be integrated and emphasizes the value of the approach in order to deal appropriately the challenge of correctly interpreting the human genome.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Olivé, A.: Conceptual Modelling of Information Systems. Springer, Heidelberg (2007)
Falkenberg, E., Hesse, W., Lindgreen, W., Nilsson, E., Han, J., Rolland, C., Stamper, R., Van Assche, F., Verrijn-Stuart, A., Voss, K.: A Framework of Information System Concepts. IFIP (1998)
Pastor, O., Molina, J.C.: Model-Driven Architecture in Practice. Springer, Heidelberg (2007)
Thorisson, G.A., Muilu, J., Brookes, A.: Genotype-phenotype databases: challenges and solutions for the post-genomic era. Nature Reviews – Genetics 10 (2009)
Stein, L.: Creating a bioinformatics nation. Nature 417, 119–120 (2002)
Crowd sourcing human mutations (Editorial). Nature Genetics 43(4) (2011)
The Gene Ontology Consortium.: Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000)
http://www.geneontology.org/ (accessed December 05, 2011)
http://www.ncbi.nlm.nih.gov/gene (accessed December 05, 2011)
Hubbard, T., et al.: The Ensembl genome database project. Nucleic Acids Research 30(1), 38–41 (2002)
http://www.ensembl.org (accessed December 05, 2011)
http://www.ncbi.nlm.nih.gov/genbank/ (accessed December 05, 2011)
http://www.hgmd.org/ (accessed December 05, 2011)
Cooper, D.N., Krawczak, M.: Human gene mutation database. Hum. Genet. 98(5), 629 (1996)
Hamosh, A., Scott, A.F., Amberger, J., Valle, D., McKusick, V.A.: Online Mendelian Inheritance in Man (OMIM). Hum. Mutat. 15(1), 57–61 (2000)
http://www.ncbi.nlm.nih.gov/omim (accessed December 05, 2011)
http://www.genome.jp/kegg/ (accessed December 05, 2011)
Ogata, H., Goto, S., Sato, K., Fujibuchi, W., Bono, H., Kanehisa, M.: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 27(1), 29–34 (1999)
Apweiler, R., et al.: UniProt: the Universal Protein knowledgebase. Nucleic Acids Res. 32 (Database issue): D115-9 (2004)
http://www.uniprot.org/ (accessed December 05, 2011)
Apweiler, R., et al.: The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 29(1), 37–40 (2001)
http://www.ebi.ac.uk/interpro/ (accessed December 05, 2011)
Croft, D., et al.: Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. (Database issue): D691-7 (2011)
http://www.reactome.org/ReactomeGWT/entrypoint.html (accessed December 05, 2011)
Pastor, O.: Conceptual Modeling Meets the Human Genome. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds.) ER 2008. LNCS, vol. 5231, pp. 1–11. Springer, Heidelberg (2008)
Paton, W.N., Khan, S., Hayes, A., Moussouni, F., Brass, A., Eilbeck, K., Globe, C., Hubbard, S., Oliver, S.: Conceptual modeling of genomic information. Bioinformatics 16(6), 548–557 (2000)
Ram, S., Wei, W.: Modeling the Semantics of 3D Protein Structures. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 696–708. Springer, Heidelberg (2004)
Garwood, K., Garwood, C., Hedeler, C., Griffiths, T., Swainston, N., Oliver, S., Paton, W.: Model-driven user interface for bioinformatics data resources: regenerating the wheel as an alternative to reinventing it. Bioinformatics 7, 532, 1–14 (2006)
Bornberg-Bauer, E., Paton, N.: Conceptual data modelling for bioinformatics. Briefings in Bioinformatics 3(2), 166–180 (2002)
e-fungi Project, http://www.cs.man.ac.uk/cornell/eFungi/index.html
Hedeler, C., Wong, H.M., Cornell, M.J., Alam, I., Soanes, D., Rattray, M., Hubbrad, S.J., Talbot, N.J., Oliver, S.G., Paton, N.: e-Fungi: a data resource for comparative analysis of fungal genomes. BMC Genomics 8, 426, 1–15 (2007)
Pastor, O., Levin, A., Celma, M., Casamayor, J., Virrueta, A., Eraso, L.: Model-Based Engineering Applied to the Interpretation of the Human Genome. In: Kaschek, R., Delcambre, L. (eds.) The Evolution of Conceptual Modeling. LNCS, vol. 6520, pp. 306–330. Springer, Heidelberg (2011)
Pastor, O., van der Kroon, M., Levin, A., Casamayor, J.C., Celma, M.: A Conceptual Modeling Approach to Improve Human Genome Understanding. In: Embley, D., Thalheim, B. (eds.) Handbook of Conceptual Modeling: Theory, Practice and Research Challenges, pp. 517–541. Springer, Heidelberg (2011)
International Human Genome Sequencing Consortium: Initial sequencing and analysis of the human genome. Nature 409(6822), 860–921 (2001)
Gene Nomenclature Committee, http://www.genenames.org
National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov
Gerstein, M.B., Bruce, C., Rozowsky, J., Zheng, D., Du, J., Korbel, J., Emanuelsson, O., Zhang, Z., Weissman, S., Snyder, M.: What is a gene, post-ENCODE? History and updated definition. Genome Res. 17, 669–681 (2007)
Blitzblau, H.G., Bell, G.W., Rodriguez, J., Bell, S.P., Hochwagen, A.: Mapping of Meiotic Single-Stranded DNA Reveals Double-Strand-Break Hotspots near Centromeres and Telomeres. Current Biology 17(23), 2003–2012 (2007)
Paigen, K., Petkov, P.: Mammalian recombination hot spots: properties, control and evolution. Nature Reviews Genetics 11, 221–233 (2010)
Holmquist, G.P.: Chromosome bands, their chromatin flavors, and their functional features. Am. J. Hum. Genet. 51(1), 17–37 (1992)
Alberts, B., Johnson, A., Lewis, J., Raff, M., Roberts, K., Walter, P.: Molecular Biology of the Cell. Garland Science, New York (2002), http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=mboc4
Collins, F.S.: The Language of Life: DNA and the Revolution in Personalized Medicine. Harper Colllins Publishers (2010)
Wheeler, D.A., et al.: The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872–877 (2008)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Pastor, O., Casamayor, J.C., Celma, M., Mota, L., Pastor, M.Á., Levin, A.M. (2012). Conceptual Modeling of Human Genome: Integration Challenges. In: Düsterhöft, A., Klettke, M., Schewe, KD. (eds) Conceptual Modelling and Its Theoretical Foundations. Lecture Notes in Computer Science, vol 7260. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28279-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-28279-9_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28278-2
Online ISBN: 978-3-642-28279-9
eBook Packages: Computer ScienceComputer Science (R0)