Abstract
Biodiversity datasets are generally stored in different formats. This makes it difficult for biologists to combine and integrate them to retrieve useful information, for example to classify specimens efficiently. In this paper, we present BioKET, a data warehouse that consolidates heterogeneous data sources stored in different formats. For the time being, the scope of BioKET is botanical. Among other things, we had to list all the existing botanical ontologies and relate terms in BioKET to terms in these ontologies. We demonstrate the usefulness of such a resource by applying FIST, a combined biclustering and conceptual association rule extraction method, to a dataset extracted from BioKET to analyze the risk status of plants endemic to Laos. In addition, BioKET may be interfaced with other resources, such as GeoCAT, to provide a powerful analysis tool for biodiversity data.
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Inthasone, S., Pasquier, N., Tettamanzi, A.G.B., da Costa Pereira, C. (2014). The BioKET Biodiversity Data Warehouse: Data and Knowledge Integration and Extraction. In: Blockeel, H., van Leeuwen, M., Vinciotti, V. (eds) Advances in Intelligent Data Analysis XIII. IDA 2014. Lecture Notes in Computer Science, vol 8819. Springer, Cham. https://doi.org/10.1007/978-3-319-12571-8_12
DOI: https://doi.org/10.1007/978-3-319-12571-8_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12570-1
Online ISBN: 978-3-319-12571-8
eBook Packages: Computer Science (R0)