
Component-based end-user database design for ecologists

Published: 01 August 2007

Abstract

To solve today's ecological problems, scientists need well documented, validated, and coherent data archives. Historically, however, ecologists have collected and stored data idiosyncratically, making data integration even among close collaborators difficult. Further, effective ecology data warehouses and subsequent data mining require that individual databases be accurately described with metadata against which the data themselves have been validated. Using database technology would make documenting data sets for archiving, integration, and data mining easier, but few ecologists have expertise to use database technology and they cannot afford to hire programmers. In this paper, we identify the benefits that would accrue from ecologists' use of modern information technology and the obstacles that prevent that use. We describe our prototype, the Canopy DataBank, through which we aim to enable individual ecologists in the forest canopy research community to be their own database programmers. The key feature that makes this possible is domain-specific database components, which we call templates. We also show how additional tools that reuse these components, such as for visualization, could provide gains in productivity and motivate the use of new technology. Finally, we suggest ways in which communities might share database components and how components might be used to foster easier data integration to solve new ecological problems.


Published In

Journal of Intelligent Information Systems, Volume 29, Issue 1
August 2007
142 pages

Publisher

Kluwer Academic Publishers

United States

Author Tags

  1. Domain-specific data structures
  2. Ecosystem informatics
  3. End-user programming
  4. Scientific visualization
  5. Spatial databases

Qualifiers

  • Article


Cited By

  • (2018) Research data management in the field of Ecology. Proceedings of the 2018 International Conference on Dublin Core and Metadata Applications, pp. 87-94. DOI: 10.5555/3308533.3308548. Online publication date: 10-Sep-2018
  • (2011) Building eCommerce systems from shared micro-schemas. Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems - Volume Part I, pp. 284-301. DOI: 10.5555/2074356.2074382. Online publication date: 17-Oct-2011
  • (2008) Visualizing tree crowns for forest managers. Proceedings of the 2008 international conference on Digital government research, pp. 411-412. DOI: 10.5555/1367832.1367918. Online publication date: 18-May-2008
  • (2008) Indirectly driven knowledge modelling in ecology. International Journal of Metadata, Semantics and Ontologies, 3:3, pp. 210-225. DOI: 10.1504/IJMSO.2008.023569. Online publication date: 1-Feb-2008
  • (2007) From measurement to management. Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains, pp. 272-273. DOI: 10.5555/1248460.1248515. Online publication date: 20-May-2007
