[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

The future low-temperature geochemical data-scape as envisioned by the U.S. geochemical community

Published: 01 December 2021 Publication History

Abstract

Data sharing benefits the researcher, the scientific community, and the public by allowing the impact of data to be generalized beyond one project and by making science more transparent. However, many scientific communities have not developed protocols or standards for publishing, citing, and versioning datasets. One community that lags in data management is that of low-temperature geochemistry (LTG). This paper resulted from an initiative from 2018 through 2020 to convene LTG and data scientists in the U.S. to strategize future management of LTG data. Through webinars, a workshop, a preprint, a townhall, and a community survey, the group of U.S. scientists discussed the landscape of data management for LTG – the data-scape. Currently this data-scape includes a “street bazaar” of data repositories. This was deemed appropriate in the same way that LTG scientists publish articles in many journals. The variety of data repositories and journals reflect that LTG scientists target many different scientific questions, produce data with extremely different structures and volumes, and utilize copious and complex metadata. Nonetheless, the group agreed that publication of LTG science must be accompanied by sharing of data in publicly accessible repositories, and, for sample-based data, registration of samples with globally unique persistent identifiers. LTG scientists should use certified data repositories that are either highly structured databases designed for specialized types of data, or unstructured generalized data systems. Recognizing the need for tools to enable search and cross-referencing across the proliferating data repositories, the group proposed that the overall data informatics paradigm in LTG should shift from “build data repository, data will come” to “publish data online, cybertools will find”. Funding agencies could also provide portals for LTG scientists to register funded projects and datasets, and forge approaches that cross national boundaries. The needed transformation of the LTG data culture requires emphasis in student education on science and management of data.

Highlights

Scientists use a wide variety of data repositories for heterogeneous LTG datasets.
Both structured and unstructured databases are needed to store LTG data online.
Powerful search tools and data portals are needed to enable LTG data discovery.

References

[1]
F. Albarede, K. Lehnert, The Scientific Impact of Large Geochemical Data SetsAmerican Geophysical Union Fall Meeting 2019, San Francisco, CA, 2019.
[2]
H.M. Amos, C.F. Miniat, J. Lynch, J. Compton, P.H. Templer, L.A. Sprague, D. Shaw, D. Burns, A. Rea, D. Whitall, L. Myles, D. Gay, M. Nilles, J. Walker, A.K. Rose, J. Bales, J. Deacon, R. Pouyat, What goes up must come down: integrating air and water quality monitoring for nutrients, Environ. Sci. Technol. 52 (2018) 11441–11448.
[3]
APHA, Standard Methods for the Examination of Water and Wastewater, American Public Health Association, Washington D.C, 1998.
[4]
K. Asch, I. Jackson, Commission for the management and application of geoscience information (CGI), Episodes 29 (2006),.
[5]
Aspen Institute, Internet of Water: Sharing and Integrating Water Data for Sustainability, 2017.
[6]
C.A. Ball, G. Sherlock, A. Brazma, Funding high-throughput data sharing, Nat. Biotechnol. 22 (2004) 1179–1183.
[7]
B.J. Benson, B.J. Bond, M.P. Hamilton, R.K. Monson, R. Han, Perspectives on next-generation technology for environmental sensor networks, Front. Ecol. Environ. 8 (2010) 193–2010,.
[8]
K.K. Beratan, B. Peer, N.W. Dunbar, R. Blom, A remote sensing approach to alteration mapping: AVIRIS data and extension-related potassium metasomatism, Socorro, New Mexico, Int. J. Rem. Sens. 18 (1997) 3595–3609.
[9]
K.J. Bergen, P.A. Johnson, M.V. de Hoop, G.C. Beroza, Machine learning for data-driven discovery in solid Earth geoscience, Science 363 (2019) 1299,.
[10]
S.L. Brantley, R.D. Vidic, K. Brasier, D. Yoxtheimer, J. Pollak, C. Wilderman, T. Wen, Engaging over data on fracking and water quality, Science 359 (2018) 395–397.
[11]
K.J. Brasier, K. Jalbert, A.J. Kinchy, S.L. Brantley, C. Unroe, Barriers to sharing water quality data: experiences from the Shale Network, J. Environ. Plann. Manag. (2016) dx.doi.org/10.1080/09640568.2016.1276435.
[12]
R.P. Breckenridge, A.B. Crockett, Determination of background concentrations of inorganics in soils and sediments at hazardous waste sites, Environ. Monit. Assess. 51 (1998) 621–656.
[13]
G.H. Brimhall, W.E. Dietrich, Constitutive mass balance relations between chemical composition, volume, density, porosity, and strain in metasomatic hydrochemical systems: results on weathering and pedogenesis, Geochem. Cosmochim. Acta 51 (1987) 567–587.
[14]
S.W. Christensen, C.C. Brandt, M.K. McCracken, Importance of data management in a long-term biological monitoring program, Environ. Manag. 47 (2009) 1112–1124,.
[15]
COPDESS, Commitment Statement in the Earth, Space, and Environmental Sciences, Coalition for Publishing Data in the Earth and Space Sciences, 2020, https://copdess.org/enabling-fair-data-project/commitment-statement-in-the-earth-space-and-environmental-sciences/.
[16]
CoreTrustSeal.org, CoreTrustSeal Certified Data Repositories, World Data System of the International Science Council and the Data Seal of Approval, 2020, accessed 11/8/2020 https://www.coretrustseal.org/.
[17]
H. Cousijn, A.G.E. Kenall, e. al, A data citation roadmap for scientific publishers, Sci Data 5 (2018) 180259,.
[18]
S.J.D. Cox, ISO 19156:2011 Geographic Information – Observations and Measurements, International Organization for Standardization, 2011.
[19]
CUAHSI, Consortium of Universities for the Advancement of Hydrologic Science Inc. (CUAHSI), CUAHSI Strategic Plan, 2018, https://www.cuahsi.org/uploads/pages/img/StrategicPlan_SinglePages.pdf.
[20]
Data Citation Synthesis Group, Joint declaration of data citation principles, in: M. Martone (Ed.), FORCE11, 2014,. San Diego, CA.
[21]
ESIP Data Preservation; Stewardship Committee (2019): Data citation guidelines for earth science data, ver. 2. Earth science information partners web page. https://doi.org/10.6084/m9.figshare.8441816.
[22]
M. Fleischer, Glossary of Mineral Species, Mineralogical Record, 2018.
[23]
Y. Gil, S.A. Pierce, H. Babaie, A. Banerjee, K. Borne, G. Bust, M. Cheatham, I. Ebert-Uphoff, C. Gomes, M. Hill, J. Horel, L. Hsu, J. Kinter, C. Knoblock, D. Krum, V. Kumar, P. Lermusiaux, Y. Liu, C. North, V. Pankratius, S. Peters, B. Plale, A. Pope, S. Ravela, J. Restrepo, A. Ridley, H. Samet, S. Shekhar, K. Skinner, P. Smyth, B. Tikoff, L. Yarmey, J. Zhang, Intelligent systems for geosciences: an essential research agenda, Commun. ACM 62 (2019) 76–84.
[24]
S. Goldstein, A. Hofmann, K. Lehnert, Requirements for the Publication of Geochemical Data, Version 1.0. . Interdisciplinary Earth Data Alliance (IEDA), 2014,.
[25]
L.C. Gomes, R.M. Faria, E. de Souza, G.V. Veloso, C.E.G. Schaefer, E.I. Fernandes Filho, Modelling and mapping soil organic carbon stocks in Brazil, Geoderma 340 (2019) 337–350.
[26]
J.D. Hemingway, D.H. Rothman, K.E. Grant, S.Z. Rosengard, T.I. Eglinton, L.A. Derry, V.V. Galy, Mineral protection regulates long-term global preservation of natural organic carbon, Nature 570 (2019) 228–231.
[27]
J. Hochella, F. M, D. Mogk, J. Ranville, I. Allen, G. Luther, L. Marr, E.P. McGrail, M. Murayama, N. Qafoku, K. Rosso, N. Sahai, P.A. Schroeder, P. Vikesland, P. Westerhoff, Y. Yang, Natural, incidental, and engineered nanomaterials and their impacts on the Earth system, Science (2019),.
[28]
J.S. Horsburgh, D.G. Tarboton, D.R. Maidment, I. Zaslavsky, Components of an environmental observatory information system, Comput. Geosci. 37 (2011) 207–218,.
[29]
International Federation of Library Associations and Institutions (2020): Archival resource key (ARK). accessed 11/8/2020 https://www.ifla.org/best-practice-for-national-bibliographic-agencies-in-a-digital-age/node/8793.
[30]
E. Kalnay, M. Kanamitsu, R. Kistler, W. Collins, D. Deaven, L. Gandin, M. Iredell, S. Saha, G. White, J. Woollen, Y. Zhu, A. Leetmaa, R. Reynolds, M. Chelliah, W. Ebisuzaki, W. Higgins, J. Janowiak, K.C. Mo, C. Ropelewski, J. Wang, R. Jenne, D. Joseph, The NCEP/NCAR 40-year reanalysis project, Bull. Am. Meteorol. Soc. 77 (1996) 35.
[31]
H. Kim, W.E. Dietrich, B.M. Thurnhoffer, J.K.B. Bishop, I.Y. Fung, Controls on solute concentration-discharge relationships revealed by simultaneous hydrochemistry observations of hillslope runoff and stream flow: the importance of critical zone structure, Water Resour. Res. 53 (2017) 1424–1443.
[32]
K. Lehnert, F. Albarede, The Scientific Impact of Large Geochemical Data SetsAmerican Geophysical Union , 2019, https://agu.confex.com/agu/fm19/meetingapp.cgi/Paper/492556.
[33]
Z. Liu, V. Mantas, J. Wei, M. Jin, D. Meyer, Creating data tool kits that everyone can use, Eos 101 (2020) 25–27,.
[34]
W.K. Michener, Meta-information concepts for ecological information management, Ecol. Inf. 1 (2006) 3–7.
[35]
National Academy of Sciences, Engineering, and Medicine, Assuring Data Quality at U.S. Geological Survey Laboratories, The National Academies Press, Washington, D.C., doi.org/10.17226/25524, 2019.
[36]
(2020): National science foundation biological and chemical oceanography data management office. BCO-DMO https://www.bco-dmo.org/.
[37]
X. Niu, J.Z. Williams, D. Miller, K.A. Lehnert, B. Bills, S.L. Brantley, An ontology driven relational geochemical database for the earth's critical zone: CZchemDB, Journal of Environmental Informatics 23 (2014) 13.
[38]
N.E.T.L., Energy Data eXchange, National Energy Technology Laboratory, U.S. Department of Energy, 2020.
[39]
N.R.C.S., National Cooperative Soil Survey United States Department of Agriculture, Natural Resources Conservation Service, 2020, accessed aa/8/2020 https://websoilsurvey.sc.egov.usda.gov/App/WebSoilSurvey.aspx.
[40]
N. Orlowski, L. Breuer, J.J. McDonnell, Critical issues with cryogenic extraction of soil water for stable isotope analysis, Ecohydrology 9 (2016) 1–5,.
[41]
C.L. Palmer, A.K. Thomer, K.S. Baker, K.M. Wickett, C.L. Hendrix, A. Rodman, S. Sigler, B.W. Fouke, Site-based data curation based on hot spring geobiology, PloS One 12 (2017) 15.
[42]
W.F. Pickering, Selective Chemical Extraction of Soil Components and Bound Metal Species, CRC Critical Reviews in Analytical Chemistry, CRC Press, Boca Raton, FL, 1981.
[43]
J. Podgorski, M. Berg, Global threat of arsenic in groundwater, Science 368 (2020) 845–850.
[44]
re3dataorg (2020): Registry of research data repositories. https://doi.org/10.17616/R3D.
[45]
D.H. Riedl, M.K. Dunn, Quality assurance mechanisms for the unregulated research environment, Trends Biotechnol. 31 (2013) 552–554,.
[46]
J. Ruegg, C. Gries, B. Bond-Lamberty, G.J. Bowen, B.S. Felzer, N.E. McIntyre, P.A. Soranno, K.L. Vanderbilt, K.C. Weathers, Completing the data life cycle: using information management in macrosystems ecology research, Front. Ecol. Environ. 12 (2014) 24–30,.
[47]
Paul Schroeder A, Clays in the Critical Zone, Cambridge University Press, United Kingdom, 2018, p. 246. ISBN: 9781316480083.
[48]
A.R. Shaughnessy, T. Wen, X. Niu, S.L. Brantley, Three principles to use in streamlining water waulity research through data uniformity, Environ. Sci. Technol. (2019),.
[49]
L.A. Sprague, G.P. Oelsner, D.M. Argue, Challenges with secondary use of multi-source water-quality data in the United States, Water Research 110 (2017) 252–261,.
[50]
S. Stall, L. Yarmey, J. Cutcher-Gershenfeld, B. Hanson, K. Lehnert, B. Nosek, M. Parsons, E. Robinson, L. Wyborn, Make scientific data FAIR, Nature 570 (2019) 27–29,.
[51]
C. Tenopir, E.D. Dalton, S. Allard, M. Frame, I. Pjesivac, B. Birch, D. Pollock, K. Dorsett, Changes in data sharing and data reuse practices and perceptions among scientists worldwide, PloS One 10 (2015) 24.
[52]
The FAIRsharing team, FAIRsharing.org, University of Oxford, Oxford e-Research Centre, 2020.
[53]
A.K. Thomer, K.M. Wickett, K.S. Baker, B.W. Fouke, C.L. Palmer, Documenting provenance in noncomputational workflows: research process models based on geobiology fieldwork in Yellowstone National Park, Journal of the Association for Information Science and Technology 69 (2018) 1234–1245.
[54]
U.S. National Academy of Sciences Engineering and Medicine, Investigative Strategies for Lead-Source Attribution at Superfund Sites Associated with Mining Activities, The National Academies Press, Washington D.C, 2017.
[55]
U.S. National Academy of Sciences Engineering and Medicine, Assuring Data Quality at U.S. Geological Survey Laboratories, The National Academies Press, Washington, D.C, 2019,.
[56]
U.S.G.S., Data Citation, The United States Geological Survey (USGS), 2020.
[57]
U.S.G.S. (2020): Data management. United states geological survey. https://www.usgs.gov/products/data-and-tools/data-management/training.
[58]
U.S.G.S. (2020): ScienceBase A U.S. Geological survey trusted digital repository. United states geological survey. accessed 11/8/2020 https://www.sciencebase.gov/catalog/.
[59]
C. Varadharajan, S. Cholia, C. Snavely, V. Hendrix, C. Procopiou, D. Swantek, W.J. Riley, D.A. Agarwal, Launching an accessible archive of environmental data, EOS, Transactions of the American Geophysical Union 100 (2019),.
[60]
T. Wen, Data sharing, Encyclopedia of Big Data, Springer International Publishing, Cham, 2020, pp. 1–3,.
[61]
T. Wen, A. Agarwal, L. Xue, A. Chen, A. Herman, Z. Li, S.L. Brantley, Assessing changes in groundwater chemistry in landscapes with more than 100 years of oil and gas development, Environmental Science Processes and Impacts 21 (2019) 384–396,.
[62]
T. Wen, C. Bandaragoda, L. Harris, Data Science in Earth and Environmental Sciences. HydroLearn, 2020, https://edx.hydrolearn.org/courses/course-v1:SyracuseUniversity+EAR601+2020_Fall/about.
[63]
T. Wen, M. Liu, J. Woda, G. Zheng, S.L. Brantley, Detecting anomalous methane in groundwater within hydrocarbon production areas across the United States, Water Res. 200 (2021) 117236,.
[64]
Mark D. Wilkinson, et al., The FAIR Guiding Principles for scientiic data management and stewardship, Scientific Data 3 (2016) 160018,.
[65]
C.L.S. Wiseman, Analytical methods for assessing metal bioaccessibility in airborne particulate matter: a scoping review, Anal. Chim. Acta 877 (2015) 9–18.
[66]
xDD (2020): Geodeepdive, A digital library and cyberinfrastructure facilitating the discovery and utilization of data and knowledge in published documents. Geodeepdive.org. https://geodeepdive.org/about.html.

Index Terms

  1. The future low-temperature geochemical data-scape as envisioned by the U.S. geochemical community
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image Computers & Geosciences
          Computers & Geosciences  Volume 157, Issue C
          Dec 2021
          299 pages

          Publisher

          Pergamon Press, Inc.

          United States

          Publication History

          Published: 01 December 2021

          Author Tags

          1. Data management
          2. Data repositories
          3. Geochemistry
          4. Metadata
          5. Data sharing
          6. Open science

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 0
            Total Downloads
          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 05 Jan 2025

          Other Metrics

          Citations

          View Options

          View options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media