TY - CHAP
T1 - Geochemical databases
AU - Klöcking, Marthe
AU - Lehnert, Kerstin A.
AU - Wyborn, Lesley
N1 - Publisher Copyright:
© 2025 Elsevier Inc. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Geochemistry is a data-driven discipline. Modern laboratories produce highly diverse data, and the recent exponential increase in data volumes is challenging established practices and capabilities for organizing, analyzing, preserving, and accessing these data. At the same time, sophisticated computational techniques, including machine learning, are increasingly applied to geochemical research questions, which require easy access to large volumes of high-quality, well-organized, and standardized data. Data management has been important since the beginning of geochemistry but has recently become a necessity for the discipline to thrive in the age of digitalization and artificial intelligence. This paper summarizes the landscape of geochemical databases, distinguishing different types of data systems based on their purpose, and their evolution in a historic context. We apply the life cycle model of geochemical data; explain the relevance of current standards, practices, and policies that determine the design of modern geochemical databases and data management; the ethics of data reuse such as data ownership, data attribution, and data citation; and finally create a vision for the future of geochemical databases: data being born digital, connected to agreed community standards, and contributing to global democratization of geochemical data.
AB - Geochemistry is a data-driven discipline. Modern laboratories produce highly diverse data, and the recent exponential increase in data volumes is challenging established practices and capabilities for organizing, analyzing, preserving, and accessing these data. At the same time, sophisticated computational techniques, including machine learning, are increasingly applied to geochemical research questions, which require easy access to large volumes of high-quality, well-organized, and standardized data. Data management has been important since the beginning of geochemistry but has recently become a necessity for the discipline to thrive in the age of digitalization and artificial intelligence. This paper summarizes the landscape of geochemical databases, distinguishing different types of data systems based on their purpose, and their evolution in a historic context. We apply the life cycle model of geochemical data; explain the relevance of current standards, practices, and policies that determine the design of modern geochemical databases and data management; the ethics of data reuse such as data ownership, data attribution, and data citation; and finally create a vision for the future of geochemical databases: data being born digital, connected to agreed community standards, and contributing to global democratization of geochemical data.
KW - Artificial intelligence
KW - CARE
KW - Community standards
KW - Data ethics
KW - Data management
KW - Databases
KW - FAIR
KW - Geochemistry
KW - Machine learning
KW - Machine readable data
KW - Repository
KW - TRUST
UR - http://www.scopus.com/inward/record.url?scp=85218391719&partnerID=8YFLogxK
U2 - 10.1016/B978-0-323-99762-1.00123-6
DO - 10.1016/B978-0-323-99762-1.00123-6
M3 - Chapter
AN - SCOPUS:85218391719
SN - 9780323997621
VL - 8
SP - V8:97-V8:135
BT - Treatise on Geochemistry, Third Edition, 8 Volume Set
PB - Elsevier
ER -