Development and Application of the Branched and Isoprenoid GDGT Machine Learning Classification Algorithm (BIGMaC) for Paleoenvironmental Reconstruction
Martínez-Sosa, Pablo; Tierney, Jessica E.; Pérez-Angel, Lina C.; Stefanescu, Ioana C.; Guo, Jingjing; Kirkels, Frédérique; Sepúlveda, Julio; Peterse, Francien; Shuman, Bryan N.; Reyes, Alberto V.
(2023) Paleoceanography and Paleoclimatology, volume 38, issue 7, pp. 1 - 21
(Article)
Abstract
Glycerol dialkyl glycerol tetraethers (GDGTs), both archaeal isoprenoid GDGTs (isoGDGTs) and bacterial branched GDGTs (brGDGTs), have been used in paleoclimate studies to reconstruct environmental conditions. Since GDGTs are produced in many types of environments, their relative abundances also depend on the depositional setting. This suggests that the distribution of GDGTs
... read more
also preserves useful information that can be used more broadly to infer these depositional environments in the geological past. Here, we combined existing iso- and brGDGT relative abundance data with newly analyzed samples to generate a database of 1,153 samples from several modern sedimentary settings. We observed a robust relationship between the depositional environment and the relative abundances of GDGTs in our samples. This data set was used to train and test the Branched and isoGDGT Machine learning Classification (BIGMaC) algorithm, which identifies the environment a sample comes from based on the distribution of GDGTs with high precision and recall (F1 = 0.95). We tested the model on the sedimentary record from the Giraffe kimberlite pipe, an Eocene maar in subantarctic Canada, and found that the BIGMaC reconstruction agrees with independent stratigraphic and palynological information, provides new information about the paleoenvironment of this site, and helps improve its paleotemperature reconstruction. In contrast, we also include an example from the PETM-aged Cobham lignite as a cautionary example that illustrates the limitations of the algorithm. We propose that in cases where paleoenvironments are unknown or are changing, BIGMaC can be applied in concert with other proxies to generate more refined paleoclimate records.
show less
Download/Full Text
Embargo until January 05 2024
Keywords: GDGTs, machine learning, paleoenvironment, Taverne, Oceanography, Atmospheric Science, Palaeontology
ISSN: 2572-4517
Publisher: Wiley Online Library
Note: Funding Information: We would like to thank Patrick Murphy for his assistance with the lipid analysis, Dr. Jeffrey Donnelly and the Woods Hole Oceanographic Institution Seafloor Samples Laboratory for access to marine sediment samples, and Dr. Cody Routson for contributing Alaskan lake samples. The hyperparameter tuning of the models was performed using the Ocelote cluster from the University of Arizona. This research was funded by the American Chemical Society Petroleum Research Fund, Grant 60772‐ND2, and by CONACYT through the student scholarship 440897. Ioana Stefanescu and Bryan Shuman acknowledge support from the Microbial Ecology Collaborative Project through the National Science Foundation grant EPS‐1655726. Francien Peterse acknowledges funding from the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO) through Veni Grant 863.13.016 and Vidi Grant 192.074. Lina Pérez‐Ángel and Julio Sepúlveda acknowledge support from NSF Sedimentary Geology and Paleobiology Grant 1929199. We also thank Serhiy Buryak for assisting with the sampling of the Giraffe pipe sediments. Funding Information: We would like to thank Patrick Murphy for his assistance with the lipid analysis, Dr. Jeffrey Donnelly and the Woods Hole Oceanographic Institution Seafloor Samples Laboratory for access to marine sediment samples, and Dr. Cody Routson for contributing Alaskan lake samples. The hyperparameter tuning of the models was performed using the Ocelote cluster from the University of Arizona. This research was funded by the American Chemical Society Petroleum Research Fund, Grant 60772-ND2, and by CONACYT through the student scholarship 440897. Ioana Stefanescu and Bryan Shuman acknowledge support from the Microbial Ecology Collaborative Project through the National Science Foundation grant EPS-1655726. Francien Peterse acknowledges funding from the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO) through Veni Grant 863.13.016 and Vidi Grant 192.074. Lina Pérez-Ángel and Julio Sepúlveda acknowledge support from NSF Sedimentary Geology and Paleobiology Grant 1929199. We also thank Serhiy Buryak for assisting with the sampling of the Giraffe pipe sediments. Publisher Copyright: © 2023. American Geophysical Union. All Rights Reserved.
(Peer reviewed)
See more statistics about this item