Coupling data-driven geochemical analysis and ensemble machine learning for automatic identification of oceanic anoxic events

Cited by: 1
Authors
Allam, Sherif [1]
Al-Ramadan, Khalid [1, 2]
Koeshidayatullah, Ardiansyah [1, 2]
Affiliations
[1] King Fahd Univ Petr & Minerals, Coll Petr Engn & Geosci, Dept Geosci, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Coll Petr Engn & Geosci, Ctr Integrat Petr Res, Dhahran, Saudi Arabia
Keywords
Anoxic; OAE; machine learning; AI; Geochemistry; REDOX CONDITIONS; ORGANIC-CARBON; TETHYS; BURIAL; SEDIMENTARY; RECORD; LEVEL;
DOI
10.1016/j.jseaes.2024.106027
Chinese Library Classification (CLC) number
P [Astronomy, Earth Sciences]
Discipline classification code
07
Abstract
Oceanic Anoxic Events (OAEs) have been recorded across the Phanerozoic and linked with catastrophic events in the geological record, including the massive release of CO2 into the atmosphere and mass extinctions of marine animals, particularly during the Cretaceous period (e.g., OAE-2). The occurrence of OAEs has typically been identified based on the deposition of organic-rich black shales, carbon isotopic excursions, and enrichment of redox-sensitive elements. While various OAE intervals have been extensively studied across the Tethys using multiproxy geochemical records, recognizing the expression and understanding the duration of these events remain challenging and a subject of active debate. This is further compounded by the time-consuming and expert-demanding analysis needed to interpret the complex geochemical records associated with OAEs. To address these issues, we propose a novel approach that couples data-driven geochemical analysis and ensemble machine learning to recognize and predict the occurrence of OAE-2 in the Upper Cretaceous based on key geochemical records (δ13Corg, TOC, Mo, V, U) collected from geographically different areas. Considering variation in data availability and completeness, we performed machine learning-based data imputation to fill the gaps in geochemical records without perturbing the overall trends and patterns. With this, our ensemble machine learning prediction of OAEs in various locations achieved an accuracy of up to 90% in validation and 78% in blind test predictions. The model could also match the interpreted OAE-2 intervals from different locations with higher-resolution prediction, with δ13Corg and TOC as the most important parameters, followed by the redox-sensitive elements. This suggests that the model utilized parameters similar to those used by geologists in identifying OAEs, increasing the model's interpretability. The application of machine learning and data-driven geochemical analysis could help provide robust and time-efficient identification of OAEs and find new, unexplored OAEs along the stratigraphic record.
Pages: 11