Event recognition in marine seismological data using Random Forest machine learning classifier

被引:2
|
作者
Domel, Przemyslaw [1 ]
Hibert, Clement [2 ]
Schlindwein, Vera [3 ,4 ]
Plaza-Faverola, Andreia [1 ]
机构
[1] UiT Arctic Univ Norway, Dept Geosci, Dramsvegen 201, N-9010 Tromso, Norway
[2] Univ Strasbourg, ITES Inst Terre & Environm Strasbourg, CNRS UMR7063, CNRS, 5 Rue Descartes, F-67084 Strasbourg, France
[3] Alfred Wegener Inst, Helmholtz Ctr Polar & Marine Res, Alten Hafen 26, D-27568 Bremerhaven, Germany
[4] Univ Bremen, Fac Geosci, Klagenfurter Str 2-4, D-28359 Bremen, Germany
关键词
Machine learning; Arctic region; Computational seismology; Seismic noise; Wave propagation; Time-series analysis; P-PHASE PICKING; POLARIZATION ANALYSIS; AUTOMATIC PICKING; SEISMIC SIGNALS; VESTNESA RIDGE; GAS MIGRATION; OCEAN; SEA; PICKER; NOISE;
D O I
10.1093/gji/ggad244
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Automatic detection of seismic events in ocean bottom seismometer (OBS) data is difficult due to elevated levels of noise compared to the recordings from land. Popular deep-learning approaches that work well with earthquakes recorded on land perform poorly in a marine setting. Their adaptation to OBS data requires catalogues containing hundreds of thousands of labelled event examples that currently do not exist, especially for signals different than earthquakes. Therefore, the usual routine involves standard amplitude-based detection methods and manual processing to obtain events of interest. We present here the first attempt to utilize a Random Forest supervised machine learning classifier on marine seismological data to automate catalogue screening and event recognition among different signals [i.e. earthquakes, short duration events (SDE) and marine noise sources]. The detection approach uses the short-term average/long-term average method, enhanced by a kurtosis-based picker for a more precise recognition of the onset of events. The subsequent machine learning method uses a previously published set of signal features (waveform-, frequency- and spectrum-based), applied successfully in recognition of different classes of events in land seismological data. Our workflow uses a small subset of manually selected signals for the initial training procedure and we then iteratively evaluate and refine the model using subsequent OBS stations within one single deployment in the eastern Fram Strait, between Greenland and Svalbard. We find that the used set of features is well suited for the discrimination of different classes of events during the training step. During the manual verification of the automatic detection results, we find that the produced catalogue of earthquakes contains a large number of noise examples, but almost all events of interest are properly captured. By providing increasingly larger sets of noise examples we see an improvement in the quality of the obtained catalogues. Our final model reaches an average accuracy of 87 per cent in recognition between the classes, comparable to classification results for data from land. We find that, from the used set of features, the most important in separating the different classes of events are related to the kurtosis of the envelope of the signal in different frequencies, the frequency with the highest energy and overall signal duration. We illustrate the implementation of the approach by using the temporal and spatial distribution of SDEs as a case study. We used recordings from six OBSs deployed between 2019 and 2020 off the west-Svalbard coast to investigate the potential link of SDEs to fluid dynamics and discuss the robustness of the approach by analysing SDE intensity, periodicity and distance to seepage sites in relation to other published studies on SDEs.
引用
收藏
页码:589 / 609
页数:21
相关论文
共 50 条
  • [11] Accurate Classification of Pediatric Colonic Inflammatory Bowel Disease Subtype Using a Random Forest Machine Learning Classifier
    Dhaliwal, Jasbir
    Erdman, Lauren
    Drysdale, Erik
    Rinawi, Firas
    Muir, Jennifer
    Walters, Thomas D.
    Siddiqui, Iram
    Griffiths, Anne M.
    Church, Peter C.
    [J]. JOURNAL OF PEDIATRIC GASTROENTEROLOGY AND NUTRITION, 2021, 72 (02): : 262 - 269
  • [12] EEG based automatic emotion recognition using EMD and Random forest classifier
    Veeramallu, Gnana Keerthi Priya
    Anupalli, Yamuna
    Jilumudi, Sravan Kumar
    Bhattacharyya, Abhijit
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [13] The Random Forest Classifier Applied In Droplet Fingerprint Recognition
    Song, Qing
    Liu, Xiaoou
    Yang, Lu
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 722 - 726
  • [14] Features Selection in Character Recognition with Random Forest Classifier
    Homenda, Wladyslaw
    Lesinski, Wojciech
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT I, 2011, 6922 : 93 - +
  • [15] Machine Learning Random Forest Cluster Analysis for Large Overfitting Data: using R Programming
    Rimal, Yagyanath
    [J]. PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 1265 - 1271
  • [16] Facial expression recognition from image sequences using twofold random forest classifier
    Pu, Xiaorong
    Fan, Ke
    Chen, Xiong
    Ji, Luping
    Zhou, Zhihu
    [J]. NEUROCOMPUTING, 2015, 168 : 1173 - 1180
  • [17] Probabilistic Random Forest: A Machine Learning Algorithm for Noisy Data Sets
    Reis, Itamar
    Baron, Dalya
    Shahaf, Sahar
    [J]. ASTRONOMICAL JOURNAL, 2019, 157 (01):
  • [18] Retrieval of Interactive Requirements of Data Intensive Applications using Random Forest Classifier
    Raymond, Renita
    Anouncia, S. Margret
    [J]. Informatica (Slovenia), 2023, 47 (09): : 35 - 50
  • [19] Outlier Prediction Using Random Forest Classifier
    Mohandoss, Divya Pramasani
    Shi, Yong
    Suo, Kun
    [J]. 2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 27 - 33
  • [20] Data Linearity using Kernel PCA with Performance Evaluation of Random Forest for Training Data: A Machine Learning approach
    Biju, Vinai George
    Prashant, C. M.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2016,