Unsupervised Machine Learning Clustering of Seismic and Infrasound Data Quality Metrics

被引:0
|
作者
Coffey, Juliann R. [1 ]
Witsil, Alex J. C. [1 ,2 ]
Macpherson, Kenneth A. [1 ]
Fee, David [1 ,3 ]
机构
[1] Univ Alaska Fairbanks, Geophys Inst, Wilson Alaska Tech Ctr, Fairbanks, AK 99775 USA
[2] Appl Res Associates, Raleigh, NC USA
[3] Univ Alaska Fairbanks, Geophys Inst, Alaska Volcano Observ, Fairbanks, AK USA
关键词
SELECTION;
D O I
10.1785/0220230177
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Developing techniques for improving quality control (QC) schemes to catch seismic and infrasound data defects continues to be an area of active research. Selecting universal thresholds for the automation of data quality (DQ) checks is an efficient way to find QC issues, but these thresholds may not apply well to multiple stations with varying DQ characteristics. In addition, these thresholds may not catch subtle changes in DQ parameters that still indicate problems. Machine learning can be an alternative way of diagnosing QC issues. K-means clustering, an unsupervised machine learning clustering algorithm, has been effectively used in the past for geophysical pattern exploration. This study furthers k-means applications to DQ analysis through clustering on DQ metrics derived from day-long segments of nuclear explosion monitoring data. Our k-means implementation on broadband seismometer DQ metrics separately clustered mass recenters, calibrations lasting at least one hour, and days without either. Applying this technique to infrasound DQ metrics revealed clusters related to physical issues at the stations, such as missing back volume screws and the flooding of ported pipe inlets. These are both examples of QC issues that are difficult to diagnose or detect through the thresholding of metrics or by inspecting waveforms and spectra. Our results show that k-means clustering can be a useful QC tool in exploring DQ patterns to assist analyst review of station operation and maintenance. The learned knowledge from this exploration can then inform a thresholding workflow on how to tailor to individual stations, or the k-means model could classify data directly.
引用
收藏
页码:1812 / 1833
页数:22
相关论文
共 50 条
  • [21] Analyzing continuous infrasound from Stromboli volcano, Italy using unsupervised machine learning
    Witsil, Alex J. C.
    Johnson, Jeffrey B.
    COMPUTERS & GEOSCIENCES, 2020, 140
  • [22] Exploration of Feature Engineering Techniques and Unsupervised Machine Learning Clustering Algorithms for Geophysical Data on Levees
    Russo, Brittany M.
    Athanasopoulos-Zekkos, Adda
    GEO-CONGRESS 2024: GEOTECHNICAL DATA ANALYSIS AND COMPUTATION, 2024, 352 : 454 - 463
  • [23] Using Unsupervised Machine Learning for Data Quality. Application to Financial Governmental Data Integration
    Necba, Hanae
    Rhanoui, Maryem
    El Asri, Bouchra
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 197 - 209
  • [24] Unsupervised Machine Learning Via Transfer Learning and k-Means Clustering to Classify Materials Image Data
    Cohn, Ryan
    Holm, Elizabeth
    INTEGRATING MATERIALS AND MANUFACTURING INNOVATION, 2021, 10 (02) : 231 - 244
  • [25] Unsupervised Machine Learning Via Transfer Learning and k-Means Clustering to Classify Materials Image Data
    Ryan Cohn
    Elizabeth Holm
    Integrating Materials and Manufacturing Innovation, 2021, 10 : 231 - 244
  • [26] Unsupervised feature selection based extreme learning machine for clustering
    Jichao Chen
    Yijie Zeng
    Yue Li
    Guang-Bin Huang
    NEUROCOMPUTING, 2020, 386 : 198 - 207
  • [27] Highway Project Clustering Using Unsupervised Machine Learning Approach
    Alikhani, Hamed
    Jeong, H. David
    COMPUTING IN CIVIL ENGINEERING 2021, 2022, : 172 - 179
  • [28] Unsupervised machine learning approach for building composite indicators with fuzzy metrics
    Jimenez-Fernandez, E.
    Sanchez, A.
    Sanchez Perez, E. A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [29] A Deep Unsupervised Learning Algorithm for Dynamic Data Clustering
    Pantula, Priyanka D.
    Miriyala, Srinivas S.
    Mitra, Kishalay
    2021 SEVENTH INDIAN CONTROL CONFERENCE (ICC), 2021, : 147 - 152
  • [30] TRUNC: A Transfer Learning Unsupervised Network for Data Clustering
    Xavier, Rita
    Peller, John
    de Castro, Leandro Nunes
    IEEE ACCESS, 2025, 13 : 46282 - 46298