Analysis and Modeling of Geodetic Data Based on Machine Learning

Authors
Wu, Tong [1]
Affiliation
[1] School of Electrical and Information Engineering, Tianjin University, Tianjin, 300072, China
Keywords
Data mining - Decision trees - Discrete Fourier transforms - Frequency domain analysis - Gaussian noise (electronic) - Geodesy - Interpolation - Learning systems - Probability distributions - Reliability analysis - Signal processing - Support vector machines - Time domain analysis - Wavelet transforms
DOI
Not available
Abstract
This paper underscores the significance of earth deformation observation for analyzing earth tide curves and predicting earthquakes, positioning it as a cornerstone of Earth observation technology. We address the task of detecting and diagnosing anomalies in geodetic data. Using Python for data preprocessing, our approach identifies missing values, categorizes them by their spatial occurrence, and imputes them with spline interpolation and autoregressive prediction. This process preserves the integrity of the data set for subsequent analysis and modeling, supporting the precision and reliability of geodetic data analysis in Earth science research.

Problem I: To expand the data set, we propose three models. Model I adds Gaussian noise to the data. Model II resamples the data. Model III uses machine learning methods to learn the internal regularities of the data and generates new data by predicting from the data itself. For each model, we discuss its advantages and disadvantages. Finally, we structurally fuse the three models to complete the data augmentation.

Problem II: To extract the noise, we apply a DB4 wavelet transform to denoise the data set and take the removed component as the noise. We then compute descriptive statistics of the noise and fit its probability distribution with a Laplace distribution to characterize the noise distribution accurately.

Problem III: We extract features of the data in both the time domain and the frequency domain. First, 17 features are extracted in the time domain. The discrete Fourier transform is then used to transform the data into the frequency domain, where 13 more features are extracted. Each sample is therefore encoded as a feature vector of length 30. We first use a decision tree as the baseline recognition model to select features. Logistic regression, KNN, naive Bayes, and SVM are then used to build recognition models. Finally, we fuse the models with the Voting ensemble learning method, achieving an accuracy of 86% on the test set. © 2023 Tong Wu, published by Sciendo.
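
The abstract names its techniques but gives no implementation details, so the following is only a minimal Python sketch of three of the steps, written under stated assumptions: Gaussian-noise augmentation (Model I), a maximum-likelihood Laplace fit to a noise sample (Problem II), and a few frequency-domain features from the discrete Fourier transform (Problem III). All function names, parameters, and the three features chosen are hypothetical; the paper's actual 13 frequency-domain features are not listed in the abstract, and the synthetic sine wave merely stands in for real geodetic data.

```python
import numpy as np


def augment_with_gaussian_noise(signal, noise_std, n_copies, seed=0):
    """Model I sketch: return n_copies noisy versions of a 1-D signal."""
    rng = np.random.default_rng(seed)
    signal = np.asarray(signal, dtype=float)
    noise = rng.normal(0.0, noise_std, size=(n_copies, signal.size))
    return signal + noise  # broadcasts to shape (n_copies, len(signal))


def fit_laplace(noise):
    """Maximum-likelihood Laplace fit.

    The location estimate is the sample median and the scale estimate is
    the mean absolute deviation from that median.
    """
    noise = np.asarray(noise, dtype=float)
    loc = float(np.median(noise))
    scale = float(np.mean(np.abs(noise - loc)))
    return loc, scale


def frequency_features(signal, sample_rate=1.0):
    """A few illustrative frequency-domain features (not the paper's list)."""
    signal = np.asarray(signal, dtype=float)
    # Remove the mean so the zero-frequency bin does not dominate.
    spectrum = np.abs(np.fft.rfft(signal - signal.mean()))
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / sample_rate)
    power = spectrum ** 2
    total = power.sum()
    centroid = float((freqs * power).sum() / total) if total > 0 else 0.0
    return {
        "dominant_freq": float(freqs[np.argmax(spectrum)]),
        "spectral_centroid": centroid,
        "total_power": float(total),
    }


if __name__ == "__main__":
    t = np.arange(512)
    tide = np.sin(2 * np.pi * t / 64)  # synthetic stand-in for a tide-like curve
    copies = augment_with_gaussian_noise(tide, noise_std=0.1, n_copies=5)
    print(copies.shape)
    print(fit_laplace(copies[0] - tide))
    print(frequency_features(tide)["dominant_freq"])
```

In this sketch the injected noise is Gaussian by construction, so the Laplace fit on it only demonstrates the estimator; in the workflow the abstract describes, the fit would instead be applied to the residual left after wavelet denoising.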