Analysis and Modeling of Geodetic Data Based on Machine Learning

Author
Wu, Tong [1]
Affiliation
[1] School of Electrical and Information Engineering, Tianjin University, Tianjin, 300072, China
Keywords
Data mining - Decision trees - Discrete Fourier transforms - Frequency domain analysis - Gaussian noise (electronic) - Geodesy - Interpolation - Learning systems - Probability distributions - Reliability analysis - Signal processing - Support vector machines - Time domain analysis - Wavelet transforms
DOI
Not available
Abstract
This paper underscores the significance of earth deformation observation in analyzing earth tide curves and predicting earthquakes, positioning it as a cornerstone of Earth observation technology. We address the task of detecting and diagnosing anomalies in geodetic data. Using Python for data preprocessing, our approach identifies missing values, categorizes them by their spatial occurrence, and imputes them with spline interpolation and autoregressive prediction. This process preserves the integrity of the dataset for subsequent analysis and modeling, supporting the precision and reliability of geodetic data analysis in Earth science research.
For Problem I, expanding the dataset, we propose three models. Model I adds Gaussian noise to the data. Model II resamples the data. Model III uses machine learning methods to learn the internal regularities of the data and predict the data itself, thereby generating new data. For each model, we discuss its advantages and disadvantages. Finally, we structurally fuse the three models to complete the data augmentation.
For Problem II, extracting the noise, we denoise the dataset with a DB4 (Daubechies-4) wavelet transform and extract the noise. We then compute descriptive statistics of the noise and fit its probability distribution with a Laplace distribution, which yields the final noise distribution.
For Problem III, we extract features of the data in both the time domain and the frequency domain. First, 17 features are extracted in the time domain; the data are then transformed into the frequency domain with the discrete Fourier transform, and 13 more features are extracted. Each sample is therefore encoded as a feature vector of length 30. We first use a decision tree as the baseline recognition model to select features. Logistic regression, KNN, naive Bayes, and SVM are then used to build recognition models. Finally, we fuse the models with a voting ensemble learning method, achieving an accuracy of 86% on the test set.
© 2023 Tong Wu, published by Sciendo.
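Two of the steps named in the abstract are simple enough to sketch with the Python standard library: adding Gaussian noise to a series (Model I) and fitting a Laplace distribution to an extracted noise sequence by maximum likelihood (Problem II). This is an illustrative sketch only, not the paper's code. The function names, the `sigma` and `copies` parameters, and the synthetic series are all assumptions, and the wavelet denoising that would normally produce the residuals is not shown.

```python
import random
import statistics


def augment_with_gaussian_noise(series, sigma, copies=1, seed=None):
    """Return `copies` noisy variants of `series` (Model I style augmentation).

    `sigma` is the noise standard deviation. It is a hypothetical tuning
    parameter, not a value taken from the paper.
    """
    rng = random.Random(seed)
    return [[x + rng.gauss(0.0, sigma) for x in series] for _ in range(copies)]


def fit_laplace(residuals):
    """Maximum-likelihood Laplace fit, returned as (location, scale).

    For a Laplace distribution the MLE of the location is the sample median,
    and the MLE of the scale is the mean absolute deviation from that median.
    """
    loc = statistics.median(residuals)
    scale = sum(abs(r - loc) for r in residuals) / len(residuals)
    return loc, scale


if __name__ == "__main__":
    # Synthetic stand-in for an observation series (not real geodetic data).
    clean = [0.1 * i for i in range(200)]
    variants = augment_with_gaussian_noise(clean, sigma=0.05, copies=3, seed=0)

    # Assumption: the residuals would come from the wavelet denoising step;
    # here the difference from the clean series only demonstrates the call.
    residuals = [n - c for n, c in zip(variants[0], clean)]
    print(fit_laplace(residuals))
```

Fixing `seed` makes the augmented copies reproducible. In practice `sigma` would need to be chosen relative to the noise level of the real observations, which the abstract does not specify.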