Analysis and Modeling of Geodetic Data Based on Machine Learning

被引:0
|
作者
Wu, Tong [1 ]
机构
[1] School of Electrical and Information Engineering, Tianjin University, Tianjin,300072, China
关键词
Data mining - Decision trees - Discrete Fourier transforms - Frequency domain analysis - Gaussian noise (electronic) - Geodesy - Interpolation - Learning systems - Probability distributions - Reliability analysis - Signal processing - Support vector machines - Time domain analysis - Wavelet transforms;
D O I
暂无
中图分类号
学科分类号
摘要
This paper underscores the significance of earth deformation observation in analyzing earth tide curves and predicting earthquakes, positioning it as a cornerstone of Earth observation technology. We delve into the critical task of detecting and diagnosing anomalies in geodetic data. Utilizing Python for data preprocessing, our approach identifies missing values, categorizes them by their spatial occurrence, and employs spline interpolation and autoregressive prediction methods for data imputation. This process ensures the integrity of the dataset for subsequent analysis and modeling, reinforcing the precision and reliability of geodetic data analysis in Earth science research. For problem I To expand the data set, we propose three models. Model I: Adding gaussian noise to the data. Model II: Resample the data. Model III: Using machine learning methods to learn the internal laws of the data and predict itself to generate new data. For each model, we discuss its advantages and disadvantages. Finally, we structurally fuse the three models to complete data enhancement. For problem II To extract the noise, we use DB4 wavelet transform to denoise the data set and extract the noise. Then we make descriptive statistics on the noise distribution, and use Laplace distribution to fit the probability distribution of noise, and finally get the accurate noise distribution. For problem III We start from the time domain and frequency domain to extract the features of the data. First, 17 features are extracted in the time domain, then the discrete fourier transform algorithm is used to transform the data into frequency domain data, and 13 are extracted. Therefore, we encode each data as a feature vector with a length of 30. We first use the decision tree as the baseline model to establish the recognition model to select the features. Logistic Regression, KNN, Naive Bayes and SVM are used to establish the recognition model. Finally, we use the Voting ensemble learning method to fuse the model, achieving an accuracy of 86% on the test set. © 2023 Tong Wu, published by Sciendo.
引用
收藏
相关论文
共 50 条
  • [21] Machine Learning-Based Sentiment Analysis of Twitter Data
    Karthiga, M.
    Kumar, Sathish G.
    Aravindhraj, N.
    Priyanka, S.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,
  • [22] Research and analysis of psychological data based on machine learning methods
    Chen G.
    Lv W.
    Ma J.
    Liang Y.
    International Journal of Wireless and Mobile Computing, 2022, 22 (01) : 1 - 8
  • [23] Data Analysis to Predictive Modeling of Marine Engine Performance Using Machine Learning
    Chan, T. K.
    Chin, C. S.
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2076 - 2080
  • [24] Modeling urban mobility with machine learning analysis of public taxi transportation data
    Song, Ha Yoon
    You, Dabin
    INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS, 2018, 14 (01) : 73 - 87
  • [25] Outlier data mining model for sports data analysis based on machine learning
    Yin, Zhimeng
    Cui, Wei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (02) : 2733 - 2742
  • [26] Research on education management system based on machine learning and multidimensional data modeling
    Xu, Qiaonan
    Deng, Hui
    APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2023, 9 (01)
  • [27] Accuracy Analysis of Machine Learning-Based Performance Modeling for Microprocessors
    Tanaka, Yoshihiro
    Oka, Keitaro
    Ono, Takatsugu
    Inoue, Koji
    2016 FOURTH INTERNATIONAL JAPAN-EGYPT CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (JEC-ECC), 2016, : 83 - 86
  • [28] Machine learning-based kinetic modeling: a robust and reproducible solution for quantitative analysis of dynamic PET data
    Pan, Leyun
    Cheng, Caixia
    Haberkorn, Uwe
    Dimitrakopoulou-Strauss, Antonia
    PHYSICS IN MEDICINE AND BIOLOGY, 2017, 62 (09): : 3566 - 3581
  • [29] Teaching-Learning Activity Modeling Based on Data Analysis
    Kim, Kyungrog
    Choi, Yoo-Joo
    Kim, Mihui
    Lee, Jung-Won
    Park, Doo-Soon
    Moon, Nammee
    SYMMETRY-BASEL, 2015, 7 (01): : 206 - 219
  • [30] Topological data analysis and machine learning
    Leykam, Daniel
    Angelakis, Dimitris G.
    ADVANCES IN PHYSICS-X, 2023, 8 (01):