A heuristic algorithm for pattern identification in large multivariate analysis of geophysical data sets

Cited by: 4
Authors
da Silva Pereira, Joao Eduardo [1]
Strieder, Adelir Jose [2]
Amador, Janete Pereira [1]
Silverio da Silva, Jose Luiz [3]
Volcato Descovi Filho, Leonidas Luiz [3]
Affiliations
[1] Univ Fed Santa Maria, Dept Stat, Santa Maria, RS, Brazil
[2] Univ Fed Rio Grande do Sul, Dept Min Engn, BR-90046900 Porto Alegre, RS, Brazil
[3] Univ Fed Santa Maria, Dept Geosci, Santa Maria, RS, Brazil
Keywords
Factor analysis; Local search system; Patterns identification; Aero-geophysical data; MATLAB program; RECOGNITION;
DOI
10.1016/j.cageo.2009.03.009
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper presents a heuristic algorithm that combines factor analysis with a local search optimization system for pattern identification in large multivariate aero-geophysical data. The algorithm was developed in MATLAB using both multivariate and univariate methodologies. The MATLAB code has two main analysis steps: the first applies multivariate factor analysis to reduce the dimension of the problem and to orient the variables in an independent, orthogonal structure; the second applies a novel local search optimization system based on a univariate structure. The local search process is simple and consistent because it solves a multivariate problem by summing up independent univariate problems. Thus, it can reduce computational time and render the efficiency of the estimates independent of the data bank. The aero-geophysical data comprise the results of the magnetometric and gamma-spectrometric (TC, K, Th, and U channels) surveys for the Santa Maria region (RS, Brazil). After classification, when the observations are superimposed on the regional map, data belonging to the same subspace appear close to each other, revealing some physical law governing the area pattern distribution. The analysis of variance for the original variables as functions of the subspaces obtained shows different mean behaviors for all the variables. This result indicates that the factor transformation captures the discriminative capacity of the original variables. The proposed algorithm for multivariate factor analysis and the local search system open up new challenges in aero-geophysical data handling and processing techniques. (C) 2009 Elsevier Ltd. All rights reserved.
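The two-step idea described in the abstract can be illustrated with a minimal sketch. It is written in Python with NumPy rather than the authors' MATLAB, and it is not their code: the abstract does not give the extraction method, the search criterion, or the number of groups per factor. The sketch assumes principal-component factor extraction, a two-group split on each factor, and a within-group sum of squares as the univariate objective. All function names (`factor_scores`, `within_ss`, `local_search_cut`, `classify`) are hypothetical.

```python
import numpy as np

def factor_scores(X, n_factors):
    """Step 1 (assumed PC extraction): map data to uncorrelated, unit-variance scores."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)   # standardize each variable
    corr = np.corrcoef(Z, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)             # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_factors]       # keep the largest n_factors
    return Z @ eigvecs[:, order] / np.sqrt(eigvals[order])

def within_ss(values, cut):
    """Assumed univariate objective: within-group sum of squares of a two-group split."""
    left, right = values[values <= cut], values[values > cut]
    if left.size == 0 or right.size == 0:
        return np.inf                                   # reject splits with an empty group
    return ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()

def local_search_cut(values, step=0.5, tol=1e-3):
    """Step 2: hill-climbing local search for a cut point on a single factor."""
    cut = float(np.median(values))
    best = within_ss(values, cut)
    while step > tol:
        improved = False
        for candidate in (cut - step, cut + step):      # examine both neighbours
            score = within_ss(values, candidate)
            if score < best:
                cut, best, improved = candidate, score, True
        if not improved:
            step /= 2                                   # refine the neighbourhood
    return cut

def classify(X, n_factors=2):
    """Solve one univariate problem per factor, then combine the splits into subspaces."""
    F = factor_scores(X, n_factors)
    cuts = [local_search_cut(F[:, j]) for j in range(n_factors)]
    bits = (F > np.array(cuts)).astype(int)             # side of the cut, per factor
    labels = bits @ (2 ** np.arange(n_factors))         # subspace index in 0 .. 2**n_factors - 1
    return labels, F, cuts
```

Because the factor scores are uncorrelated, each cut point is searched independently, so the cost grows linearly with the number of retained factors. Calling `classify(X, n_factors=2)` on an observations-by-variables array returns one subspace label per observation, which could then be plotted at the observation coordinates.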
Pages: 83-90
Number of pages: 8