A hybrid algorithm with cluster analysis in modelling high dimensional data

被引:3
|
作者
Tunga, Burcu [1 ]
机构
[1] Istanbul Tech Univ, Fac Sci & Letters, Math Engn Dept, TR-34469 Istanbul, Turkey
关键词
High dimensional problems; Data modelling; Cluster analysis; Approximation; EMPR; MULTIVARIATE DATA;
D O I
10.1016/j.dam.2017.09.002
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Multivariate data modelling aims to predict unknown function values through an established mathematical model. It is essential to construct an analytical structure using the given set of high dimensional data points with corresponding function values. The level of multivariance directly affects the modelling process. Increase in the number of independent variables makes the standard numerical methods incapable of obtaining the sought analytical structure. This work aims to overcome the difficulties of high multivariance and to improve the modelling quality by carrying out two main steps: data clustering and data partitioning. Data clustering step deals with dividing the whole problem domain into several clusters by performing k-means clustering algorithm. Data partitioning step performs the Enhanced Multivariance Product Representation method to partition the high dimensional data set of each cluster. The analytical structure is obtained through the partitioned data for each cluster and can be used to predict the unknown function values. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:161 / 168
页数:8
相关论文
共 50 条
  • [1] A fast algorithm to cluster high dimensional basket data
    Ordonez, C
    Omiecinski, E
    Ezquerra, N
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 633 - 636
  • [2] Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data
    Floros, Dimitris
    Liu, Tiancheng
    Pitsianis, Nikos
    Sun, Xiaobai
    [J]. 2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [3] High-Dimensional Cluster Analysis with the Masked EM Algorithm
    Kadir, Shabnam N.
    Goodman, Dan F. M.
    Harris, Kenneth D.
    [J]. NEURAL COMPUTATION, 2014, 26 (11) : 2379 - 2394
  • [4] Cluster analysis of high-dimensional data: A case study
    Bean, R
    McLachlan, G
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2005, PROCEEDINGS, 2005, 3578 : 302 - 310
  • [5] Robust regularized cluster analysis for high-dimensional data
    Kalina, Jan
    Vlckova, Katarina
    [J]. MATHEMATICAL METHODS IN ECONOMICS (MME 2014), 2014, : 378 - 383
  • [6] New hybrid data orientation cluster algorithm
    School of Software, Central South University, Changsha 410075, China
    不详
    [J]. Kongzhi yu Juece Control Decis, 2009, 5 (697-700+705):
  • [7] A Rapid Hybrid Clustering Algorithm for Large Volumes of High Dimensional Data
    Rathore, Punit
    Kumar, Dheeraj
    Bezdek, James C.
    Rajasegarar, Sutharshan
    Palaniswami, Marimuthu
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (04) : 641 - 654
  • [8] Cluster weighted model based on TSNE algorithm for high-dimensional data
    Olobatuyi, Kehinde
    Parker, Matthew R. P.
    Ariyo, Oludare
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 17 (03) : 261 - 273
  • [9] Cluster weighted model based on TSNE algorithm for high-dimensional data
    Kehinde Olobatuyi
    Matthew R. P. Parker
    Oludare Ariyo
    [J]. International Journal of Data Science and Analytics, 2024, 17 : 261 - 273
  • [10] Genetic Algorithm Based Wrapper Feature Selection on Hybrid Prediction Model for Analysis of High Dimensional Data
    Anirudha, R. C.
    Kannan, Remya
    Patil, Nagamma
    [J]. 2014 9TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2014, : 290 - 295