Comprehensive review on Clustering Techniques and its application on High Dimensional Data

被引:10
|
作者
Alam, Afroj [1 ]
Muqeem, Mohd [1 ]
Ahmad, Sultan [2 ]
机构
[1] Integral Univ, Dept Comp Applicat, Lucknow, UP, India
[2] Prince Sattam Bin Abdulaziz Univ, Dept Comp Sci, Coll Comp Engn & Sci, Al Kharj, Saudi Arabia
关键词
Data mining; Clustering; K-means; PAM; CLARA; ETL; High-dimensional datasets; curse of dimensionality;
D O I
10.22937/IJCSNS.2021.21.6.31
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a most powerful un-supervised machine learning techniques for division of instances into homogenous group, which is called cluster. This Clustering is mainly used for generating a good quality of cluster through which we can discover hidden patterns and knowledge from the large datasets. It has huge application in different field like in medicine field, healthcare, gene-expression, image processing, agriculture, fraud detection, profitability analysis etc. The goal of this paper is to explore both hierarchical as well as partitioning clustering and understanding their problem with various approaches for their solution. Among different clustering K-means is better than other clustering due to its linear time complexity. Further this paper also focused on data mining that dealing with high-dimensional datasets with their problems and their existing approaches for their relevancy
引用
收藏
页码:237 / 244
页数:8
相关论文
共 50 条
  • [1] Literature Review on High Dimensional Data Clustering Techniques
    Selvavinayagam, G.
    Loganathan, Venkateshwaran
    Loheswaran, K.
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (06): : 183 - 187
  • [2] Convex Optimization Techniques for High-Dimensional Data Clustering Analysis: A Review
    Yousif, Ahmed Yacoub
    Al-Sarray, Basad
    [J]. Iraqi Journal for Computer Science and Mathematics, 2024, 5 (03): : 378 - 398
  • [3] A comprehensive review on the techniques for coconut oil extraction and its application
    Ng, Yan Jer
    Tham, Pei En
    Khoo, Kuan Shiong
    Cheng, Chin Kui
    Chew, Kit Wayne
    Show, Pau Loke
    [J]. BIOPROCESS AND BIOSYSTEMS ENGINEERING, 2021, 44 (09) : 1807 - 1818
  • [4] A comprehensive review on the techniques for coconut oil extraction and its application
    Yan Jer Ng
    Pei En Tham
    Kuan Shiong Khoo
    Chin Kui Cheng
    Kit Wayne Chew
    Pau Loke Show
    [J]. Bioprocess and Biosystems Engineering, 2021, 44 : 1807 - 1818
  • [5] Clustering Techniques for Traffic Classification: A Comprehensive Review
    Takyi, Kate
    Bagga, Amandeep
    Goopta, Pooja
    [J]. 2018 7TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO) (ICRITO), 2018, : 224 - 230
  • [6] A comprehensive and analytical review of text clustering techniques
    Mehta, Vivek
    Agarwal, Mohit
    Kaliyar, Rohit Kumar
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 18 (03) : 239 - 258
  • [7] Review of chemometric analysis techniques for comprehensive two dimensional separations data
    Pierce, Karisa M.
    Kehimkar, Benjamin
    Marney, Luke C.
    Hoggard, Jamin C.
    Synovec, Robert E.
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2012, 1255 : 3 - 11
  • [8] A review on particle swarm optimization algorithm and its variants to clustering high-dimensional data
    Ahmed A. A. Esmin
    Rodrigo A. Coelho
    Stan Matwin
    [J]. Artificial Intelligence Review, 2015, 44 : 23 - 45
  • [9] A review on particle swarm optimization algorithm and its variants to clustering high-dimensional data
    Esmin, Ahmed A. A.
    Coelho, Rodrigo A.
    Matwin, Stan
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2015, 44 (01) : 23 - 45
  • [10] A comprehensive survey of anomaly detection techniques for high dimensional big data
    Srikanth Thudumu
    Philip Branch
    Jiong Jin
    Jugdutt (Jack) Singh
    [J]. Journal of Big Data, 7