Clustering Analysis for Large Scale Data Sets

被引:0
|
作者
Singh, Sachin [1 ]
Mishra, Ashish [2 ]
机构
[1] Indian Inst Technol, Elect Engn, Roorkee 247667, Uttar Pradesh, India
[2] Coll Engn & Technol, IEC, Elect & Commun, Gr Noida 201310, India
关键词
clustring; large scale data; data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The real-world big data can be clustered along desired dimensions but it is limited in its applicability to large-scale problems due to its high computational complexity, user's desire, number of dimensions etc. Recently, many approaches have been proposed to accelerate the large scale data clustering. Unfortunately, these methods usually sacrifice quite a lot of information of the original data; incompetent to produce multiple clustering etc and don't consider the geometrical, psychological and physiological interpretation of naturally occurring data whose representation may be parts-based in human brain. In this paper seven clustering algorithms are analyzed which is based on large scale data of seven different environment and dimensions to find out a universal framework for the representation and processing of knowledge. Our empirical study shows the encouraging results of the LSC-K algorithm in comparisons to state-of-the -art algorithms.
引用
收藏
页码:1 / 4
页数:4
相关论文
共 50 条
  • [1] Fuzzy Clustering of Large-Scale Data Sets Using Principal Component Analysis
    Arfaoui, Olfa
    Sassi Hidri, Minyar
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 683 - 690
  • [2] Parallel Clustering Algorithm for Large-Scale Biological Data Sets
    Wang, Minchao
    Zhang, Wu
    Ding, Wang
    Dai, Dongbo
    Zhang, Huiran
    Xie, Hao
    Chen, Luonan
    Guo, Yike
    Xie, Jiang
    [J]. PLOS ONE, 2014, 9 (04):
  • [3] Efficient clustering of large data sets
    Ananthanarayana, VS
    Murty, MN
    Subramanian, DK
    [J]. PATTERN RECOGNITION, 2001, 34 (12) : 2561 - 2563
  • [4] Compressed constrained spectral clustering framework for large-scale data sets
    Liu, Wenfen
    Ye, Mao
    Wei, Jianghong
    Hu, Xuexian
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 135 : 77 - 88
  • [5] An Improved Affinity Propagation Clustering Algorithm for Large-scale Data Sets
    Liu, Xiaonan
    Yin, Meijuan
    Luo, Junyong
    Chen, Wuping
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 894 - 899
  • [6] Bayesian nonparametric clustering for large data sets
    Zuanetti, Daiane Aparecida
    Mueller, Peter
    Zhu, Yitan
    Yang, Shengjie
    Ji, Yuan
    [J]. STATISTICS AND COMPUTING, 2019, 29 (02) : 203 - 215
  • [7] CLUSTERING OF LARGE DATA SETS - ZUPAN,J
    EVERITT, BS
    [J]. STATISTICIAN, 1983, 32 (03): : 355 - 355
  • [8] Bayesian nonparametric clustering for large data sets
    Daiane Aparecida Zuanetti
    Peter Müller
    Yitan Zhu
    Shengjie Yang
    Yuan Ji
    [J]. Statistics and Computing, 2019, 29 : 203 - 215
  • [9] Clustering Very Large Dissimilarity Data Sets
    Hammer, Barbara
    Hasenfuss, Alexander
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2010, 5998 : 259 - +
  • [10] A fast hierarchical clustering algorithm for large-scale protein sequence data sets
    Szilagyi, Sandor M.
    Szilagyi, Laszlo
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 48 : 94 - 101