Classification and Analysis of Clustering Algorithms for Large Datasets

Cited by: 0
Authors
Badase, P. S. [1]
Deshbhratar, G. P. [1]
Bhagat, A. P. [1]
Affiliation
[1] Prof Ram Meghe Coll Engn & Mgmt, Dept Comp Sci & Engn, Badnera, Amravati, India
Keywords
classification; clustering; density based methods; grid based methods; hierarchical methods; partitioning methods
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Data mining is the analysis step for discovering knowledge and patterns in large databases and large datasets [1]. It is the process of applying machine learning methods to uncover hidden patterns in large datasets. Data mining techniques involve many different ways to classify data. Data classified in this way can be accessed quickly and used to provide fast services to customers. This paper gives an overview of the available algorithms for clustering large datasets and provides a comparative analysis of them. It also outlines future directions for researchers in the large-database clustering domain.
Pages: 5