Classification and Analysis of Clustering Algorithms for Large Datasets

被引:0
|
作者
Badase, P. S. [1 ]
Deshbhratar, G. P. [1 ]
Bhagat, A. P. [1 ]
机构
[1] Prof Ram Meghe Coll Engn & Mgmt, Dept Comp Sci & Engn, Badnera, Amravati, India
关键词
classification; clustering; density based methods; grid based methods; hierarchical methods; partitioning methods;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining is the analysis step for discovering knowledge and patterns in large databases and large datasets [ 1]. Data mining is the process of applying machine learning methods with the intention of uncovering hidden patterns in large data sets. Data mining techniques basically involves many different ways to classify the data. Such classified data are used to fast accesses of data and for providing fast services to the customers. This paper gives an overview of available algorithms that can be used for clustering in large datasets. The comparative analysis of available clustering algorithms is provided in this paper. This paper also includes the future directions for researchers in the large database clustering domain.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Scalable formal concept analysis algorithms for large datasets using Spark
    Raghavendra K Chunduri
    Aswani Kumar Cherukuri
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 4283 - 4303
  • [42] Clustering Large Datasets Using Data Stream Clustering Techniques
    Bolanos, Matthew
    Forrest, John
    Hahsler, Michael
    DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 135 - 143
  • [43] Similarity-based attribute weighting methods via clustering algorithms in the classification of imbalanced medical datasets
    Kemal Polat
    Neural Computing and Applications, 2018, 30 : 987 - 1013
  • [44] Comparison of Clustering Algorithms for Learning Analytics with Educational Datasets
    Martinez Navarro, Alvaro
    Moreno-Ger, Pablo
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2018, 5 (02): : 9 - 16
  • [45] DataGen: A generator of datasets for evaluation of classification algorithms
    Natl Ukrainian Acad of Sciences, Kiev, Ukraine
    Pattern Recognit Lett, 7 (537-544):
  • [46] Performance Comparison of Classification Algorithms on Medical Datasets
    Ramana, Bendi Venkata
    Boddu, Raja Sarath Kumar
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 140 - 145
  • [47] Metric Structures on Datasets: Stability and Classification of Algorithms
    Memoli, Facundo
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS: 14TH INTERNATIONAL CONFERENCE, CAIP 2011, PT 2, 2011, 6855 : 1 - 33
  • [48] DataGen: a generator of datasets for evaluation of classification algorithms
    Rachkovskij, DA
    Kussul, EM
    PATTERN RECOGNITION LETTERS, 1998, 19 (07) : 537 - 544
  • [49] Clustering large datasets in arbitrary metric spaces
    Ganti, V
    Ramakrishnan, R
    Gehrke, J
    Powell, A
    French, J
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 502 - 511
  • [50] MPRK Algorithm for Clustering the Large Text Datasets
    Thangarasu, M.
    Inbarani, H. Hannah
    2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA), 2016, : 224 - 229