FARM: A New Efficient and Effective Data Clustering Algorithm

被引:0
|
作者
Tsai, Cheng-Fa [1 ]
Lee, Kuei-Sheng [1 ]
机构
[1] Natl Pingtung Univ Sci & Technol, Dept Management Informat Syst, Pingtung, Taiwan
关键词
data mining; data clustering; database; density-based clustering; grid-based clustering; algorithm;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This investigation presents a method named FARM that combines a grid-based algorithm with the density-based approach for clustering data in data mining applications. In the FARM clustering method, the number of separate clusters need not be specified but only the number of divisions of the clusters is required. Experimental results indicate that the proposed method clusters correctly. It filters 98.8% of the noise, and the data set accuracy exceeds 99.7%. The most surprising result is the time required to process data sets. Processing 575,000 data sets takes only 0.33 second - much less time than any currently known clustering algorithm.
引用
收藏
页码:253 / +
页数:2
相关论文
共 50 条
  • [41] An efficient clustering algorithm
    Zhang, YF
    Mao, JL
    Xiong, ZY
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 261 - 265
  • [42] EFFICIENT CLUSTERING ALGORITHM
    BHAT, MV
    HAUPT, A
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1976, 6 (01): : 61 - 64
  • [43] An efficient clustering algorithm
    Jiang, SY
    Xu, YM
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1513 - 1518
  • [44] An efficient K-means clustering algorithm for tall data
    Capo, Marco
    Perez, Aritz
    Lozano, Jose A.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (03) : 776 - 811
  • [45] An efficient K-means clustering algorithm for tall data
    Marco Capó
    Aritz Pérez
    Jose A. Lozano
    [J]. Data Mining and Knowledge Discovery, 2020, 34 : 776 - 811
  • [46] Effective Clustering Analysis Based on New Designed Clustering Validity Index and Revised K-means Algorithm for Big Data
    Zhu, Erzhou
    Wen, Peng
    Zhu, Binbin
    Liu, Feng
    Wang, Futian
    Li, Xuejun
    [J]. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 96 - 102
  • [47] AN EFFICIENT DATA STREAM CLUSTERING ALGORITHM BASED ON DYNAMIC GRIDS
    Yun Wu
    Gao Feng
    [J]. NEW TRENDS AND APPLICATIONS OF COMPUTER-AIDED MATERIAL AND ENGINEERING, 2011, 186 : 665 - +
  • [48] Tree-Based Algorithm for Stable and Efficient Data Clustering
    Aljabbouli, Hasan
    Albizri, Abdullah
    Harfouche, Antoine
    [J]. INFORMATICS-BASEL, 2020, 7 (04):
  • [49] An Efficient Distributed Database Clustering Algorithm for Big Data Processing
    Sun, Qiao
    Fu, Lan-mei
    Deng, Bu-qiao
    Pei, Xu-bin
    Sun, Jia-song
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC 2017), 2017, : 70 - 74
  • [50] Efficient Distributed Database Clustering Algorithm for Big Data Processing
    Li, Liantian
    [J]. 2021 6TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2021), 2021, : 495 - 498