Out-of-Core Assessment of Clustering Tendency for Large Data Sets

被引:2
|
作者
Pakhira, Malay K. [1 ]
机构
[1] Kalyani Govt Engn Coll, Kalyani, W Bengal, India
关键词
Clustering; Number of clusters; Visual assessment; VAT algorithm;
D O I
10.1109/IADCC.2010.5423044
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Determining the number of clusters present in a data set automatically is a very important problem. Conventional clustering techniques assume a certain number of clusters, and then try to find out the possible cluster structure associated to the above number. For very large and complex data sets it is not easy to guess this number of clusters. There exists validity based clustering techniques, which measure a certain cluster validity measure of a certain clustering result by varying the number of clusters. After doing this for a broad range of possible number of clusters, this method selects the number for which the validity measure is optimum. This method is, however, awkward and may not always be applicable for very large data sets. Recently an interesting visual technique for determining clustering tendency has been developed. This new technique is called VAT in abbreviation. The original VAT and its different versions are found to determine the number of clusters, before actually applying any clustering algorithm, very satisfactorily. In this paper, we have proposed an out-of-core VAT algorithm (o-VAT) for very large data sets.
引用
收藏
页码:29 / 33
页数:5
相关论文
共 50 条
  • [1] Using an out-of-core technique for clustering large data sets
    Masciari, E
    Pizzuti, C
    Raimondo, G
    Talia, D
    [J]. 12TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2001, : 133 - 137
  • [2] OMR: Out-of-Core MapReduce for Large Data Sets
    Kaur, Gurneet
    Vora, Keval
    Koduru, Sai Charan
    Gupta, Rajiv
    [J]. PROCEEDINGS OF THE 2018 ACM SIGPLAN INTERNATIONAL SYMPOSIUM ON MEMORY MANAGEMENT (ISMM'18), 2018, : 71 - 83
  • [3] OMR: Out-of-Core MapReduce for Large Data Sets
    Kaur, Gurneet
    Vora, Keval
    Koduru, Sai Charan
    Gupta, Rajiv
    [J]. ACM SIGPLAN NOTICES, 2018, 53 (05) : 71 - 83
  • [4] An efficient out-of-core volume ray casting method for the visualization of large medical data sets
    Xue, Jian
    Tian, Jie
    Chen, Jian
    Dai, Yakang
    [J]. MEDICAL IMAGING 2007: VISUALIZATION AND IMAGE-GUIDED PROCEDURES, PTS 1 AND 2, 2007, 6509
  • [5] Out-of-core clustering of volumetric datasets
    Granberg C.J.
    Li L.
    [J]. Journal of Zhejiang University-SCIENCE A, 2006, 7 (7): : 1134 - 1140
  • [6] Out-of-core clustering of volumetric datasets
    GRANBERG Carl J.
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2006, (07) : 1134 - 1140
  • [7] Out-of-Core Surface Reconstruction from Large Point Sets for Infrastructure Inspection
    Xu, Chen
    Frechet, Simon
    Laurendeau, Denis
    Miralles, Francois
    [J]. 2015 12TH CONFERENCE ON COMPUTER AND ROBOT VISION CRV 2015, 2015, : 313 - 317
  • [8] Large out-of-core tetrahedral meshing
    Alleaume, Aurelien
    Francez, Laurent
    Loriot, Alark
    Maman, Nathan
    [J]. PROCEEDINGS OF THE 16TH INTERNATIONAL MESHING ROUNDTABLE, 2008, : 461 - +
  • [9] Interactive out-of-core isosurface visualisation in time-varying data sets
    Vrolijk, B
    Post, FH
    [J]. COMPUTERS & GRAPHICS-UK, 2006, 30 (02): : 265 - 276
  • [10] Out-of-core rendering of large, unstructured grids
    Farias, R
    Silva, CT
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (04) : 42 - 50