A clustering method for very large mixed data sets

被引:2
|
作者
Sánchez-Díaz, G
Ruiz-Shulcloper, J
机构
关键词
D O I
10.1109/ICDM.2001.989590
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the developed countries, especially over the last decade, there has been an explosive growth in the capability to generate, collect and use very large data sets. The objects of these data sets could be simultaneously described by quantitative and qualitative attributes. At present, algorithms able to process either very large data sets (in metric spaces) or mixed (qualitative and quantitative) incomplete data (missing value) sets have been developed, but not for very large mixed incomplete data sets. In this paper we introduce a new clustering method named GLC+ to process very large mixed incomplete data sets in order to obtain a partition in connected sets.
引用
收藏
页码:643 / 644
页数:2
相关论文
共 50 条
  • [1] Clustering Very Large Dissimilarity Data Sets
    Hammer, Barbara
    Hasenfuss, Alexander
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2010, 5998 : 259 - +
  • [2] A genetic algorithm for clustering on very large data sets
    Gasvoda, J
    Ding, Q
    [J]. COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2003, : 163 - 167
  • [3] Selective sampling for approximate clustering of very large data sets
    Wang, Liang
    Bezdek, James C.
    Leckie, Christopher
    Kotagiri, Ramamohanarao
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2008, 23 (03) : 313 - 331
  • [4] Extending fuzzy and probabilistic clustering to very large data sets
    Hathaway, Richard J.
    Bezdek, James C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (01) : 215 - 234
  • [5] Progressive sampling schemes for approximate clustering in very large data sets
    Bezdek, JC
    Hathaway, RJ
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 15 - 21
  • [6] An investigation of mountain method clustering for large data sets
    Velthuizen, RP
    Hall, LO
    Clarke, LP
    Silbiger, ML
    [J]. PATTERN RECOGNITION, 1997, 30 (07) : 1121 - 1135
  • [7] DESCRY: A density based clustering algorithm for very large data sets
    Angiulli, F
    Pizzuti, C
    Ruffolo, M
    [J]. INTELLIGENT DAA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 203 - 210
  • [8] A visual and interactive data exploration method for large data sets and clustering
    Da Costa, David
    Venturini, Gilles
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 553 - +
  • [9] Clustering in large data sets with the limited memory bundle method
    Karmitsa, Napsu
    Bagirov, Adil M.
    Taheri, Sona
    [J]. PATTERN RECOGNITION, 2018, 83 : 245 - 259
  • [10] A projection method for robust estimation and clustering in large data sets
    Pena, Daniel
    Prieto, Francisco J.
    [J]. DATA ANALYSIS, CLASSIFICATION AND THE FORWARD SEARCH, 2006, : 209 - +