Categorical data clustering: A correlation-based approach for unsupervised attribute weighting

Cited by: 2
Authors
Carbonera, Joel Luis [1]
Abel, Mara [1]
Affiliations
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
Keywords
clustering; subspace clustering; categorical data; attribute weighting; data mining; K-MEANS; ALGORITHM;
DOI
10.1109/ICTAI.2014.46
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Interest in attribute weighting for clustering tasks has been increasing in recent years. However, few attempts have been made to apply automated attribute weighting to categorical data clustering. Most existing approaches compute the weights based on the frequency of the mode category or on the average distance of data objects from the mode of a cluster. In this paper, we adopt a different approach, investigating how to use the correlation among categorical attributes to measure their relevance in clustering tasks. As a result, we propose a correlation-based attribute weighting approach for categorical attributes.
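The idea of weighting categorical attributes by their mutual correlation can be illustrated with a minimal sketch. The abstract does not say which correlation measure or weighting formula the authors use, so everything here is an assumption made for illustration: Cramér's V stands in as the association measure, each attribute's score is its mean association with the other attributes, and the hypothetical helpers `cramers_v` and `correlation_weights` are not taken from the paper.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Cramér's V association between two equal-length categorical sequences.

    Assumption: used here only as a stand-in correlation measure.
    """
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    joint = Counter(zip(xs, ys))
    k = min(len(px), len(py)) - 1
    if n == 0 or k <= 0:
        return 0.0  # a constant (or empty) attribute carries no association
    chi2 = 0.0
    for a, count_a in px.items():
        for b, count_b in py.items():
            expected = count_a * count_b / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


def correlation_weights(rows):
    """Return one weight per attribute, normalized to sum to 1.

    rows: list of equal-length tuples of categorical values.
    Assumption: an attribute's relevance is its mean Cramér's V
    with every other attribute.
    """
    columns = list(zip(*rows))
    m = len(columns)
    if m == 0:
        return []
    scores = []
    for i in range(m):
        others = [cramers_v(columns[i], columns[j]) for j in range(m) if j != i]
        scores.append(sum(others) / len(others) if others else 0.0)
    total = sum(scores)
    if total == 0:
        return [1.0 / m] * m  # no association anywhere: fall back to uniform weights
    return [s / total for s in scores]


# Attributes 0 and 1 determine each other; attribute 2 is independent of both.
rows = [("a", "x", "p"), ("a", "x", "q"), ("b", "y", "p"), ("b", "y", "q")]
weights = correlation_weights(rows)
```

In this toy data set the two mutually dependent attributes share all of the weight and the independent attribute receives none. Weights of this kind could then scale the per-attribute mismatch terms of a k-modes style dissimilarity, although whether the paper does so is not stated in the abstract.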
Pages: 259-263
Number of pages: 5