Categorical data clustering: A correlation-based approach for unsupervised attribute weighting

被引:2
|
作者
Carbonera, Joel Luis [1 ]
Abel, Mara [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
关键词
clustering; subspace clustering; categorical data; attribute weighting; data mining; K-MEANS; ALGORITHM;
D O I
10.1109/ICTAI.2014.46
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interest in attribute weighting, in clustering tasks, have been increasing in the last years. However, few attempts have been made to apply automated attribute weighting to categorical data clustering. Most of the existing approaches computes the weights based on the frequency of the mode category or according to the average distance of data objects from the mode of a cluster. In this paper, we adopt a different approach, investigating how to use the correlation among categorical attributes for measuring their relevancies in clustering tasks. As a result, we propose a correlation-based attribute weighting approach for categorical attributes.
引用
收藏
页码:259 / 263
页数:5
相关论文
共 50 条
  • [1] An improved correlation-based algorithm with discretization for attribute reduction in data clustering
    Kannan, S. Senthamarai
    Ramaraj, N.
    [J]. Data Science Journal, 2009, 8 : 125 - 138
  • [2] A novel attribute weighting algorithm for clustering high-dimensional categorical data
    Bai, Liang
    Liang, Jiye
    Dang, Chuangyin
    Cao, Fuyuan
    [J]. PATTERN RECOGNITION, 2011, 44 (12) : 2843 - 2861
  • [3] CLUSTERING CATEGORICAL DATA BASED ON COMBINATIONS OF ATTRIBUTE VALUES
    Do, Hee-Jung
    Kim, Jae Yearn
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (12A): : 4393 - 4405
  • [4] Automated Attribute Weighting Fuzzy k-Centers Algorithm for Categorical Data Clustering
    Mau, Toan Nguyen
    Huynh, Van-Nam
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2021), 2021, 12898 : 205 - 217
  • [5] Data Correlation-Based Clustering in Sensor Networks
    Yeo, Myung Ho
    Lee, Mi Sook
    Lee, Seok Jae
    Yoo, Jae Soo
    [J]. CSA 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND ITS APPLICATIONS, PROCEEDINGS, 2008, : 332 - 337
  • [6] Learnable Weighting of Intra-Attribute Distances for Categorical Data Clustering with Nominal and Ordinal Attributes
    Zhang, Yiqun
    Cheung, Yiu-ming
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3560 - 3576
  • [7] A correlation-based approach to attribute selection in chemical graph mining
    Okada, Takashi
    [J]. NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2007, 3609 : 517 - 526
  • [8] A Comparison of Categorical Attribute Data Clustering Methods
    Hautamaki, Ville
    Pollanen, Antti
    Kinnunen, Tomi
    Lee, Kong Aik
    Li, Haizhou
    Franti, Pasi
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2014, 8621 : 53 - 62
  • [9] Correlation-based detection of attribute outliers
    Koh, Judice L. Y.
    Lee, Mong Li
    Hsu, Wynne
    Lam, Kai Tak
    [J]. ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 164 - +
  • [10] Data Correlation-Based Clustering Algorithm in Wireless Sensor Networks
    Yeo, Myungho
    Seo, Dongmin
    Yoo, Jaesoo
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2009, 3 (03): : 331 - 343