Building a Decision Cluster Classification Model for High Dimensional Data by a Variable Weighting k-Means Method

被引:0
|
作者
Li, Yan [1 ]
Hung, Edward [1 ]
Chung, Korris [1 ]
Huang, Joshua [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[2] Univ Hong Kong, East Bus Tech Inst, Hong Kong, Hong Kong, Peoples R China
关键词
Clustering; classification; W-k-means; k-NN;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a now classification method (ADCC) for high dimensional data is proposed. In this method, a decision cluster classification model (DCC) consists of a set of disjoint decision clusters, each labeled with a dominant class that determines the class of new objects falling in the cluster. A cluster tree is first generated from a training data set by recursively calling a variable weighting k-means algorithm. Then, the DCC model is selected from the tree. Anderson-Darling test is used to determine the stopping condition of the tree growing. A series of experiments on both synthetic and real data sets have shown that the new classification method (ADCC) performed better in accuracy and scalability than the existing methods of k-NN, decision tree and SVM. It is particularly suitable for large, high dimensional data with many classes.
引用
收藏
页码:337 / +
页数:3
相关论文
共 50 条
  • [21] Classification of aquifer vulnerability using K-means cluster analysis
    Javadi, S.
    Hashemy, S. M.
    Mohammadi, K.
    Howard, K. W. F.
    Neshat, A.
    JOURNAL OF HYDROLOGY, 2017, 549 : 27 - 37
  • [22] K-Means Method for Grouping in Hybrid MapReduce Cluster
    Yang, Yang
    Long, Xiang
    Jiang, Bo
    JOURNAL OF COMPUTERS, 2013, 8 (10) : 2648 - 2655
  • [23] Economy level and freight model utilizing quick K-means cluster method
    School of Traffic and Transportation Engineering, Beijing Jiaotong University, Beijing 100044, China
    不详
    Jilin Daxue Xuebao (Gongxueban), 2008, 5 (1040-1043):
  • [24] Standardization and weighting of variables for the fuzzy K-means clustering of discontinuity data
    Hammah, RE
    Curran, JH
    PACIFIC ROCKS 2000: ROCK AROUND THE RIM, 2000, : 659 - 666
  • [25] Pedestrian classification using K-means and Random Decision Forests
    Alencar, Francisco A. R.
    Massera Filho, Carlos
    Gomes, Diego
    Wolf, Denis F.
    2014 2ND BRAZILIAN ROBOTICS SYMPOSIUM (SBR) / 11TH LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS) / 6TH ROBOCONTROL WORKSHOP ON APPLIED ROBOTICS AND AUTOMATION, 2014, : 103 - 108
  • [26] Building a Decision Cluster Forest Model to Classify High Dimensional Data with Multi-classes
    Li, Yan
    Hung, Edward
    ADVANCES IN MACHINE LEARNING, PROCEEDINGS, 2009, 5828 : 263 - 277
  • [27] k-Means Discriminant Maps for Data Visualization and Classification
    Nhat, Vo Dinh Minh
    Lee, SungYoung
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 1187 - 1191
  • [28] Research on Clothing Styles Classification Model Based on the MDS and K-means Method
    Xu, Huijuan
    Zou, Fengyuan
    Wei, Jing
    Zhang, Ying
    ADVANCES IN TEXTILE ENGINEERING, 2011, 331 : 616 - +
  • [29] A Weighting k-Means Clustering Approach by Integrating Intra-Cluster and Inter-Cluster Distances
    Huang X.-H.
    Wang C.
    Xiong L.-Y.
    Zeng H.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (12): : 2836 - 2848
  • [30] A Novel K-Means Classification Method with Genetic Algorithm
    Li, Xuesi
    Jiang, Kai
    Wang, Hongbo
    Zhu, Xuejun
    Shi, Ruochong
    Shi, Haobin
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC 2017), 2017, : 40 - 44