Building a Decision Cluster Classification Model for High Dimensional Data by a Variable Weighting k-Means Method

被引：0

作者：

Li, Yan ^{[1
]}

Hung, Edward ^{[1
]}

Chung, Korris ^{[1
]}

Huang, Joshua ^{[2
]}

机构：

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China

[2] Univ Hong Kong, East Bus Tech Inst, Hong Kong, Hong Kong, Peoples R China

来源：

AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 2008年 / 5360卷

关键词：

Clustering; classification; W-k-means; k-NN;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a now classification method (ADCC) for high dimensional data is proposed. In this method, a decision cluster classification model (DCC) consists of a set of disjoint decision clusters, each labeled with a dominant class that determines the class of new objects falling in the cluster. A cluster tree is first generated from a training data set by recursively calling a variable weighting k-means algorithm. Then, the DCC model is selected from the tree. Anderson-Darling test is used to determine the stopping condition of the tree growing. A series of experiments on both synthetic and real data sets have shown that the new classification method (ADCC) performed better in accuracy and scalability than the existing methods of k-NN, decision tree and SVM. It is particularly suitable for large, high dimensional data with many classes.

引用

页码：337 / +

页数：3

共 50 条

[21] Classification of aquifer vulnerability using K-means cluster analysis
Javadi, S.
Hashemy, S. M.
Mohammadi, K.
Howard, K. W. F.
Neshat, A.
JOURNAL OF HYDROLOGY, 2017, 549 : 27 - 37
[22] K-Means Method for Grouping in Hybrid MapReduce Cluster
Yang, Yang
Long, Xiang
Jiang, Bo
JOURNAL OF COMPUTERS, 2013, 8 (10) : 2648 - 2655
[23] Economy level and freight model utilizing quick K-means cluster method
School of Traffic and Transportation Engineering, Beijing Jiaotong University, Beijing 100044, China
不详
Jilin Daxue Xuebao (Gongxueban), 2008, 5 (1040-1043):
[24] Standardization and weighting of variables for the fuzzy K-means clustering of discontinuity data
Hammah, RE
Curran, JH
PACIFIC ROCKS 2000: ROCK AROUND THE RIM, 2000, : 659 - 666
[25] Pedestrian classification using K-means and Random Decision Forests
Alencar, Francisco A. R.
Massera Filho, Carlos
Gomes, Diego
Wolf, Denis F.
2014 2ND BRAZILIAN ROBOTICS SYMPOSIUM (SBR) / 11TH LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS) / 6TH ROBOCONTROL WORKSHOP ON APPLIED ROBOTICS AND AUTOMATION, 2014, : 103 - 108
[26] Building a Decision Cluster Forest Model to Classify High Dimensional Data with Multi-classes
Li, Yan
Hung, Edward
ADVANCES IN MACHINE LEARNING, PROCEEDINGS, 2009, 5828 : 263 - 277
[27] k-Means Discriminant Maps for Data Visualization and Classification
Nhat, Vo Dinh Minh
Lee, SungYoung
APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 1187 - 1191
[28] Research on Clothing Styles Classification Model Based on the MDS and K-means Method
Xu, Huijuan
Zou, Fengyuan
Wei, Jing
Zhang, Ying
ADVANCES IN TEXTILE ENGINEERING, 2011, 331 : 616 - +
[29] A Weighting k-Means Clustering Approach by Integrating Intra-Cluster and Inter-Cluster Distances
Huang X.-H.
Wang C.
Xiong L.-Y.
Zeng H.
Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (12): : 2836 - 2848
[30] A Novel K-Means Classification Method with Genetic Algorithm
Li, Xuesi
Jiang, Kai
Wang, Hongbo
Zhu, Xuejun
Shi, Ruochong
Shi, Haobin
PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC 2017), 2017, : 40 - 44

← 1 2 3 4 5 →