Study of clustering algorithm based on model data

被引:0
|
作者
Li, Kai [1 ]
Cui, Li-Juan [2 ]
机构
[1] HeBei Univ, Sch Math & Comp, Baoding 071002, Peoples R China
[2] HeBei Univ, Lib, Baoding 071002, Peoples R China
关键词
model clustering; measure space; validation of clustering; diversity;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering technique is a key tool in data mining and pattern recognition. Usually, objects for some traditional clustering algorithms are expressed in the form of vectors, which consist of some components to be described as features. However, objects in real tasks may be some models which are clustered other than data points, for example! neural networks, decision trees, support vector machines, etc. This paper studies the clustering algorithm based on model data. By defining the extended measure, clustering methods are studied for the abstract data objects. Framework of clustering algorithm for models is presented. To validate the effectiveness of models clustering algorithm, we choose the hierarchical model clustering algorithm in the experiments. Models in clustering algorithm are BP(Back Propagation) neural networks and learning method is BP algorithm. Measures are chosen as both same-fault measure and double-fault measure for pairwise of models. Distances between clusters are the single link and the complete link, respectively. By this way, we may obtain part of neural network models which are from each cluster and improve diversity of neural network models. Then, part of models is ensembled. Moreover, we also study the relations between the number of clusters in clustering analysis, the size of ensemble learning, and performance of ensemble learning by experiments. Experimental results show that performance of ensemble learning by choosing part of models using clustering of models is improved.
引用
收藏
页码:3961 / +
页数:2
相关论文
共 50 条
  • [21] Data Clustering Based on Approach of Genetic Algorithm
    Wang, Hai-hui
    Zhao, Wen-jie
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2753 - 2757
  • [22] The Clustering Algorithm Study of Gene Expression Data
    He Rui
    Lin Chunmei
    ENVIRONMENTAL BIOTECHNOLOGY AND MATERIALS ENGINEERING, PTS 1-3, 2011, 183-185 : 93 - +
  • [23] A hybrid data clustering algorithm based on improved krill herd algorithm and KHM clustering
    Wang, Qiu-Ping
    Ding, Cheng
    Wang, Xiao-Feng
    Kongzhi yu Juece/Control and Decision, 2020, 35 (10): : 2449 - 2458
  • [24] An adaptive grid-density based data stream clustering algorithm based on uncertainty model
    Liu, Zhuo
    Yang, Yue
    Zhang, Jianpei
    Yang, Jing
    Chu, Yan
    Zhang, Zebao
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2014, 51 (11): : 2518 - 2527
  • [25] A Document Clustering Method based on Hierarchical Algorithm with Model Clustering
    Sun, Haojun
    Liu, Zhihui
    Kong, Lingjun
    2008 22ND INTERNATIONAL WORKSHOPS ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOLS 1-3, 2008, : 1229 - +
  • [26] Clustering Algorithm Based on Time Series Similarity to Web Data Clustering
    Yang Yan
    Yao Hua-Xiong
    Li Rong
    PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 1373 - 1377
  • [27] An Industrial Network Intrusion Detection Algorithm Based on Multifeature Data Clustering Optimization Model
    Liang, Wei
    Li, Kuan-Ching
    Long, Jing
    Kui, Xiaoyan
    Zomaya, Albert Y.
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (03) : 2063 - 2071
  • [28] Online Data Segmentation Based on Clustering Algorithm and Autoregressive Model for Human Actions Recognition
    Jiang, M.
    Liu, X. L.
    Zhang, Z.
    Zhao, Y.
    Zhang, R.
    Qiu, S.
    Wu, D. H.
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 412 - 416
  • [29] Research On Novel Model of Data Mining Based on Improved Association Rules and Clustering Algorithm
    Tan, Qing
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY (EMCS 2017), 2017, 61 : 522 - 526
  • [30] APPLICATION OF MODEL-BASED CLUSTERING ALGORITHM TO COVID-19 VACCINE DATA
    Kalkan, Seda Bagdatli
    Basar, Oezlem Deniz
    JP JOURNAL OF BIOSTATISTICS, 2022, 21 (02) : 141 - 154