Study of clustering algorithm based on model data

被引:0
|
作者
Li, Kai [1 ]
Cui, Li-Juan [2 ]
机构
[1] HeBei Univ, Sch Math & Comp, Baoding 071002, Peoples R China
[2] HeBei Univ, Lib, Baoding 071002, Peoples R China
关键词
model clustering; measure space; validation of clustering; diversity;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering technique is a key tool in data mining and pattern recognition. Usually, objects for some traditional clustering algorithms are expressed in the form of vectors, which consist of some components to be described as features. However, objects in real tasks may be some models which are clustered other than data points, for example! neural networks, decision trees, support vector machines, etc. This paper studies the clustering algorithm based on model data. By defining the extended measure, clustering methods are studied for the abstract data objects. Framework of clustering algorithm for models is presented. To validate the effectiveness of models clustering algorithm, we choose the hierarchical model clustering algorithm in the experiments. Models in clustering algorithm are BP(Back Propagation) neural networks and learning method is BP algorithm. Measures are chosen as both same-fault measure and double-fault measure for pairwise of models. Distances between clusters are the single link and the complete link, respectively. By this way, we may obtain part of neural network models which are from each cluster and improve diversity of neural network models. Then, part of models is ensembled. Moreover, we also study the relations between the number of clusters in clustering analysis, the size of ensemble learning, and performance of ensemble learning by experiments. Experimental results show that performance of ensemble learning by choosing part of models using clustering of models is improved.
引用
收藏
页码:3961 / +
页数:2
相关论文
共 50 条
  • [1] Inductive Model of Data Clustering based on the Agglomerative Hierarchical Algorithm
    Babichev, Sergii
    Taif, Mohamed Ali
    Lytvynenko, Volodymyr
    PROCEEDINGS OF THE 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2016, : 19 - 22
  • [2] Study on Spatio-Temporal Indexing Model of Geohazard Monitoring Data Based on Data Stream Clustering Algorithm
    Li, Jiahao
    Song, Weiwei
    Chen, Jianglong
    Wei, Qunlan
    Wang, Jinxia
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2024, 13 (03)
  • [3] Study on Ensemble based Clustering Algorithm for Gene Expression Data
    Chu, Zhenfang
    Cao, Buyang
    Yu, Fang
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [4] Possibility Clustering Algorithm for Incomplete Data Based on a Deep Computing Model
    Li, Dongping
    Yang, Yingchun
    Yue, Qiang
    Cheng, Liqi
    Song, Jie
    Liu, Yuyan
    JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP03)
  • [5] An improved anonymity model for big data security based on clustering algorithm
    Yin, Chunyong
    Zhang, Sun
    Xi, Jinwen
    Wang, Jin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (07):
  • [6] CLUSTERING STUDY BASED ON A LARGE DATA SET OF QUANTUM GENETIC SPECTRAL CLUSTERING ALGORITHM
    Jiang Yong
    Tan Huailiang
    Li Guangwen
    Zhou Hengwei
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 435 - 440
  • [7] Study on microblog public opinion data mining algorithm based on multi-visual clustering model
    Li, Lin-lin
    Hou, Wei-zhen
    Liu, Jing
    INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2020, 13 (02) : 151 - 165
  • [8] Study on Library Management System Based on Data Mining and Clustering Algorithm
    Wang J.
    Alroobaea R.
    Baqasah A.M.
    Althobaiti A.
    Kansal L.
    Informatica (Slovenia), 2022, 46 (09): : 17 - 24
  • [9] Clustering algorithm based on filter model
    Qiu, Bao-Zhi
    Zhang, Rui-Lin
    Li, Xiang-Li
    Kongzhi yu Juece/Control and Decision, 2020, 35 (05): : 1091 - 1101
  • [10] A DATA STREAMS CLUSTERING ALGORITHM BASED ON INTERVAL DATA
    Li, Yan
    Ye, Ming
    Wang, Huiwen
    Liu, Dan
    Che, Yin
    PROCEEDINGS OF THE 38TH INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2008, : 2775 - 2778