Active Informative Pairwise Constraint Formulation Algorithm for Constraint-Based Clustering

Cited by: 2
Authors
Zhong, Guoxiang [1]
Deng, Xiuqin [1]
Xu, Shengbing [1, 2]
Affiliations
[1] Guangdong Univ Technol, Sch Apply Math, Guangzhou 510520, Guangdong, Peoples R China
[2] Guangdong Univ Technol, Sch Comp, Guangzhou 510006, Guangdong, Peoples R China
Keywords
Constraint-based clustering; pairwise constraint; weak sample; strong sample; symmetric relative entropy
DOI
10.1109/ACCESS.2019.2923659
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Constraint-based clustering uses pairwise constraints to improve clustering performance. In this paper, we propose a novel formulation algorithm that generates more informative pairwise constraints from a limited number of queries. Our method consists of two phases: pre-clustering and marking. The pre-clustering phase applies fuzzy c-means clustering (FCM) to generate cluster knowledge, which consists of the membership degrees and the cluster centers. In the marking phase, we first define weak samples as those with larger uncertainty, measured by the entropy of their membership degrees. We then define strong samples as those that carry less uncertainty and lie closest to their cluster centers. Finally, taking the weak samples in descending order of entropy, we pair them with strong samples to form informative pairs and seek answers according to the second minimal symmetric relative entropy priority principle, which leads to more efficient queries. Using pairwise constraint k-means clustering (PCKM) as the underlying constraint-based clustering algorithm, we conduct experiments on several datasets to verify the improvement achieved by our method.
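The quantities named in the abstract can be illustrated with a short Python sketch. It is not the authors' implementation: the toy data, the fixed cluster centers, the fuzzifier m = 2, and all function names are assumptions made for illustration. Memberships are computed with the standard FCM membership formula for given centers instead of running the full FCM iteration. Because the abstract does not spell out the "second minimal" priority rule, the sketch only ranks strong samples by symmetric relative entropy.

```python
import numpy as np

def sq_dists(X, centers):
    """Squared Euclidean distances, shape (n_samples, n_clusters)."""
    return ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)

def fcm_memberships(X, centers, m=2.0, eps=1e-12):
    """Standard FCM membership degrees for fixed centers (fuzzifier m is assumed)."""
    inv = (sq_dists(X, centers) + eps) ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def membership_entropy(U, eps=1e-12):
    """Shannon entropy of each sample's membership vector."""
    U = np.clip(U, eps, 1.0)
    return -(U * np.log(U)).sum(axis=1)

def symmetric_relative_entropy(p, q, eps=1e-12):
    """KL(p||q) + KL(q||p), written as sum((p - q) * (log p - log q))."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.sum((p - q) * (np.log(p) - np.log(q))))

# Hypothetical toy data: two tight groups plus one ambiguous point (index 4).
X = np.array([[0.0, 0.0], [0.2, 0.0], [4.0, 0.0], [4.2, 0.1], [2.0, 0.0]])
centers = np.array([[0.05, 0.0], [4.05, 0.0]])  # assumed pre-clustering result

U = fcm_memberships(X, centers)
entropy = membership_entropy(U)

# Weak samples: visited in descending order of membership entropy.
weak_order = [int(i) for i in np.argsort(-entropy)]

# Strong samples: here, the sample closest to each cluster center.
strong = [int(i) for i in sq_dists(X, centers).argmin(axis=0)]

# For the most uncertain sample, rank strong samples by symmetric relative entropy.
weakest = weak_order[0]
ranked_strong = sorted(
    strong, key=lambda s: symmetric_relative_entropy(U[weakest], U[s])
)
```

In this toy setup the ambiguous point has nearly equal membership in both clusters, so it is visited first, and the ranking lists candidate strong samples from most to least similar in membership distribution. Which position in that ranking the paper queries first is determined by its priority principle and is left open here.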
Pages: 81983-81993
Number of pages: 11