Active Informative Pairwise Constraint Formulation Algorithm for Constraint-Based Clustering

被引:2
|
作者
Zhong, Guoxiang [1 ]
Deng, Xiuqin [1 ]
Xu, Shengbing [1 ,2 ]
机构
[1] Guangdong Univ Technol, Sch Apply Math, Guangzhou 510520, Guangdong, Peoples R China
[2] Guangdong Univ Technol, Sch Comp, Guangzhou 510006, Guangdong, Peoples R China
关键词
Constraint-based clustering; pairwise constraint; weak sample; strong sample; symmetric relative entropy;
D O I
10.1109/ACCESS.2019.2923659
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Constraint-based clustering utilizes pairwise constraints to improve clustering performance. In this paper, we propose a novel formulation algorithm to generate more informative pairwise constraints from limited queries for the constraint-based clustering. Our method consists of two phases: pre-clustering and marking. The pre-clustering phase introduces the fuzzy c-means clustering (FCM) to generate the cluster knowledge that is composed of the membership degree and the cluster centers. In the marking phase, we first propose the weak sample with the larger uncertainty expressed by the entropy of the membership degree. Then, we study the strong sample that contains less uncertainty and should be closest to its cluster center. Finally, given weak samples in descending order of entropy, we formulate informative pairs with strong samples and seek answers using the second minimal symmetric relative entropy priority principle, which leads to more efficient queries. Making use of the pairwise constraint k-means clustering (PCKM) as the underlying constraint-based clustering algorithm, further data experiments are conducted in several datasets to verify the improvement of our method.
引用
收藏
页码:81983 / 81993
页数:11
相关论文
共 50 条
  • [21] Constraint-based algorithm for computing temporal invariants
    Rintanen, Jussi
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8761 : 665 - 673
  • [22] Constraint-based reachability
    Gotlieb, Arnaud
    Denmat, Tristan
    Lazaar, Nadjib
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2013, (107): : 25 - 43
  • [23] Constraint-based agents
    Nareyek, A
    [J]. CONSTRAINT-BASED AGENTS: AN ARCHITECTURE FOR CONSTRAINT-BASED MODELING AND LOCAL-SEARCH-BASED REASONING FOR PLANNING AND SCHEDULING IN OPEN AND DYNAMIC WORLDS, 2001, 2062 : 1 - +
  • [24] Constraint-Based Metrics
    Chris Golston
    [J]. Natural Language & Linguistic Theory, 1998, 16 : 719 - 770
  • [25] CONSTRAINT-BASED REASONING
    KASIF, S
    [J]. IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1991, 6 (06): : 55 - 55
  • [26] Constraint-based metrics
    Golston, C
    [J]. NATURAL LANGUAGE & LINGUISTIC THEORY, 1998, 16 (04) : 719 - 770
  • [27] Constraint-based scheduling
    Fromherz, MPJ
    [J]. PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 3231 - 3244
  • [28] Constraint-based lexica
    Bouma, G
    Van Eynde, F
    Flickinger, D
    [J]. LEXICON DEVELOPMENT FOR SPEECH AND LANGUAGE PROCESSING, 2000, 12 : 43 - +
  • [29] CONSTRAINT-BASED MODELING
    MUNDY, JL
    VROBEL, P
    JOYNSON, R
    [J]. IMAGE UNDERSTANDING WORKSHOP /, 1989, : 425 - 442
  • [30] Constraint-Based Refactoring
    Steimann, Friedrich
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2018, 40 (01):