Fairness, Semi-Supervised Learning, and More: A General Framework for Clustering with Stochastic Pairwise Constraints

Cited by: 0
Authors
Brubach, Brian [1]
Chakrabarti, Darshan [2]
Dickerson, John P. [3]
Srinivasan, Aravind [3]
Tsepenekas, Leonidas [3]
Affiliations
[1] Wellesley Coll, Wellesley, MA 02181 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Univ Maryland, College Pk, MD 20742 USA
Funding
U.S. National Science Foundation;
Keywords
APPROXIMATION ALGORITHMS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Metric clustering is fundamental in areas ranging from Combinatorial Optimization and Data Mining to Machine Learning and Operations Research. However, in a variety of situations we may have additional requirements or knowledge, distinct from the underlying metric, regarding which pairs of points should be clustered together. To capture and analyze such scenarios, we introduce a novel family of stochastic pairwise constraints, which we incorporate into several essential clustering objectives (radius/median/means). Moreover, we demonstrate that these constraints can succinctly model an intriguing collection of applications, including, among others, Individual Fairness in clustering and Must-link constraints in semi-supervised learning. Our main result consists of a general framework that yields approximation algorithms with provable guarantees for important clustering objectives, while at the same time producing solutions that respect the stochastic pairwise constraints. Furthermore, for certain objectives we devise improved results in the case of Must-link constraints, which are also the best possible from a theoretical perspective. Finally, we present experimental evidence that validates the effectiveness of our algorithms.
Pages: 6822-6830
Number of pages: 9