Correlation Clustering with Noisy Input

Cited by: 0
Authors
Mathieu, Claire [1]
Schudy, Warren [1]
Affiliation
[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Correlation clustering is a type of clustering that uses a basic form of input data: for every pair of data items, the input specifies whether they are similar (belonging to the same cluster) or dissimilar (belonging to different clusters). This information may be inconsistent, and the goal is to find a clustering (partition of the vertices) that disagrees with as few pieces of information as possible. Correlation clustering is APX-hard for worst-case inputs. We study the following semi-random noisy model to generate the input: start from an arbitrary partition of the vertices into clusters. Then, for each pair of vertices, the similarity information is corrupted (noisy) independently with probability $p$. Finally, an adversary generates the input by choosing similarity/dissimilarity information arbitrarily for each corrupted pair of vertices. In this model, our algorithm produces a clustering with cost at most $1 + O(n^{-1/6})$ times the cost of the optimal clustering, as long as $p \le 1/2 - n^{-1/3}$. Moreover, if all clusters have size at least $c_1 \sqrt{n}$, then we can exactly reconstruct the planted clustering. If the noise $p$ is small, that is, $p \le n^{-\delta}/60$, then we can exactly reconstruct all clusters of the planted clustering that have size at least $3150/\delta$, and provide a certificate (witness) proving that those clusters are in any optimal clustering. Among other techniques, we use the natural semi-definite programming relaxation followed by an interesting rounding phase. The analysis uses SDP duality and spectral properties of random matrices.
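The semi-random noisy model in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm (there is no SDP or rounding here); it only generates an input from a planted partition and counts disagreements. The names `noisy_instance`, `cost`, and `flip` are hypothetical, and the always-flipping adversary is just one possible choice, since the model lets the adversary label corrupted pairs arbitrarily.

```python
import itertools
import random


def noisy_instance(clusters, p, adversary, rng):
    """Return {(u, v): True if labelled similar} for every pair u < v."""
    label_of = {v: i for i, c in enumerate(clusters) for v in c}
    similar = {}
    for u, v in itertools.combinations(sorted(label_of), 2):
        truth = label_of[u] == label_of[v]
        # Each pair is corrupted independently with probability p;
        # the adversary then chooses the label of a corrupted pair freely.
        similar[(u, v)] = adversary(u, v, truth) if rng.random() < p else truth
    return similar


def cost(similar, clusters):
    """Number of pairs whose input label disagrees with the clustering."""
    label_of = {v: i for i, c in enumerate(clusters) for v in c}
    return sum(
        is_sim != (label_of[u] == label_of[v])
        for (u, v), is_sim in similar.items()
    )


def flip(u, v, truth):
    # Hypothetical adversary: always reports the opposite of the truth.
    return not truth


planted = [[0, 1, 2, 3], [4, 5, 6], [7, 8]]  # arbitrary planted partition
instance = noisy_instance(planted, p=0.1, adversary=flip, rng=random.Random(0))
# With the flipping adversary, this equals the number of corrupted pairs.
print(cost(instance, planted))
```

The cost of the planted clustering is only an upper bound on the optimal cost: an adversary may corrupt pairs so that some other clustering disagrees with fewer labels.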
Pages: 712-728
Number of pages: 17