Correlation Clustering with Noisy Input

被引:0
|
作者
Mathieu, Claire [1 ]
Schudy, Warren [1 ]
机构
[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Correlation clustering is a type of clustering that uses a. basic form of input data For every pair of data items, the input specifies whether they ale similar (belonging to the same cluster) or dissimilar (belonging to different clusters) This lamination may be inconsistent, and the goal is to find a clustering (partition of the vertices) that. disagrees with as few pieces of information as possible Colleration clustering is APX-hard for worst-case inputs We study the following semi-random noisy model to generate the input stall, from an arbitrary partition of the vertices into clusters. Then; for each pair of vertices, the similarity information is corrupted (noisy) independently with probability p Finally, an adversary generates the Input by choosing similality/dissimilarity information arbitrarily for each corrupted pair of vertices In this model, out algorithm produces a. clustering with cost at most 1 + O(n(-1/6)) tones the cost of the optimal clustering, as long as p <= 1/2 71- n(-1/3) Moreover, if all clusters have size at least(1) c(1)root n then we can exactly reconstruct the planted clustering If the noise p is small, that p <= n(-delta)/60, then we can exactly reconstruct all clusters of the planted clustering that have size at least 3150/delta, and provide a certificate (witness) proving that those clusters file in any optimal clustering Among other techniques, we use the natural semi-definite programming relaxation followed by an ink-nesting rounding phase The analysis uses SDP duality and spectral properties of random mattices.
引用
收藏
页码:712 / 728
页数:17
相关论文
共 50 条
  • [1] Clustering with Noisy Queries
    Mazumdar, Arya
    Saha, Barna
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] Adaptive Noisy Clustering
    Chichignoud, Michael
    Loustau, Sebastien
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (11) : 7279 - 7292
  • [3] Censored Regression With Noisy Input
    Liu, Zhaoting
    Li, Chunguang
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (19) : 5071 - 5082
  • [4] Clustering Noisy Temporal Data
    Grant, Paul
    Islam, Md Zahidul
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 184 - 193
  • [5] Noisy Sparse Subspace Clustering
    Wang, Yu-Xiang
    Xu, Huan
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [6] Input variable selection for PLS modeling using nearest correlation spectral clustering
    Fujiwara, Koichi
    Sawada, Hiroshi
    Kano, Manabu
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 118 : 109 - 119
  • [7] Modulation of input gain and response gain by noisy synaptic input
    Aslı Ayaz
    Frances S Chance
    BMC Neuroscience, 10 (Suppl 1)
  • [8] A novel particle filter with noisy input
    Zhang, Xinyu
    Gao, Miao
    Li, Tiancheng
    Duan, Jiemin
    Yi, Yingmin
    Liang, Junli
    DIGITAL SIGNAL PROCESSING, 2025, 161
  • [9] EVALUATION OF BOOLEAN FUNCTIONS ON NOISY INPUT
    EFIMOV, AN
    LUKZILBERMAN, EV
    CYBERNETICS, 1987, 23 (01): : 113 - 122
  • [10] Church Synthesis Problem for Noisy Input
    Velner, Yaron
    Rabinovich, Alexander
    FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATIONAL STRUCTURES, 2011, 6604 : 275 - +