On the Size and Recovery of Submatrices of Ones in a Random Binary Matrix

被引:0
|
作者
Sun, Xing [1 ]
Nobel, Andrew B. [2 ]
机构
[1] Merck Res Labs, N Wales, PA 19454 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
关键词
frequent itemset mining; bipartite graph; biclique; submatrix of 1s; statistical significance;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Binary matrices, and their associated submatrices of 1s, play a central role in the study of random bipartite graphs and in core data mining problems such as frequent itemset mining (FIM). Motivated by these connections, this paper addresses several statistical questions regarding submatrices of 1s in a random binary matrix with independent Bernoulli entries. We establish a three-point concentration result, and a related probability bound, for the size of the largest square submatrix of 1s in a square Bernoulli matrix, and extend these results to non-square matrices and submatrices with fixed aspect ratios. We then consider the noise sensitivity of frequent itemset mining under a simple binary additive noise model, and show that, even at small noise levels, large blocks of 1s leave behind fragments of only logarithmic size. As a result, standard FIM algorithms, which search only for submatrices of 1s, cannot directly recover such blocks when noise is present. On the positive side, we show that an error-tolerant frequent itemset criterion can recover a submatrix of 1s against a background of 0s plus noise, even when the size of the submatrix of 1s is very small.
引用
收藏
页码:2431 / 2453
页数:23
相关论文
共 50 条
  • [1] ON THE NONSINGULARITY OF PRINCIPAL SUBMATRICES OF A RANDOM ORTHOGONAL MATRIX
    KARIYA, T
    WU, CFJ
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1985, 12 (03) : 353 - 357
  • [2] On the maximal size of large-average and ANOVA-fit submatrices in a Gaussian random matrix
    Sun, Xing
    Nobel, Andrew B.
    BERNOULLI, 2013, 19 (01) : 275 - 294
  • [3] Even 2 × 2 Submatrices of a Random Zero-One Matrix
    Anant P. Godbole
    Joseph A. Johnson
    Graphs and Combinatorics, 2004, 20 : 457 - 466
  • [4] Binary factorizations of the matrix of all ones
    Trefois, Maguy
    Van Dooren, Paul
    Delvenne, Jean-Charles
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2015, 468 : 63 - 79
  • [5] Invertibility of random submatrices via tail-decoupling and a matrix Chernoff inequality
    Chretien, Stephane
    Darses, Sebastien
    STATISTICS & PROBABILITY LETTERS, 2012, 82 (07) : 1479 - 1487
  • [6] Even 2 x 2 submatrices of a random zero-one matrix
    Godbole, AP
    Johnson, JA
    GRAPHS AND COMBINATORICS, 2004, 20 (04) : 457 - 466
  • [7] On the rank of a random binary matrix
    Cooper, Colin
    Frieze, Alan
    Pegden, Wesley
    ELECTRONIC JOURNAL OF COMBINATORICS, 2019, 26 (04):
  • [8] Partition of a Binary Matrix into k (k ≥ 3) Exclusive Row and Column Submatrices Is Difficult
    Liu, Peiqiang
    Zhu, Daming
    Xiao, Jinjie
    Xie, Qingsong
    Mao, Yanyan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [9] Detection and Recovery of Hidden Submatrices
    Dadon, Marom
    Huleihel, Wasim
    Bendory, Tamir
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2024, 10 : 69 - 82
  • [10] Principal submatrices of a Hermitian matrix
    Li, CK
    Poon, YT
    LINEAR & MULTILINEAR ALGEBRA, 2003, 51 (02): : 199 - 208