On the Size and Recovery of Submatrices of Ones in a Random Binary Matrix

被引:0
|
作者
Sun, Xing [1 ]
Nobel, Andrew B. [2 ]
机构
[1] Merck Res Labs, N Wales, PA 19454 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
关键词
frequent itemset mining; bipartite graph; biclique; submatrix of 1s; statistical significance;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Binary matrices, and their associated submatrices of 1s, play a central role in the study of random bipartite graphs and in core data mining problems such as frequent itemset mining (FIM). Motivated by these connections, this paper addresses several statistical questions regarding submatrices of 1s in a random binary matrix with independent Bernoulli entries. We establish a three-point concentration result, and a related probability bound, for the size of the largest square submatrix of 1s in a square Bernoulli matrix, and extend these results to non-square matrices and submatrices with fixed aspect ratios. We then consider the noise sensitivity of frequent itemset mining under a simple binary additive noise model, and show that, even at small noise levels, large blocks of 1s leave behind fragments of only logarithmic size. As a result, standard FIM algorithms, which search only for submatrices of 1s, cannot directly recover such blocks when noise is present. On the positive side, we show that an error-tolerant frequent itemset criterion can recover a submatrix of 1s against a background of 0s plus noise, even when the size of the submatrix of 1s is very small.
引用
收藏
页码:2431 / 2453
页数:23
相关论文
共 50 条
  • [31] Off-diagonal submatrices of a Hermitian matrix
    Li, CK
    Poon, YT
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 2004, 132 (10) : 2849 - 2856
  • [32] An Efficient VoIP Steganography Based on Random Binary Matrix
    Qin, Jie
    Tian, Hui
    Huang, Yongfeng
    Liu, Jin
    Chen, Yonghong
    Wang, Tian
    Cai, Yiqiao
    Wang, Xu An
    2015 10TH INTERNATIONAL CONFERENCE ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC), 2015, : 462 - 465
  • [33] Correlations between the ranks of submatrices and weights of random codes
    Klyachko, Alexander A.
    Ozen, Ibrahim
    FINITE FIELDS AND THEIR APPLICATIONS, 2009, 15 (04) : 497 - 516
  • [34] Size and structure of random ordered binary decision diagrams
    Gröpl, C
    Prömel, HJ
    Srivastav, A
    STACS 98 - 15TH ANNUAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE, 1998, 1373 : 238 - 248
  • [35] Spanning tree size in random binary search trees
    Panholzer, A
    Prodinger, H
    ANNALS OF APPLIED PROBABILITY, 2004, 14 (02): : 718 - 733
  • [36] MATRIX APPROACH TO SYNCHRONIZATION RECOVERY FOR BINARY CYCLIC CODES
    TAVARES, SE
    FUKADA, M
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1969, 15 (1P1) : 93 - +
  • [37] Direct solver for pentadiagonal matrix containing tridiagonal submatrices
    Das, Sandipan Kumar
    NUMERICAL HEAT TRANSFER PART B-FUNDAMENTALS, 2017, 72 (01) : 1 - 20
  • [38] Textures with two traceless submatrices of the neutrino mass matrix
    Alhendi, H. A.
    Lashin, E. I.
    Mudlej, A. A.
    PHYSICAL REVIEW D, 2008, 77 (01)
  • [39] RANKS OF SUBMATRICES IN THE REFLEXIVE SOLUTIONS OF SOME MATRIX EQUATIONS
    Guerarra, Sihem
    Belkhiri, Radja
    FACTA UNIVERSITATIS-SERIES MATHEMATICS AND INFORMATICS, 2024, 39 (01): : 33 - 49