FastStep: Scalable Boolean Matrix Decomposition

Cited by: 4
Authors
Araujo, Miguel [1, 2]
Ribeiro, Pedro [1]
Faloutsos, Christos [2]
Affiliations
[1] Univ Porto, Cracs INESC TEC, Oporto, Portugal
[2] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
DOI
10.1007/978-3-319-31753-3_37
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Matrix decomposition methods are applied to a wide range of tasks, such as data denoising, dimensionality reduction, co-clustering and community detection. However, in the presence of Boolean inputs, common methods either do not scale or do not provide a Boolean reconstruction, which results in high reconstruction error and low interpretability of the decomposition. We propose a novel step decomposition of Boolean matrices into non-negative factors with Boolean reconstruction. By formulating the problem using threshold operators and suitably relaxing it, we provide a scalable algorithm that can be applied to Boolean matrices with millions of non-zero entries. We show that our method achieves significantly lower reconstruction error than standard state-of-the-art algorithms. We also show that the decomposition keeps its interpretability by analyzing communities in a flights dataset (where the matrix is interpreted as a graph in which nodes are airports) and in a movie-ratings dataset with 10 million non-zeros.
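The idea of a Boolean reconstruction from non-negative factors via a threshold operator can be illustrated with a small sketch. This is not the authors' algorithm: it does not learn the factors or apply any relaxation, it only shows how thresholding a product of two non-negative factor matrices yields a 0/1 matrix whose error against the input can be counted. The names `step_reconstruct` and `reconstruction_error`, the threshold value `tau=0.5`, and the toy factors are all assumptions made for illustration.

```python
from typing import List

Matrix = List[List[float]]
BoolMatrix = List[List[int]]


def step_reconstruct(a: Matrix, b: Matrix, tau: float) -> BoolMatrix:
    """Threshold the product a @ b^T: cell (i, j) is 1 when its score reaches tau.

    The threshold tau is a hypothetical parameter chosen for this sketch.
    """
    return [
        [1 if sum(x * y for x, y in zip(row_a, row_b)) >= tau else 0 for row_b in b]
        for row_a in a
    ]


def reconstruction_error(m: BoolMatrix, r: BoolMatrix) -> int:
    """Count the cells where the Boolean reconstruction disagrees with the input."""
    return sum(mi != ri for row_m, row_r in zip(m, r) for mi, ri in zip(row_m, row_r))


# Toy 4x4 Boolean input made of two 2x2 blocks (two "communities").
M = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Hand-picked non-negative factors with one column per block (assumed, not learned).
A = [[1.0, 0.0], [0.8, 0.1], [0.0, 1.0], [0.2, 0.9]]
B = [[1.0, 0.0], [0.9, 0.0], [0.1, 1.0], [0.0, 0.7]]

R = step_reconstruct(A, B, tau=0.5)
error = reconstruction_error(M, R)
```

Because the output of the threshold is itself Boolean, the error is a count of mismatched cells rather than a real-valued residual, which is what makes the reconstruction directly comparable with the input.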
Pages: 461-473
Page count: 13
Related papers
50 in total
  • [1] Extended Boolean Matrix Decomposition
    Lu, Haibing
    Vaidya, Jaideep
    Atluri, Vijayalakshmi
    Hong, Yuan
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 317 - +
  • [2] Scalable boolean matrix multiplication with applications on optical buses
    Li, KQ
    [J]. PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 407 - 410
  • [3] XOR-based Boolean Matrix Decomposition
    Wicker, Jorg
    Hua, Yan Cathy
    Rebello, Rayner
    Pfahringer, Bernhard
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 638 - 647
  • [4] Boolean Matrix Decomposition by Formal Concept Sampling
    Osicka, Petr
    Trnecka, Martin
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2243 - 2246
  • [5] Characteristic matrix of covering and its application to Boolean matrix decomposition
    Wang, Shiping
    Zhu, William
    Zhu, Qingxin
    Min, Fan
    [J]. INFORMATION SCIENCES, 2014, 263 : 186 - 197
  • [6] Optimal Boolean matrix decomposition: Application to role engineering
    Lu, Haibing
    Vaidya, Jaideep
    Atluri, Vijayalakshmi
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 297 - +
  • [7] Role Mining Using Boolean Matrix Decomposition With Hierarchy
    Ye, Wei
    Li, Ruixuan
    Li, Huaqing
    [J]. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 805 - 812
  • [8] Improved Algorithms for Exact and Approximate Boolean Matrix Decomposition
    Sun, Yuan
    Ye, Shiwei
    Sun, Yi
    Kameda, Tsunehiko
    [J]. PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 301 - 310
  • [9] Kronecker product decomposition of Boolean matrix with application to topological structure analysis of Boolean networks
    Wei, Xiaomeng
    Li, Haitao
    Zhao, Guodong
    [J]. MATHEMATICAL MODELLING AND CONTROL, 2023, 3 (04): : 306 - 315
  • [10] The Framework of Protein Function Prediction Based on Boolean Matrix Decomposition
    Liu, Lin
    Tang, Lin
    Tang, Mingjing
    Zhou, Wei
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (05): : 1020 - 1033