Multi-Assignment Clustering for Boolean Data

被引:0
|
作者
Frank, Mario [1 ]
Streich, Andreas P. [2 ]
Basin, David [3 ]
Buhmann, Joachim M. [3 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[2] Phonak AG, Adv Concepts & Technol, CH-8712 Stafa, Switzerland
[3] ETH, Dept Comp Sci, CH-8092 Zurich, Switzerland
关键词
clustering; multi-assignments; overlapping clusters; Boolean data; role mining; latent feature models;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a probabilistic model for clustering Boolean data where an object can be simultaneously assigned to multiple clusters. By explicitly modeling the underlying generative process that combines the individual source emissions, highly structured data are expressed with substantially fewer clusters compared to single-assignment clustering. As a consequence, such a model provides robust parameter estimators even when the number of samples is low. We extend the model with different noise processes and demonstrate that maximum-likelihood estimation with multiple assignments consistently infers source parameters more accurately than single-assignment clustering. Our model is primarily motivated by the task of role mining for role-based access control, where users of a system are assigned one or more roles. In experiments with real-world access-control data, our model exhibits better generalization performance than state-of-the-art approaches.
引用
收藏
页码:459 / 489
页数:31
相关论文
共 50 条
  • [1] Multi-assignment clustering for Boolean data
    UC Berkeley, Computer Science Division, 721 Soda Hall, Berkeley, CA 94720, United States
    不详
    不详
    [J]. J. Mach. Learn. Res., (459-489):
  • [2] Nonparametric multi-assignment clustering
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Chang, Tao-Hsing
    Jou, Tzai-Min
    [J]. INTELLIGENT DATA ANALYSIS, 2017, 21 (04) : 893 - 911
  • [3] Multi-assignment clustering: Machine learning from a biological perspective
    Ulfenborg, Benjamin
    Karlsson, Alexander
    Riveiro, Maria
    Andersson, Christian X.
    Sartipy, Peter
    Synnergren, Jane
    [J]. JOURNAL OF BIOTECHNOLOGY, 2021, 326 : 1 - 10
  • [4] THE SYNCHRONIZED MULTI-ASSIGNMENT ORIENTEERING PROBLEM
    Garcia, Christopher
    [J]. JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2023, 19 (03) : 1790 - 1812
  • [5] A semi-supervised learning algorithm for multi-label classification and multi-assignment clustering problems based on a Multivariate Data Analysis
    Gull, Carlos Quintero
    Aguilar, Jose
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [6] Multi-assignment interacting multiple model for tracking microbubbles
    Li, Bing
    Tay, Peter
    Acton, Scott T.
    [J]. 2005 39th Asilomar Conference on Signals, Systems and Computers, Vols 1 and 2, 2005, : 281 - 284
  • [7] A heuristic algorithm based on multi-assignment procedures for nurse scheduling
    Ademir Aparecido Constantino
    Dario Landa-Silva
    Everton Luiz de Melo
    Candido Ferreira Xavier de Mendonça
    Douglas Baroni Rizzato
    Wesley Romão
    [J]. Annals of Operations Research, 2014, 218 : 165 - 183
  • [8] A heuristic algorithm based on multi-assignment procedures for nurse scheduling
    Constantino, Ademir Aparecido
    Landa-Silva, Dario
    de Melo, Everton Luiz
    Xavier de Mendonca, Candido Ferreira
    Rizzato, Douglas Baroni
    Romao, Wesley
    [J]. ANNALS OF OPERATIONS RESEARCH, 2014, 218 (01) : 165 - 183
  • [9] Clustering algorithm for Boolean and categorical data
    [J]. 2001, Huazhong University of Science and Technology (29):
  • [10] Multi-Assignment Single Joins for Parallel Cross-Match of Astronomic Catalogs on Heterogeneous Clusters
    Jia, Xiaoying
    Luo, Qiong
    [J]. 28TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM) 2016), 2016,