Multi-Assignment Clustering for Boolean Data

被引:0
|
作者
Frank, Mario [1 ]
Streich, Andreas P. [2 ]
Basin, David [3 ]
Buhmann, Joachim M. [3 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[2] Phonak AG, Adv Concepts & Technol, CH-8712 Stafa, Switzerland
[3] ETH, Dept Comp Sci, CH-8092 Zurich, Switzerland
关键词
clustering; multi-assignments; overlapping clusters; Boolean data; role mining; latent feature models;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a probabilistic model for clustering Boolean data where an object can be simultaneously assigned to multiple clusters. By explicitly modeling the underlying generative process that combines the individual source emissions, highly structured data are expressed with substantially fewer clusters compared to single-assignment clustering. As a consequence, such a model provides robust parameter estimators even when the number of samples is low. We extend the model with different noise processes and demonstrate that maximum-likelihood estimation with multiple assignments consistently infers source parameters more accurately than single-assignment clustering. Our model is primarily motivated by the task of role mining for role-based access control, where users of a system are assigned one or more roles. In experiments with real-world access-control data, our model exhibits better generalization performance than state-of-the-art approaches.
引用
收藏
页码:459 / 489
页数:31
相关论文
共 50 条
  • [21] Clustering of Symbolic Data Using the Assignment-Prototype Algorithm
    Silva, Kelly P.
    de Carvalho, Francisco A. T.
    Csernel, Marc
    [J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 1282 - +
  • [22] Data association for multi target - Multi model particle filtering: Implicit assignment to weighted assignment
    Zaveri, MA
    Desai, UB
    Merchant, SN
    [J]. 2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM), 2004, : 27 - 31
  • [23] Multi-source Data Clustering
    Li, Tiancheng
    Corchado, Juan M.
    Bajo, Javier
    Sun, Shudong
    [J]. 2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 830 - 837
  • [24] Density Peaks Clustering Based on Candidate Center and Multi Assignment Policies
    Shi, Yanli
    Bai, Luyao
    [J]. IEEE ACCESS, 2023, 11 : 57158 - 57173
  • [25] Multi-Task Assignment Strategy for Vehicular Crowdsensing with Clustering Characteristic
    Li, Fan
    Fu, Yuchuan
    Zhao, Pincan
    Liu, Sha
    Li, Changle
    [J]. 2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [26] Decomposition of Boolean Multi-Relational Data with Graded Relations
    Trnecka, Martin
    Trneckova, Marketa
    [J]. 2016 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS), 2016, : 221 - 226
  • [27] Novel Multi Stage Genetic Clustering for Multiobjective optimization in Data Clustering
    Thakare, Anuradha D.
    Dhote, C. A.
    [J]. 1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 402 - 407
  • [28] Channel Assignment for Multi-Radio Wireless Mesh Networks using Clustering
    Makram, Sadeq Ali
    Guenes, Mesut
    [J]. 2008 INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 5 - +
  • [29] Boolean autoencoders and hypercube clustering complexity
    Baldi, P.
    [J]. DESIGNS CODES AND CRYPTOGRAPHY, 2012, 65 (03) : 383 - 403
  • [30] Decompose Boolean Matrices with Correlation Clustering
    Aszalos, Laszlo
    [J]. ENTROPY, 2021, 23 (07)