Non-redundant data clustering

Authors
David Gondek
Thomas Hofmann
Affiliation
[1] Brown University, Department of Computer Science
Keywords
Non-redundant clustering; Exploratory data mining; Information bottleneck
DOI
Not available
Abstract
Data clustering is a popular approach for automatically finding classes, concepts, or groups of patterns. In practice, this discovery process should avoid redundancies with existing knowledge about class structures or groupings, and reveal novel, previously unknown aspects of the data. In order to deal with this problem, we present an extension of the information bottleneck framework, called coordinated conditional information bottleneck, which takes negative relevance information into account by maximizing a conditional mutual information score subject to constraints. Algorithmically, one can apply an alternating optimization scheme that can be used in conjunction with different types of numeric and non-numeric attributes. We discuss extensions of the technique to the tasks of semi-supervised classification and enumeration of successive non-redundant clusterings. We present experimental results for applications in text mining and computer vision.
Pages: 1 / 24
Page count: 23