Constrained Co-Clustering for Textual Documents

被引:0
|
作者
Song, Yangqiu [1 ]
Pan, Shimei [2 ]
Liu, Shixia [1 ]
Wei, Furu [1 ]
Zhou, Michelle X. [3 ]
Qian, Weihong [1 ]
机构
[1] IBM Res China, Beijing, Peoples R China
[2] IBM Res TJ Watson Ctr, Hawthorne, NY USA
[3] IBM Res Almaden Ctr, San Jose, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a constrained co-clustering approach for clustering textual documents. Our approach combines the benefits of information-theoretic co-clustering and constrained clustering. We use a two-sided hidden Markov random field (HMRF) to model both the document and word constraints. We also develop an alternating expectation maximization (EM) algorithm to optimize the constrained co-clustering model. We have conducted two sets of experiments on a benchmark data set: (1) using human-provided category labels to derive document and word constraints for semi-supervised document clustering, and (2) using automatically extracted named entities to derive document constraints for unsupervised document clustering. Compared to several representative constrained clustering and co-clustering approaches, our approach is shown to be more effective for high-dimensional, sparse text data.
引用
收藏
页码:581 / 586
页数:6
相关论文
共 50 条
  • [31] CO-CLUSTERING OF NONSMOOTH GRAPHONS
    Choi, David
    ANNALS OF STATISTICS, 2017, 45 (04): : 1488 - 1515
  • [32] Co-clustering by similarity refinement
    Zhang, Jian
    ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 381 - 386
  • [33] Image and feature co-clustering
    Qiu, GP
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 991 - 994
  • [34] Scalable Co-clustering Algorithms
    Kwon, Bongjune
    Cho, Hyuk
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 1, PROCEEDINGS, 2010, 6081 : 32 - +
  • [35] Co-clustering with augmented matrix
    Meng-Lun Wu
    Chia-Hui Chang
    Rui-Zhe Liu
    Applied Intelligence, 2013, 39 : 153 - 164
  • [36] Co-Adjustment Learning for Co-Clustering
    Ji Zhang
    Hongjun Wang
    Shudong Huang
    Tianrun Li
    Peng Jin
    Ping Deng
    Qigang Zhao
    Cognitive Computation, 2021, 13 : 504 - 517
  • [37] Semi-supervised sparse neighbor constrained co-clustering with dissimilarityand similarity regularization
    Li, Xiangli
    Lu, Xiyan
    Fan, Xuezhen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [38] Co-Adjustment Learning for Co-Clustering
    Zhang, Ji
    Wang, Hongjun
    Huang, Shudong
    Li, Tianrun
    Jin, Peng
    Deng, Ping
    Zhao, Qigang
    COGNITIVE COMPUTATION, 2021, 13 (02) : 504 - 517
  • [39] Feature co-shrinking for co-clustering
    Tan, Qi
    Yang, Pei
    He, Jingrui
    PATTERN RECOGNITION, 2018, 77 : 12 - 19