Regularized bi-directional co-clustering

被引:0
|
作者
Séverine Affeldt
Lazhar Labiod
Mohamed Nadif
机构
[1] Université de Paris,
[2] CNRS,undefined
[3] Centre Borelli,undefined
来源
Statistics and Computing | 2021年 / 31卷
关键词
Co-clustering; Regularization; Information retrieval; Text mining;
D O I
暂无
中图分类号
学科分类号
摘要
The simultaneous clustering of documents and words, known as co-clustering, has proved to be more effective than one-sided clustering in dealing with sparse high-dimensional datasets. By their nature, text data are also generally unbalanced and directional. Recently, the von Mises–Fisher (vMF) mixture model was proposed to handle unbalanced data while harnessing the directional nature of text. In this paper, we propose a general co-clustering framework based on a matrix formulation of vMF model-based co-clustering. This formulation leads to a flexible framework for text co-clustering that can easily incorporate both word–word semantic relationships and document–document similarities. By contrast with existing methods, which generally use an additive incorporation of similarities, we propose a bi-directional multiplicative regularization that better encapsulates the underlying text data structure. Extensive evaluations on various real-world text datasets demonstrate the superior performance of our proposed approach over baseline and competitive methods, both in terms of clustering results and co-cluster topic coherence.
引用
收藏
相关论文
共 50 条
  • [41] Methods for co-clustering: a review
    Brault, Vincent
    Lomet, Aurore
    JOURNAL OF THE SFDS, 2015, 156 (03): : 27 - 51
  • [42] Constrained Dual Graph Regularized Orthogonal Nonnegative Matrix Tri-Factorization for Co-Clustering
    Ge, Shaodi
    Li, Hongjun
    Luo, Liuhong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [43] Bi-directional characteristics of leaf reflectance and transmittance: Measurement and influence on canopy bi-directional reflectance
    Sanz, C
    Espana, M
    Baret, F
    Weiss, M
    Vaillant, L
    Hanocq, JF
    Sarrouy, C
    Clastre, P
    Bruguier, N
    Chelle, M
    Andrieu, B
    Zurfluh, O
    PHYSICAL MEASUREMENTS AND SIGNATURES IN REMOTE SENSING, VOLS 1 AND 2, 1997, : 583 - 590
  • [44] CO-CLUSTERING OF NONSMOOTH GRAPHONS
    Choi, David
    ANNALS OF STATISTICS, 2017, 45 (04): : 1488 - 1515
  • [45] Image and feature co-clustering
    Qiu, GP
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 991 - 994
  • [46] Scalable Co-clustering Algorithms
    Kwon, Bongjune
    Cho, Hyuk
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 1, PROCEEDINGS, 2010, 6081 : 32 - +
  • [47] Co-clustering by similarity refinement
    Zhang, Jian
    ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 381 - 386
  • [48] Co-clustering with augmented matrix
    Meng-Lun Wu
    Chia-Hui Chang
    Rui-Zhe Liu
    Applied Intelligence, 2013, 39 : 153 - 164
  • [49] Temporal relation co-clustering on directional social network and author-topic evolution
    Wei Peng
    Tao Li
    Knowledge and Information Systems, 2011, 26 : 467 - 486
  • [50] Temporal relation co-clustering on directional social network and author-topic evolution
    Peng, Wei
    Li, Tao
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 26 (03) : 467 - 486