The latent topic block model for the co-clustering of textual interaction data

被引:6
|
作者
Berge, Laurent R. [1 ,5 ]
Bouveyron, Charles [2 ,3 ]
Corneli, Marco [2 ,6 ]
Latouche, Pierre [1 ,4 ]
机构
[1] Univ Paris 05, Lab MAP5, UMR CNRS 8145, Paris, France
[2] Univ Cote dAzur, Lab JA Dieudonne, UMR CNRS 7351, Nice, France
[3] INRIA Sophia Antipolis, Epione, Valbonne, France
[4] Univ Paris 1 Pantheon Sorbonne, EA 4543, Lab SAMM, Paris, France
[5] Univ Luxembourg, 162a Ave Faiencerie, L-1511 Luxembourg, Luxembourg
[6] Off 4S813, Lab JA Dieudonne, Campus Valrose, F-06108 Nice, France
关键词
Co-clustering; Latent block model; Text matrices; Topic model; Variational inference; EM ALGORITHM; LIKELIHOOD;
D O I
10.1016/j.csda.2019.03.005
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Textual interaction data involving two disjoint sets of individuals/objects are considered. An example of such data is given by the reviews on web platforms (e.g. Amazon, TripAdvisor, etc.) where buyers comment on products/services they bought. A new generative model, the latent topic block model (LTBM), is developed along with an inference algorithm to simultaneously partition the elements of each set, accounting for the textual information. The estimation of the model parameters is performed via a variational version of the expectation maximization (EM) algorithm. A model selection criterion is formally obtained to estimate the number of partitions. Numerical experiments on simulated data are carried out to highlight the main features of the estimation procedure. Two real-world datasets are finally employed to show the usefulness of the proposed approach. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:247 / 270
页数:24
相关论文
共 50 条
  • [1] Tensor latent block model for co-clustering
    Rafika Boutalbi
    Lazhar Labiod
    Mohamed Nadif
    International Journal of Data Science and Analytics, 2020, 10 : 161 - 175
  • [2] Tensor latent block model for co-clustering
    Boutalbi, Rafika
    Labiod, Lazhar
    Nadif, Mohamed
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 10 (02) : 161 - 175
  • [3] The functional latent block model for the co-clustering of electricity consumption curves
    Bouveyron, Charles
    Bozzi, Laurent
    Jacques, Julien
    Jollois, Francois-Xavier
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2018, 67 (04) : 897 - 915
  • [4] A Deep Dynamic Latent Block Model for Co-clustering of Zero-Inflated Data Matrices
    Marchello, Giulia
    Corneli, Marco
    Bouveyron, Charles
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024,
  • [5] A Deep Dynamic Latent Block Model for the Co-Clustering of Zero-Inflated Data Matrices
    Marchello, Giulia
    Corneli, Marco
    Bouveyron, Charles
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT I, 2023, 14169 : 695 - 710
  • [6] Textual data summarization using the Self-Organized Co-Clustering model
    Selosse, Margot
    Jacques, Julien
    Biernacki, Christophe
    PATTERN RECOGNITION, 2020, 103
  • [7] Latent Dirichlet co-clustering
    Shafiei, M. Mahdi
    Milios, Evangelos E.
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 542 - +
  • [8] Co-clustering of evolving count matrices with the dynamic latent block model: application to pharmacovigilance
    Giulia Marchello
    Audrey Fresse
    Marco Corneli
    Charles Bouveyron
    Statistics and Computing, 2022, 32
  • [9] Co-clustering of evolving count matrices with the dynamic latent block model: application to pharmacovigilance
    Marchello, Giulia
    Fresse, Audrey
    Corneli, Marco
    Bouveyron, Charles
    STATISTICS AND COMPUTING, 2022, 32 (03)
  • [10] Constrained Co-Clustering for Textual Documents
    Song, Yangqiu
    Pan, Shimei
    Liu, Shixia
    Wei, Furu
    Zhou, Michelle X.
    Qian, Weihong
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 581 - 586