Clustering for time-varying relational count data

被引:0
|
作者
Goto, Satoshi [1 ]
Takagishi, Mariko [2 ]
Yadohisa, Hiroshi [3 ]
机构
[1] SoftBank Corp, Big Data Strategy Off, Tokyo, Japan
[2] Osaka Univ, Grad Sch Engn Sci, Suita, Osaka, Japan
[3] Doshisha Univ, Dept Culture & Informat Sci, Kyoto, Japan
关键词
Bayesian model; Clustering; Count data; Time-varying relational data; Zero-inflated poisson;
D O I
10.1016/j.csda.2020.107123
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Relational count data are often obtained from sources such as simultaneous purchase in online shops and social networking service information. Clustering such relational count data reveals the latent structure of the relationship between objects such as household items or people. When relational count data observed at multiple time points are available, it is worthwhile incorporating the time structure into the clustering result to understand how objects move between the clusters over time. In this paper, we propose two clustering methods for analyzing time-varying relational count data. The first model, the dynamic Poisson infinite relational model (dPIRM), handles time-varying relational count data. In the second model, which we call the dynamic zero-inflated Poisson infinite relational model, we further extend the dPIRM so that it can handle zero-inflated data. Proposing both two models is important as zero-inflated data are often encountered, especially when the time intervals are short. In addition, by explicitly deriving the relevant full conditional distributions, we describe the features of the estimated parameters and, in turn, the relationship between the two models. We show the effectiveness of both models through a simulation study and a real data example. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] On handling time-varying data in the relational data model
    Tansel, AU
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2004, 46 (02) : 119 - 126
  • [2] Multivariate functional data modeling with time-varying clustering
    Philip A. White
    Alan E. Gelfand
    [J]. TEST, 2021, 30 : 586 - 602
  • [3] Multivariate functional data modeling with time-varying clustering
    White, Philip A.
    Gelfand, Alan E.
    [J]. TEST, 2021, 30 (03) : 586 - 602
  • [4] A new spatial count data model with time-varying parameters
    Buddhavarapu, Prasad
    Bansal, Prateek
    Prozzi, Jorge A.
    [J]. TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2021, 150 : 566 - 586
  • [5] Some applications of time-varying coefficient models to count data
    Chiogna, M
    Gaetan, C
    [J]. BETWEEN DATA SCIENCE AND APPLIED DATA ANALYSIS, 2003, : 182 - 190
  • [6] A nonparametric time-varying coefficient model for panel count data
    Zhao, Huadong
    Tu, Wanzhu
    Yu, Zhangsheng
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2018, 30 (03) : 640 - 661
  • [7] A 2-STEP PROCEDURE FOR CLUSTERING TIME-VARYING DATA
    KOSMELJ, K
    [J]. JOURNAL OF MATHEMATICAL SOCIOLOGY, 1986, 12 (03): : 315 - 326
  • [8] A time-varying quadratic programming for online clustering of streaming data
    Mohammad Amin Adibi
    Jamal Shahrabi
    [J]. Pattern Analysis and Applications, 2018, 21 : 967 - 976
  • [9] A time-varying quadratic programming for online clustering of streaming data
    Adibi, Mohammad Amin
    Shahrabi, Jamal
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2018, 21 (04) : 967 - 976
  • [10] CROSS-SECTIONAL APPROACH FOR CLUSTERING TIME-VARYING DATA
    KOSMELJ, K
    BATAGELJ, V
    [J]. JOURNAL OF CLASSIFICATION, 1990, 7 (01) : 99 - 109