Interval-censored data solely records the aggregated counts of events during specific time intervals - such as the number of patients admitted to the hospital or the volume of vehicles passing traffic loop detectors - and not the exact occurrence time of the events. It is currently not understood how to fit the Hawkes point processes to this kind of data. Its typical loss function (the point process log-likelihood) cannot be computed without exact event times. Furthermore, it does not have the independent increments property to use the Poisson likelihood. This work builds a novel point process, a set of tools, and approximations for fitting Hawkes processes within interval-censored data scenarios. First, we define the Mean Behavior Poisson process (MBPP), a novel Poisson process with a direct parameter correspondence to the popular self-exciting Hawkes process. We fit MBPP in the interval-censored setting using an interval-censored Poisson log-likelihood (IC-LL). We use the parameter equivalence to uncover the parameters of the associated Hawkes process. Second, we introduce two novel exogenous functions to distinguish the exogenous from the endogenous events. We propose the multi-impulse exogenous function - for when the exogenous events are observed as event time - and the latent homogeneous Poisson process exogenous function - for when the exogenous events are presented as interval-censored volumes. Third, we provide several approximation methods to estimate the intensity and compensator function of MBPP when no analytical solution exists. Fourth and finally, we connect the interval-censored loss of MBPP to a broader class of Bregman divergence-based functions. Using the connection, we show that the popularity estimation algorithm Hawkes Intensity Process (HIP) (Rizoiu et al., 2017b) is a particular case of the MBPP. We verify our models through empirical testing on synthetic data and real-world data. We find that our MBPP outperforms HIP on real-world datasets for the task of popularity prediction. This work makes it possible to efficiently fit the Hawkes process to interval-censored data.
机构:
School of Mathematics and Information Science,Jiangxi Normal UniversitySchool of Mathematics and Information Science,Jiangxi Normal University
Wen Li DENG
Zu Kang ZHENG
论文数: 0引用数: 0
h-index: 0
机构:
Department of Statistics,School of Management,Fudan UniversitySchool of Mathematics and Information Science,Jiangxi Normal University
Zu Kang ZHENG
Ri Quan ZHANG
论文数: 0引用数: 0
h-index: 0
机构:
Department of Statistics,School of Finance and Statistics,East China Normal UniversitySchool of Mathematics and Information Science,Jiangxi Normal University
机构:
Jiangxi Normal Univ, Sch Math & Informat Sci, Nanchang 330022, Peoples R ChinaJiangxi Normal Univ, Sch Math & Informat Sci, Nanchang 330022, Peoples R China
Deng, Wen Li
Zheng, Zu Kang
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Dept Stat, Sch Management, Shanghai 200433, Peoples R ChinaJiangxi Normal Univ, Sch Math & Informat Sci, Nanchang 330022, Peoples R China
Zheng, Zu Kang
Zhang, Ri Quan
论文数: 0引用数: 0
h-index: 0
机构:
E China Normal Univ, Sch Finance & Stat, Dept Stat, Shanghai 200062, Peoples R ChinaJiangxi Normal Univ, Sch Math & Informat Sci, Nanchang 330022, Peoples R China
机构:
Sungshin Womens Univ, Sch Math Stat & Data Sci, Seoul, South Korea
Sungshin Womens Univ, Data Sci Ctr, Seoul, South KoreaSungshin Womens Univ, Sch Math Stat & Data Sci, Seoul, South Korea
Choi, Taehwa
Park, Seohyeon
论文数: 0引用数: 0
h-index: 0
机构:
Korea Univ, Dept Stat, Seoul 02841, South KoreaSungshin Womens Univ, Sch Math Stat & Data Sci, Seoul, South Korea
Park, Seohyeon
Cho, Hunyong
论文数: 0引用数: 0
h-index: 0
机构:
Amazon Com, Seattle, WA 98109 USA
Duke Univ, Dept Stat Sci, Durham, NC USASungshin Womens Univ, Sch Math Stat & Data Sci, Seoul, South Korea
Cho, Hunyong
Choi, Sangbum
论文数: 0引用数: 0
h-index: 0
机构:
Korea Univ, Dept Stat, Seoul 02841, South KoreaSungshin Womens Univ, Sch Math Stat & Data Sci, Seoul, South Korea