Modeling E-mail Networks and Inferring Leadership Using Self-Exciting Point Processes

被引:57
|
作者
Fox, Eric W. [1 ]
Short, Martin B. [1 ]
Schoenberg, Frederic P. [1 ]
Coronges, Kathryn D. [1 ]
Bertozzi, Andrea L. [1 ]
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
关键词
Conditional intensity; Enron E-mail dataset; Hawkes process; IkeNet dataset; Social networks; HEAVY TAILS;
D O I
10.1080/01621459.2015.1135802
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose various self-exciting point process models for the times when e-mails are sent between individuals in a social network. Using an expectation maximization (EM)-type approach, we fit these models to an e-mail network dataset from West Point Military Academy and the Enron e-mail dataset. We argue that the self-exciting models adequately capture major temporal clustering features in the data and perform better than traditional stationary Poisson models. We also investigate how accounting for diurnal and weekly trends in e-mail activity improves the overall fit to the observed network data. A motivation and application forfitting these self-exciting models is to use parameter estimates to characterize important e-mail communication behaviors such as the baseline sending rates, average reply rates, and average response times. A primary goal is to use these features, estimated from the self-exciting models, to infer the underlying leadership status of users in the West Point and Enron networks. Supplementary materials for this article are available online.
引用
收藏
页码:564 / 584
页数:21
相关论文
共 50 条