A probabilistic approach to mining mobile phone data sequences

被引:0
|
作者
Katayoun Farrahi
Daniel Gatica-Perez
机构
[1] JKU University Linz,
[2] Idiap Research Institute,undefined
[3] EPFL,undefined
来源
关键词
Mobile Phone; Topic Model; Latent Dirichlet Allocation; Unseen Data; Mobile Phone Data;
D O I
暂无
中图分类号
学科分类号
摘要
We present a new approach to address the problem of large sequence mining from big data. The particular problem of interest is the effective mining of long sequences from large-scale location data to be practical for Reality Mining applications, which suffer from large amounts of noise and lack of ground truth. To address this complex data, we propose an unsupervised probabilistic topic model called the distant n-gram topic model (DNTM). The DNTM is based on latent Dirichlet allocation (LDA), which is extended to integrate sequential information. We define the generative process for the model, derive the inference procedure, and evaluate our model on both synthetic data and real mobile phone data. We consider two different mobile phone datasets containing natural human mobility patterns obtained by location sensing, the first considering GPS/wi-fi locations and the second considering cell tower connections. The DNTM discovers meaningful topics on the synthetic data as well as the two mobile phone datasets. Finally, the DNTM is compared to LDA by considering log-likelihood performance on unseen data, showing the predictive power of the model. The results show that the DNTM consistently outperforms LDA as the sequence length increases.
引用
收藏
页码:223 / 238
页数:15
相关论文
共 50 条
  • [1] A probabilistic approach to mining mobile phone data sequences
    Farrahi, Katayoun
    Gatica-Perez, Daniel
    [J]. PERSONAL AND UBIQUITOUS COMPUTING, 2014, 18 (01) : 223 - 238
  • [2] Probabilistic Mining of Socio-Geographic Routines From Mobile Phone Data
    Farrahi, Katayoun
    Gatica-Perez, Daniel
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (04) : 746 - 755
  • [3] Spatio-Temporal Routine Mining on Mobile Phone Data
    Qin, Tian
    Shangguan, Wufan
    Song, Guojie
    Tang, Jie
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (05)
  • [4] Mining Mobile Phone Data to Investigate Urban Crime Theories at Scale
    Traunmueller, Martin
    Quattrone, Giovanni
    Capra, Licia
    [J]. SOCIAL INFORMATICS, SOCINFO 2014, 2014, 8851 : 396 - 411
  • [5] Research on Cell Phone Photograph Data Mining in Mobile Electronic Commerce
    Gong, Songjie
    Wang, Ya
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 5801 - 5804
  • [6] Mining Probabilistic Representative Gathering Patterns for Mobile Sensor Data
    Wang, Shuang
    Yi, Hai
    Wu, Lina
    Zhou, Fucai
    Xiong, Neal N.
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2017, 18 (02): : 321 - 332
  • [7] Transportation Modes Identification from Mobile Phone Data Using Probabilistic Models
    Xu, Dafeng
    Song, Guojie
    Gao, Peng
    Cao, Rongzeng
    Nie, Xinwei
    Xie, Kunqing
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PT II, 2011, 7121 : 359 - +
  • [8] Probabilistic positioning in mobile phone network and its consequences for the privacy of mobility data
    Ogulenko, Aleksey
    Benenson, Itzhak
    Omer, Itzhak
    Alon, Barak
    [J]. COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2021, 85
  • [9] Colocation Mining in Uncertain Data Sets: A Probabilistic Approach
    Sheshikala, M.
    Rao, D. Rajeswara
    Kadampur, Md. Ali
    [J]. WORLD CONGRESS ON ENGINEERING, WCE 2015, VOL I, 2015, : 312 - 319
  • [10] Mining Mobile Phone Messages in Mobile Social Network
    Cui, Wen
    [J]. ADVANCED MATERIALS AND ENGINEERING MATERIALS, PTS 1 AND 2, 2012, 457-458 : 130 - 133