Human mobility forecasting with region-based flows and geotagged Twitter data

被引:10
|
作者
Terroso-Saenz, Fernando [1 ]
Flores, Raul [1 ]
Munoz, Andres [1 ]
机构
[1] Univ Catol Murcia UCAM, Polytech Sch, Murcia, Spain
关键词
Human mobility; Machine learning; Prediction model; Online social network; Twitter; NEURAL-NETWORK; PREDICTION;
D O I
10.1016/j.eswa.2022.117477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the main lines of research in the discipline of mobility mining is the development of predictors able to anticipate human travel behaviour in great detail. However, access to the high-resolution spatio-temporal data on which most existing solutions are based is rather limited due to multiple factors, e.g. costly access to third-party data. These restrictions give rise to a problem of developing predictors of human mobility in most setting, since the amount of data available to train these prediction models is insufficient. This paper explores the feasibility of using a public data source such as Twitter to predict the number of trips at the nationwide level. The proposed approach combines a large set of geotagged Twitter posts with an open data source published by the Spanish government on traveller mobility based on mobile phone location. Both datasets are used as input to Machine Learning models to validate the use of Twitter data for improving the prediction of these models. The results show that Twitter data have considerable value as a predictor of large-scale human mobility, especially for Long Short-Term Memory (LSTM) models. As a result, the relevance of this work resides in demonstrating that the use of Twitter could be considered as an alternative to substantially enhance the prediction of mobility within a country when it is combined with other open data sources.
引用
收藏
页数:15
相关论文
共 50 条
  • [11] DATA INTEGRATION THROUGH REGION-BASED NOMINAL FILTERING
    FOSNIGHT, EA
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SYSTEMS, 1992, 6 (06): : 469 - 478
  • [12] A Drift Region-Based Data Sample Filtering Method
    Dong, Fan
    Lu, Jie
    Song, Yiliao
    Liu, Feng
    Zhang, Guangquan
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9377 - 9390
  • [13] Circumferential profiles for region-based analysis of dynamic SPECT data
    DiBella, EVR
    Gullberg, GT
    Barclay, AB
    Eisner, RL
    1996 IEEE NUCLEAR SCIENCE SYMPOSIUM - CONFERENCE RECORD, VOLS 1-3, 1997, : 1608 - 1612
  • [14] A Region-based Training Data Segmentation Strategy to Credit Scoring
    Saia, Roberto
    Carta, Salvatore
    Fenu, Gianni
    Pompianu, Livio
    SECRYPT : PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, 2022, : 275 - 282
  • [15] Region-based Querying of Solar Data Using Descriptor Signatures
    Banda, Juan M.
    Liu, Chang
    Angryk, Rafal A.
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 1 - 7
  • [16] A region-based learning approach to discovering temporal structures in data
    Wei, Z
    MACHINE LEARNING, PROCEEDINGS, 1999, : 484 - 492
  • [17] Unsupervised Classification of Spectropolarimetric Data by Region-Based Evidence Fusion
    Zhao, Yongqiang
    Zhang, Guohua
    Jie, Feiran
    Gao, Shibo
    Chen, Chao
    Pan, Quan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2011, 8 (04) : 755 - 759
  • [18] Region-based association tests for sequencing data on survival traits
    Chien, Li-Chu
    Bowden, Donald W.
    Chiu, Yen-Feng
    GENETIC EPIDEMIOLOGY, 2017, 41 (06) : 511 - 522
  • [19] Region-based geometric modelling of human airways and arterial vessels
    Ding, Songlin
    Ye, Yong
    Tu, Jiyuan
    Subic, Aleksandar
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2010, 34 (02) : 114 - 121
  • [20] Region-based geometric modeling of human airways and arterial vessels
    Ding, Songlin
    Ye, Yong
    Tu, Jiyuan
    Subic, Aleks
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 541 - 545