Mining large-scale human mobility data for long-term crime prediction

Authors
Cristina Kadar
Irena Pletikosa
Affiliations
[1] ETH Zurich,D
Source
Keywords
Crime prediction; Urban computing; Spatio-temporal data; Human mobility; Location-based social networks; Applied machine learning;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Traditional crime prediction models based on census data are limited, as they fail to capture the complexity and dynamics of human activity. With the rise of ubiquitous computing, there is the opportunity to improve such models with data that make for better proxies of human presence in cities. In this paper, we leverage large human mobility data to craft an extensive set of features for crime prediction, as informed by theories in criminology and urban studies. We employ averaging and boosting ensemble techniques from machine learning, to investigate their power in predicting yearly counts for different types of crimes occurring in New York City at census tract level. Our study shows that spatial and spatio-temporal features derived from Foursquare venues and checkins, subway rides, and taxi rides, improve the baseline models relying on census and POI data. The proposed models achieve absolute $R^{2}$ metrics of up to 65% (on a geographical out-of-sample test set) and up to 89% (on a temporal out-of-sample test set). This proves that, next to the residential population of an area, the ambient population there is strongly predictive of the area’s crime levels. We deep-dive into the main crime categories, and find that the predictive gain of the human dynamics features varies across crime types: such features bring the biggest boost in case of grand larcenies, whereas assaults are already well predicted by the census features. Furthermore, we identify and discuss top predictive features for the main crime categories. These results offer valuable insights for those responsible for urban policy or law enforcement.
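The abstract reports model quality as $R^{2}$ on held-out test sets. As a minimal sketch of how such a score could be computed, the snippet below implements the standard coefficient of determination in plain Python. The function name `r2_score` and the toy counts are hypothetical illustrations, not the authors' code or data.

```python
from statistics import fmean


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = fmean(y_true)
    # Residual sum of squares: error of the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: error of always predicting the mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all observed values are equal")
    return 1.0 - ss_res / ss_tot


# Hypothetical yearly crime counts for five held-out census tracts
# (made-up numbers for illustration only).
observed = [10, 20, 30, 40, 50]
predicted = [12, 18, 33, 37, 50]
score = r2_score(observed, predicted)
```

A score of 1 means the predictions match the observed counts exactly, and a score of 0 means the model does no better than predicting the mean count for every tract. Whether the held-out tracts come from a geographical or a temporal split changes only which rows are passed in, not the formula.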
Related papers
50 in total
  • [1] Mining large-scale human mobility data for long-term crime prediction
    Kadar, Cristina
    Pletikosa, Irena
    [J]. EPJ DATA SCIENCE, 2018, 7
  • [2] Takeaways in Large-scale Human Mobility Data Mining
    Chen, Guangshuo
    Viana, Aline Carneiro
    Fiore, Marco
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON LOCAL AND METROPOLITAN AREA NETWORKS (LANMAN), 2018, : 55 - 60
  • [3] Data analysis toolkit for long-term, large-scale experiments
    Bennett, D. P.
    Cuss, R. J.
    Vardon, P. J.
    Harrington, J. F.
    Philp, R. N.
    Thomas, H. R.
    [J]. MINERALOGICAL MAGAZINE, 2012, 76 (08) : 3355 - 3364
  • [4] LIABILITY AND LARGE-SCALE, LONG-TERM HAZARDS
    RINGLEB, AH
    WIGGINS, SN
    [J]. JOURNAL OF POLITICAL ECONOMY, 1990, 98 (03) : 574 - 595
  • [5] Monitoring and long-term prediction of refuse compositions and settlement in large-scale landfill
    Zhao, YC
    Chen, ZG
    Shi, QG
    Huang, RH
    [J]. WASTE MANAGEMENT & RESEARCH, 2001, 19 (02) : 160 - 168
  • [6] USE OF A COMPUTER FOR DATA MANAGEMENT IN LARGE-SCALE LONG-TERM COOPERATIVE STUDIES
    RAMSHAW, WA
    LATVIS, VF
    COLLINS, DD
    FEINSTEIN, AR
    [J]. JOURNAL OF CHRONIC DISEASES, 1973, 26 (04) : 201 - 217
  • [7] Long-term dynamics of the large-scale magnetic structures
    Ambroz, P
    [J]. SOLAR PHYSICS, 2004, 224 (01) : 61 - 68
  • [8] LONG-TERM FORECASTING AND PROBLEM OF LARGE-SCALE WARS
    STEFFLRE, V
    [J]. FUTURES, 1974, 6 (04) : 302 - 308
  • [9] LONG-TERM LARGE-SCALE CLINICAL EVALUATION OF INDOMETHACIN
    ENGLUND, DW
    [J]. ARTHRITIS AND RHEUMATISM, 1966, 9 (03) : 502 - &
  • [10] Long-Term Dynamics of the Large-Scale Magnetic Structures
    P. Ambrož
    [J]. Solar Physics, 2004, 224 : 61 - 68