Evaluation of Random Forest in Crime Prediction: Comparing Three-Layered Random Forest and Logistic Regression

Cited by: 9
Authors
Oh, Gyeongseok [2]
Song, Juyoung [3]
Park, Hyoungah [4]
Na, Chongmin [1]
Affiliations
[1] Seoul Natl Univ, Seoul, South Korea
[2] Korean Natl Police Univ, Asan, South Korea
[3] Penn State Univ, Schuylkill, PA USA
[4] St Peters Univ, Jersey, NJ USA
Keywords
RISK-ASSESSMENT; VIOLENCE RISK; CLASSIFICATION; RECIDIVISM; RACE
DOI
10.1080/01639625.2021.1953360
Chinese Library Classification number
DF [Law]; D9 [Law]
Discipline classification code
0301
Abstract
This study evaluated random forest's accuracy in predicting violent or criminal behavior of juveniles compared to that of conventional logistic regression using different sets of risk factors. Drawing on the National Longitudinal Study of Adolescent Health (Add Health), we predicted three outcomes - arrests, convictions, and incarcerations - using three sets of predictors, starting with sociodemographic variables only (Model 1) and incrementally adding behavioral/situational (Model 2) and emotional/environmental risk factors (Model 3). Although both prediction methods yielded similar levels of "overall" predictive accuracy (measured by the area under the receiver operating characteristic curve), our balanced random forest model, with a cost ratio of 10 (false negatives) to 1 (false positives), substantially improved prediction of who will be arrested, convicted, and incarcerated, which is of paramount importance for many researchers and practitioners. In addition to its capability to enhance sensitivity (prediction of "true positives"), random forest is more effective in forecasting juvenile criminal behavior than is conventional logistic regression in that the former is less susceptible to the influences of added predictors than is the latter.
Pages: 1036-1049
Page count: 14
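
The abstract reports that two methods can reach similar AUC while differing sharply in sensitivity once false negatives are weighted 10 to 1 against false positives. The sketch below illustrates that distinction on a small made-up set of labels and scores; it is not the authors' balanced random forest, and the data, function names, and the threshold-based treatment of the cost ratio are all assumptions chosen for illustration. It uses the standard decision rule that, for calibrated probabilities, a case is flagged when its predicted probability exceeds cost_fp / (cost_fp + cost_fn).

```python
def auc(labels, scores):
    """Rank-based AUC: chance that a random positive outscores a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def sensitivity(labels, scores, threshold):
    """Share of true positives flagged at the given threshold."""
    flagged = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    return flagged / sum(labels)


def specificity(labels, scores, threshold):
    """Share of true negatives left unflagged at the given threshold."""
    cleared = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    return cleared / (len(labels) - sum(labels))


def cost_threshold(cost_fn, cost_fp):
    """Probability cutoff that minimizes expected cost for calibrated scores."""
    return cost_fp / (cost_fp + cost_fn)


# Hypothetical data: 1 = arrested, 0 = not arrested; scores are predicted probabilities.
labels = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
scores = [0.70, 0.40, 0.20, 0.12, 0.30, 0.15, 0.10, 0.08, 0.05, 0.05, 0.03, 0.02]

default_cut = 0.5
weighted_cut = cost_threshold(cost_fn=10, cost_fp=1)  # 1/11

print("AUC (threshold-free):", auc(labels, scores))
for name, cut in [("default 0.5", default_cut), ("10:1 cost ratio", weighted_cut)]:
    print(name,
          "sensitivity:", sensitivity(labels, scores, cut),
          "specificity:", specificity(labels, scores, cut))
```

The AUC is computed once from the ranking of scores and does not change with the cutoff, whereas moving the cutoff from 0.5 to 1/11 raises sensitivity at the expense of specificity. This is one way to see why a comparison on AUC alone can hide the gain in true-positive prediction that the abstract emphasizes.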