Evaluation of Random Forest in Crime Prediction: Comparing Three-Layered Random Forest and Logistic Regression

被引:9
|
作者
Oh, Gyeongseok [2 ]
Song, Juyoung [3 ]
Park, Hyoungah [4 ]
Na, Chongmin [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Korean Natl Police Univ, Asan, South Korea
[3] Penn State Univ, Schuylkill, PA USA
[4] St Peters Univ, Jersey, NJ USA
关键词
RISK-ASSESSMENT; VIOLENCE RISK; CLASSIFICATION; RECIDIVISM; RACE;
D O I
10.1080/01639625.2021.1953360
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
This study evaluated random forest's accuracy in predicting violent or criminal behavior of juveniles compared to that of conventional logistic regression using different sets of risk factors. Drawing on the National Longitudinal Study of Adolescent Health (Add Health), we predicted three outcomes - arrests, convictions, and incarcerations - using three sets of predictors, starting with sociodemographic variables only (Model 1) and incrementally adding behavioral/situational (Model 2) and emotional/environmental risk factors (Model 3). Although both prediction methods yielded similar levels of "overall" predictive accuracy (measured by the area under the receiver operating characteristic curve), our balanced random forest model, with a cost ratio of 10 (false negatives) to 1 (false positives), substantially improved prediction of who will be arrested, convicted, and incarcerated, which is of paramount importance for many researchers and practitioners. In addition to its capability to enhance sensitivity (prediction of "true positives"), random forest is more effective in forecasting juvenile criminal behavior than is conventional logistic regression in that the former is less susceptible to the influences of added predictors than is the latter.
引用
收藏
页码:1036 / 1049
页数:14
相关论文
共 50 条
  • [1] Crime Prediction Model using Three Classification Techniques: Random Forest, Logistic Regression, and LightGBM
    Alsubayhin, Abdulrahman
    Ramzan, Muhammad Sher
    Alzahrani, Bander
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 240 - 251
  • [2] Prediction of unsuccessful endometrial ablation: random forest vs logistic regression
    Stevens, Kelly Yvonne Roger
    Lagaert, Liesbet
    Bakkes, Tom
    Gelderblom, Malou Evi
    Houterman, Saskia
    Gijsen, Tanja
    Schoot, Benedictus C.
    [J]. GYNECOLOGICAL SURGERY, 2021, 18 (01)
  • [3] Logistic Regression and Random Forest for Effective Imbalanced Classification
    Luo, Hanwu
    Pan, Xiubao
    Wang, Qingshun
    Ye, Shasha
    Qian, Ying
    [J]. 2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2019, : 916 - 917
  • [4] Comparison of Accuracy Rate in Prediction of Cardiovascular Disease using Random Forest with Logistic Regression
    Vishnuvardhan, Talluri
    Rama, A.
    [J]. CARDIOMETRY, 2022, (25): : 1526 - 1531
  • [5] Random Forest and Logistic Regression algorithms for prediction of groundwater contamination using ammonia concentration
    Ahmed Madani
    Mohammed Hagage
    Salwa F. Elbeih
    [J]. Arabian Journal of Geosciences, 2022, 15 (20)
  • [6] Comparing the Efficiency of Heart Disease Prediction using Novel Random Forest, Logistic Regression and Decision Tree And SVM Algorithms
    Teja, P. Prasanna Sai
    Veeramani, T.
    [J]. CARDIOMETRY, 2022, (25): : 1491 - 1499
  • [7] Classification and Prediction of Heart Disease using Novel Random Forest Algorithm by Comparing Logistic Regression for Obtaining Better Accuracy
    Poojitha, T.
    Mahaveerakannan, R.
    [J]. CARDIOMETRY, 2022, (25): : 1538 - 1545
  • [8] Comparative Performance Analysis of Random Forest and Logistic Regression Algorithms
    Malkocoglu, Ayse Berika Varol
    Malkocoglu, Sevki Utku
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 25 - 30
  • [9] Approximating Prediction Uncertainty for Random Forest Regression Models
    Coulston, John W.
    Blinn, Christine E.
    Thomas, Valerie A.
    Wynne, Randolph H.
    [J]. PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2016, 82 (03): : 189 - 197
  • [10] Heart Disease Prediction Based on Age Detection Using Logistic Regression over Random Forest
    Karthi, C. B. M.
    Kalaivani, A.
    [J]. CARDIOMETRY, 2022, (25): : 1731 - 1737