Detecting hate crimes through machine learning and natural language processing

被引:0
|
作者
Salazar, Ana Ortiz [1 ]
机构
[1] Performance Analyt & Res, Seattle Police Dept, Seattle, WA USA
关键词
Hate crimes; bias; NLP; machine learning; Seattle; BIAS;
D O I
10.1080/15614263.2024.2397363
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Misidentification and misreporting of hate crimes by victims and law enforcement are significant barriers to accurate data collection of hate crimes, and their consequent study and prevention. The use of machine learning in crime detection can improve the accuracy and speed at which reported incidents with bias elements are identified. This study develops a machine learning classifier that categorizes police reports as either events with bias elements or events with no bias elements. We use incident/offense reports from the Seattle Police Department to train a Natural Language Processing classification algorithm. We collect narratives, location data, and victim and suspect demographics to use as features. We evaluate the performance of logistic regression, random forest, and XGBoost algorithms, as well as several text embedding techniques. Despite substantial class imbalance, our model achieves a macro F1-score of 0.79, demonstrating the benefits of applied machine learning in accurately detecting and reporting hate crimes.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Detecting Phishing Attacks Using Natural Language Processing and Machine Learning
    Peng, Tianrui
    Harris, Ian G.
    Sawa, Yuki
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 300 - 301
  • [2] Detecting Phishing Attacks Using Natural Language Processing And Machine Learning
    Banu, Reshma
    Anand, M.
    Kamath, Akshatha C.
    Ashika, S.
    Ujwala, H. S.
    Harshitha, S. N.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1210 - 1214
  • [3] Coding in the Liberal Arts through Natural Language Processing and Machine Learning
    Wolz, Ursula
    Wilson, Jennifer
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13506 - 13507
  • [5] Knowledgeable Machine Learning for Natural Language Processing
    Han, Xu
    Zhang, Zhengyan
    Liu, Zhiyuan
    COMMUNICATIONS OF THE ACM, 2021, 64 (11) : 50 - 51
  • [6] Machine learning in statistical natural language processing
    Mochihashi, Daichi
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2015, 69 (02): : 131 - 135
  • [7] Perceptions of Electric Vehicle Adoption Through Natural Language Processing and Machine Learning
    Araiza, Jesus Alejandro Gutierrez
    Luna, Sergio
    Santiago, Ivonne
    Akundi, Aditya
    18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,
  • [8] Automating the Assessment of Multicultural Orientation Through Machine Learning and Natural Language Processing
    Goldberg, Simon B.
    Tanana, Michael
    Stewart, Shaakira Haywood
    Williams, Camille Y.
    Soma, Christina S.
    Atkins, David C.
    Imel, Zac E.
    Owen, Jesse
    PSYCHOTHERAPY, 2024,
  • [9] Presumptive Detection of Cyberbullying on Twitter through Natural Language Processing and Machine Learning in the Spanish Language
    Leon-Paredes, Gabriel A.
    Palomeque-Leon, Wilson F.
    Gallegos-Segovia, Pablo L.
    Vintimilla-Tapia, Paul E.
    Bravo-Torres, Jack F.
    Barbosa-Santillan, Liliana, I
    Paredes-Pinos, Maria M.
    2019 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (CHILECON), 2019,
  • [10] Artificial learning companionusing machine learning and natural language processing
    R. Pugalenthi
    A Prabhu Chakkaravarthy
    J Ramya
    Samyuktha Babu
    R. Rasika Krishnan
    International Journal of Speech Technology, 2021, 24 : 553 - 560