A supervised machine learning framework with combined blocking for detecting serial crimes

被引:0
|
作者
Yusheng Li
Xueyan Shao
机构
[1] Chinese Academy of Sciences,Institutes of Science and Development
[2] University of Chinese Academy of Sciences,School of Public Policy and Management
来源
Applied Intelligence | 2022年 / 52卷
关键词
Serial crime detection; Classification; Pairwise calculation; Blocking; Class imbalance;
D O I
暂无
中图分类号
学科分类号
摘要
Detecting serial crimes is to find criminals who have committed multiple crimes. A classification technique is often used to process serial crime detection, but the pairwise comparison of crimes is of quadratic complexity, and the number of nonserial case pairs far exceeds the number of serial case pairs. The blocking method can play a role in reducing pairwise calculation and eliminating nonserial case pairs. But the limitation of previous studies is that most of them use a single criterion to select blocks, which is difficult to guarantee an excellent blocking result. Some studies integrate multiple criteria into one comprehensive index. However, the performance is easily affected by the weighting method. In this paper, we propose a combined blocking (CB) approach. Each criminal behaviour is defined as a behaviour key (BHK) and used to form a block. CB learns several weak blocking schemes by different blocking criteria and then combines them to form the final blocking scheme. The final blocking scheme consists of several BHKs. Because rare behaviour can better identify crime series, each BHK is assigned a score according to its rarity. BHKs and their scores are used to determine whether a case pair need to be compared. After comparing with multiple blocking methods, CB can effectively guarantee the number of serial case pairs while greatly reducing unnecessary nonserial case pairs. The CB is embedded in a supervised machine learning framework. Experiments on real-world robbery cases demonstrate that it can effectively reduce pairwise comparison, alleviate the class imbalance problem and improve detection performance.
引用
收藏
页码:11517 / 11538
页数:21
相关论文
共 50 条
  • [41] Detecting ham and spam emails using feature union and supervised machine learning models
    Rustam, Furqan
    Saher, Najia
    Mehmood, Arif
    Lee, Ernesto
    Washington, Sandrilla
    Ashraf, Imran
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26545 - 26561
  • [42] Domain Supervised Deep Learning Framework for Detecting Chinese Diabetes-Related Topics
    Chen, Xinhuan
    Zhang, Yong
    Zhao, Kangzhi
    Hu, Qingcheng
    Xing, Chunxiao
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2018), PT II, 2018, 10828 : 53 - 71
  • [43] Pseudo-label based semi-supervised learning in the distributed machine learning framework
    王晓曦
    WU Wenjun
    YANG Feng
    SI Pengbo
    ZHANG Xuanyi
    ZHANG Yanhua
    [J]. High Technology Letters, 2022, 28 (02) : 172 - 180
  • [44] ALGORITHMS FOR SUPERVISED MACHINE LEARNING- BASED STRUCTURAL PERFORMANCE EVALUATION FRAMEWORK
    Wang, Xiaowei
    Heo, YeongAe
    [J]. PROCEEDINGS OF THE ASME 39TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, OMAE2020, VOL 2A, 2020,
  • [45] A Systematic Approach of Dataset definition for a Supervised Machine Learning using NFR Framework
    Marinho, Matheus
    Arruda, Danilo
    Wanderley, Fernando
    Lins, Anthony
    [J]. 2018 11TH INTERNATIONAL CONFERENCE ON THE QUALITY OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (QUATIC), 2018, : 110 - 118
  • [46] UCSL : A Machine Learning Expectation-Maximization Framework for Unsupervised Clustering Driven by Supervised Learning
    Louiset, Robin
    Gori, Pietro
    Dufumier, Benoit
    Houenou, Josselin
    Grigis, Antoine
    Duchesnay, Edouard
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 755 - 771
  • [47] Pseudo-label based semi-supervised learning in the distributed machine learning framework
    Wang, Xiaoxi
    Wu, Wenjun
    Yang, Feng
    Si, Pengbo
    Zhang, Xuanyi
    Zhang, Yanhua
    [J]. High Technology Letters, 2022, 28 (02): : 172 - 180
  • [48] An Ensemble-based Supervised Machine Learning Framework for Android Ransomware Detection
    Sharma, Shweta
    Challa, Rama Krishna
    Kumar, Rakesh
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (3A) : 422 - 429
  • [49] A semi-supervised interpretable machine learning framework for sensor fault detection
    Martakis, Panagiotis
    Movsessian, Artur
    Reuland, Yves
    Pai, Sai G. S.
    Quqa, Said
    Cava, David Garcia
    Tcherniak, Dmitri
    Chatzi, Eleni
    [J]. SMART STRUCTURES AND SYSTEMS, 2022, 29 (01) : 251 - 266
  • [50] Supervised Machine Learning Tools and PUF Based Internet of Vehicles Authentication Framework
    Sadhu, Pintu Kumar
    Eickholt, Jesse
    Yanambaka, Venkata P.
    Abdelgawad, Ahmed
    [J]. ELECTRONICS, 2022, 11 (23)