A supervised machine learning framework with combined blocking for detecting serial crimes

被引：0

作者：

Yusheng Li

Xueyan Shao

机构：

[1] Chinese Academy of Sciences,Institutes of Science and Development

[2] University of Chinese Academy of Sciences,School of Public Policy and Management

来源：

Applied Intelligence | 2022年 / 52卷

关键词：

Serial crime detection; Classification; Pairwise calculation; Blocking; Class imbalance;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Detecting serial crimes is to find criminals who have committed multiple crimes. A classification technique is often used to process serial crime detection, but the pairwise comparison of crimes is of quadratic complexity, and the number of nonserial case pairs far exceeds the number of serial case pairs. The blocking method can play a role in reducing pairwise calculation and eliminating nonserial case pairs. But the limitation of previous studies is that most of them use a single criterion to select blocks, which is difficult to guarantee an excellent blocking result. Some studies integrate multiple criteria into one comprehensive index. However, the performance is easily affected by the weighting method. In this paper, we propose a combined blocking (CB) approach. Each criminal behaviour is defined as a behaviour key (BHK) and used to form a block. CB learns several weak blocking schemes by different blocking criteria and then combines them to form the final blocking scheme. The final blocking scheme consists of several BHKs. Because rare behaviour can better identify crime series, each BHK is assigned a score according to its rarity. BHKs and their scores are used to determine whether a case pair need to be compared. After comparing with multiple blocking methods, CB can effectively guarantee the number of serial case pairs while greatly reducing unnecessary nonserial case pairs. The CB is embedded in a supervised machine learning framework. Experiments on real-world robbery cases demonstrate that it can effectively reduce pairwise comparison, alleviate the class imbalance problem and improve detection performance.

引用

页码：11517 / 11538

页数：21

共 50 条

[41] Detecting ham and spam emails using feature union and supervised machine learning models
Rustam, Furqan
Saher, Najia
Mehmood, Arif
Lee, Ernesto
Washington, Sandrilla
Ashraf, Imran
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26545 - 26561
[42] Domain Supervised Deep Learning Framework for Detecting Chinese Diabetes-Related Topics
Chen, Xinhuan
Zhang, Yong
Zhao, Kangzhi
Hu, Qingcheng
Xing, Chunxiao
[J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2018), PT II, 2018, 10828 : 53 - 71
[43] Pseudo-label based semi-supervised learning in the distributed machine learning framework
王晓曦
WU Wenjun
YANG Feng
SI Pengbo
ZHANG Xuanyi
ZHANG Yanhua
[J]. High Technology Letters, 2022, 28 (02) : 172 - 180
[44] ALGORITHMS FOR SUPERVISED MACHINE LEARNING- BASED STRUCTURAL PERFORMANCE EVALUATION FRAMEWORK
Wang, Xiaowei
Heo, YeongAe
[J]. PROCEEDINGS OF THE ASME 39TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, OMAE2020, VOL 2A, 2020,
[45] A Systematic Approach of Dataset definition for a Supervised Machine Learning using NFR Framework
Marinho, Matheus
Arruda, Danilo
Wanderley, Fernando
Lins, Anthony
[J]. 2018 11TH INTERNATIONAL CONFERENCE ON THE QUALITY OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (QUATIC), 2018, : 110 - 118
[46] UCSL : A Machine Learning Expectation-Maximization Framework for Unsupervised Clustering Driven by Supervised Learning
Louiset, Robin
Gori, Pietro
Dufumier, Benoit
Houenou, Josselin
Grigis, Antoine
Duchesnay, Edouard
[J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 755 - 771
[47] Pseudo-label based semi-supervised learning in the distributed machine learning framework
Wang, Xiaoxi
Wu, Wenjun
Yang, Feng
Si, Pengbo
Zhang, Xuanyi
Zhang, Yanhua
[J]. High Technology Letters, 2022, 28 (02): : 172 - 180
[48] An Ensemble-based Supervised Machine Learning Framework for Android Ransomware Detection
Sharma, Shweta
Challa, Rama Krishna
Kumar, Rakesh
[J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (3A) : 422 - 429
[49] A semi-supervised interpretable machine learning framework for sensor fault detection
Martakis, Panagiotis
Movsessian, Artur
Reuland, Yves
Pai, Sai G. S.
Quqa, Said
Cava, David Garcia
Tcherniak, Dmitri
Chatzi, Eleni
[J]. SMART STRUCTURES AND SYSTEMS, 2022, 29 (01) : 251 - 266
[50] Supervised Machine Learning Tools and PUF Based Internet of Vehicles Authentication Framework
Sadhu, Pintu Kumar
Eickholt, Jesse
Yanambaka, Venkata P.
Abdelgawad, Ahmed
[J]. ELECTRONICS, 2022, 11 (23)

← 1 2 3 4 5 →