A novel multi-stream method for violent interaction detection using deep learning

被引:10
|
作者
Li, Hongchang [1 ]
Wang, Jing [1 ]
Han, Jianjun [1 ]
Zhang, Jinmin [1 ]
Yang, Yushan [1 ]
Zhao, Yue [1 ]
机构
[1] Xian Special Equipment Inspect Inst, Xian 710068, Peoples R China
来源
MEASUREMENT & CONTROL | 2020年 / 53卷 / 5-6期
基金
中国国家自然科学基金;
关键词
Violent interaction detection; hand-craft features; deep learning; VIDEO; RECOGNITION;
D O I
10.1177/0020294020902788
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Violent interaction detection is a hot topic in computer vision. However, the recent research works on violent interaction detection mainly focus on the traditional hand-craft features, and does not make full use of the research results of deep learning in computer vision. In this paper, we propose a new robust violent interaction detection framework based on multi-stream deep learning in surveillance scene. The proposed approach enhances the recognition performance of violent action in video by fusing three different streams: attention-based spatial RGB stream, temporal stream, and local spatial stream. The attention-based spatial RGB stream learns the spatial attention regions of persons that have high probability to be action region through soft-attention mechanism. The temporal stream employs optical flow as input to extract temporal features. The local spatial stream learns spatial local features using block images as input. Experimental results demonstrate the effectiveness and reliability of the proposed method on three violent interactive datasets: hockey fights, movies, violent interaction. We also verify the proposed method on our own elevator surveillance video dataset and the performance of the proposed method is satisfied.
引用
下载
收藏
页码:796 / 806
页数:11
相关论文
共 50 条
  • [1] Multi-stream Deep Learning Framework for Automated Presentation Assessment
    Li, Junnan
    Wong, Yongkang
    Kankanhalli, Mohan S.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 222 - 225
  • [2] Multi-Stream Deep Similarity Learning Networks for Visual Tracking
    Li, Kunpeng
    Kong, Yu
    Fu, Yun
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2166 - 2172
  • [3] Multimodal Multi-stream Deep Learning for Egocentric Activity Recognition
    Song, Sibo
    Chandrasekhar, Vijay
    Mandal, Bappaditya
    Li, Liyuan
    Lim, Joo-Hwee
    Babu, Giduthuri Sateesh
    San, Phyo Phyo
    Cheung, Ngai-Man
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 378 - 385
  • [4] A Multi-Stream Sequence Learning Framework for Human Interaction Recognition
    Haroon, Umair
    Ullah, Amin
    Hussain, Tanveer
    Ullah, Waseem
    Sajjad, Muhammad
    Muhammad, Khan
    Lee, Mi Young
    Baik, Sung Wook
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2022, 52 (03) : 435 - 444
  • [5] Multi-stream Network for Human-object Interaction Detection
    Wang, Chang
    Sun, Jinyu
    Ma, Shiwei
    Lu, Yuqiu
    Liu, Wang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (08)
  • [6] Ingredient analysis of biological wastewater using hybrid multi-stream deep learning framework
    Bakht, Ahtesham
    Nawaz, Alam
    Lee, Moonyong
    Lee, Hyunsoo
    COMPUTERS & CHEMICAL ENGINEERING, 2022, 168
  • [7] A Multi-Stream Approach to Mixed-Traffic Accident Recognition Using Deep Learning
    Fu, Swee Tee
    Theng, Lau Bee
    Shiong, Brian Loh Chung
    Mccarthy, Chris
    Tsun, Mark Tee Kit
    IEEE Access, 2024, 12 : 185232 - 185249
  • [8] A multi-stream CNN for deep violence detection in video sequences using handcrafted features
    Mohtavipour, Seyed Mehdi
    Saeidi, Mahmoud
    Arabsorkhi, Abouzar
    VISUAL COMPUTER, 2022, 38 (06): : 2057 - 2072
  • [9] A multi-stream CNN for deep violence detection in video sequences using handcrafted features
    Seyed Mehdi Mohtavipour
    Mahmoud Saeidi
    Abouzar Arabsorkhi
    The Visual Computer, 2022, 38 : 2057 - 2072
  • [10] Multi-task learning とmulti-stream monocular depth estimation using integrated model with multi-task learning and multi-stream
    Takamine, Michiru
    Endo, Satoshi
    Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05): : 1 - 9