Audiovisual Dependency Attention for Violence Detection in Videos

被引：4

作者：

Pang, Wenfeng ^{[1
]}

Xie, Wei ^{[1
]}

He, Qianhua ^{[1
]}

Li, Yanxiong ^{[1
]}

Yang, Jichen ^{[2
]}

机构：

[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Peoples R China

[2] Guangdong Polytech Normal Univ, Sch Cyberspace Secur, Speech Informat Secur Lab, Guangzhou 510640, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

基金：

中国国家自然科学基金;

关键词：

Audiovisual dependency attention; dependency map; violence detection; SCENES; MOVIES; FUSION;

D O I：

10.1109/TMM.2022.3184533

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Violence detection in videos can help maintain public order, detect crimes, or provide timely assistance. In this paper, we aim to leverage multimodal information to determine whether successive frames contain violence. Specifically, we propose an audiovisual dependency attention (AVD-attention) module modified from the co-attention architecture to fuse visual and audio information, unlike commonly used methods such as the feature concatenation, addition, and score fusion. Because the AVD-attention module's dependency map contains sufficient fusion information, we argue that it should be applied more sufficiently. A combination pooling method is utilized to convert the dependency map to an attention vector, which can be considered a new feature that includes fusion information or a mask of the attention feature map. Since some information in the input feature might be lost after processing by attention modules, we employ a multimodal low-rank bilinear method that considers all pairwise interactions among two features in each time step to complement the original information for output features of the module. AVD-attention outperformed co-attention in experiments on the XD-Violence dataset. Our system outperforms state-of-the-art systems.

引用

页码：4922 / 4932

页数：11

共 50 条

[1] AUDIOVISUAL HIGHLIGHT DETECTION IN VIDEOS
Mundnich, Karel
Fenster, Alexandra
Khare, Aparna
Sundaram, Shiva
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4155 - 4159
[2] MULTIMODAL VIOLENCE DETECTION IN VIDEOS
Peixoto, Bruno
Lavi, Bahram
Bestagini, Paolo
Dias, Zanoni
Rocha, Anderson
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2957 - 2961
[3] TOWARD SUBJECTIVE VIOLENCE DETECTION IN VIDEOS
Peixoto, Bruno
Lavi, Bahram
Pereira Martin, Joao Paulo
Avila, Sandra
Dias, Zanoni
Rocha, Anderson
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8276 - 8280
[4] A dataset for automatic violence detection in videos
Bianculli, Miriana
Falcionelli, Nicola
Sernani, Paolo
Tomassini, Selene
Contardo, Paolo
Lombardi, Mara
Dragoni, Aldo Franco
DATA IN BRIEF, 2020, 33
[5] Human Violence Recognition and Detection in Surveillance Videos
Bilinski, Piotr
Bremond, Francois
2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2016, : 30 - 36
[6] Bidirectional Convolutional LSTM for the Detection of Violence in Videos
Hanson, Alex
Koutilya, P. N. V. R.
Krishnagopal, Sanjukta
Davis, Larry
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 280 - 295
[7] Federated Learning for Physical Violence Detection in Videos
Silva, Victor E. de S.
Lacerda, Tiago B.
Miranda, Pericles B. C.
Nascimento, Andre C. A.
Furtado, Ana Paula C.
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[8] Autocorrelation of gradients based violence detection in surveillance videos
Deepak, K.
Vignesh, L. K. P.
Chandrakala, S.
ICT EXPRESS, 2020, 6 (03): : 155 - 159
[9] Violence Detection from Videos using HOG Features
Das, Sunanda
Sarker, Amlan
Mahmud, Tareq
2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2019,
[10] Detection of Violence in Cartoon Videos Using Visual Features
Khalil, Tahira
Bangash, Javed Iqbal
Khan, Abdul Waheed
Lashari, Saima Anwar
Khan, Abdullah
Ramli, Dzati Athiar
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 4962 - 4971

← 1 2 3 4 5 →