Exploring Object-Centric and Scene-Centric CNN Features and Their Complementarity for Human Rights Violations Recognition in Images

被引：5

作者：

Kalliatakis, Grigorios ^{[1
]}

Ehsan, Shoaib ^{[1
]}

Leonardis, Ales ^{[2
]}

Fasli, Maria ^{[1
]}

Mcdonald-Maier, Klaus D. ^{[1
]}

机构：

[1] Univ Essex, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, Essex, England

[2] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

英国工程与自然科学研究理事会; 英国经济与社会研究理事会;

关键词：

Computer vision; image interpretation; visual recognition; convolutional neural networks; human rights abuses recognition; FUSION;

D O I：

10.1109/ACCESS.2019.2891745

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Identifying potential abuses of human rights through imagery is a novel and challenging task in the field of computer vision, which will enable to expose human rights violations over large-scale data that may otherwise be impossible. While standard databases for object and scene categorization contain hundreds of different classes, the largest available dataset of human rights violations contains only four classes. Here, we introduce the human rights archive (HRA) database, a verified-by-experts repository of 3050 human rights violations photographs, labeled with human rights semantic categories, comprising a list of the types of human rights abuses encountered at present. With the HRA dataset and a two-phase transfer learning scheme, we fine-tuned the state-of-the-art deep convolutional neural networks (CNNs) to provide human rights violations classification CNNs. We also present extensive experiments refined to evaluate how well object-centric and scene-centric CNN features can be combined for the task of recognizing human rights abuses. With this, we show that the HRA database poses a challenge at a higher level for the well-studied representation learning methods and provide a benchmark in the task of human rights violations recognition in visual context. We expect that this dataset can help to open up new horizons on creating systems that are able to recognize rich information about human rights violations.

引用

页码：10045 / 10056

页数：12

共 3 条

[1] Deep convolution neural network with scene-centric and object-centric information for object detection
Shen, Zong-Ying
Han, Shiang-Yu
Fu, Li-Chen
Hsiao, Pei-Yung
Lau, Yo-Chung
Chang, Sheng-Jen
[J]. IMAGE AND VISION COMPUTING, 2019, 85 : 14 - 25
[2] Scene-Centric vs. Object-Centric Image-Text Cross-Modal Retrieval: A Reproducibility Study
Hendriksen, Mariya
Vakulenko, Svitlana
Kuiper, Ernst
de Rijke, Maarten
[J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 68 - 85
[3] Local Fast R-CNN Flow for Object-Centric Event Recognition in Complex Traffic Scenes
Gu, Qin
Yang, Jianyu
Yan, Wei Qi
Li, Yanqiang
Klette, Reinhard
[J]. IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10799 : 439 - 452

← 1 →