Exploring Object-Centric and Scene-Centric CNN Features and Their Complementarity for Human Rights Violations Recognition in Images

被引:5
|
作者
Kalliatakis, Grigorios [1 ]
Ehsan, Shoaib [1 ]
Leonardis, Ales [2 ]
Fasli, Maria [1 ]
Mcdonald-Maier, Klaus D. [1 ]
机构
[1] Univ Essex, Sch Comp Sci & Elect Engn, Colchester CO4 3SQ, Essex, England
[2] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
来源
IEEE ACCESS | 2019年 / 7卷
基金
英国工程与自然科学研究理事会; 英国经济与社会研究理事会;
关键词
Computer vision; image interpretation; visual recognition; convolutional neural networks; human rights abuses recognition; FUSION;
D O I
10.1109/ACCESS.2019.2891745
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying potential abuses of human rights through imagery is a novel and challenging task in the field of computer vision, which will enable to expose human rights violations over large-scale data that may otherwise be impossible. While standard databases for object and scene categorization contain hundreds of different classes, the largest available dataset of human rights violations contains only four classes. Here, we introduce the human rights archive (HRA) database, a verified-by-experts repository of 3050 human rights violations photographs, labeled with human rights semantic categories, comprising a list of the types of human rights abuses encountered at present. With the HRA dataset and a two-phase transfer learning scheme, we fine-tuned the state-of-the-art deep convolutional neural networks (CNNs) to provide human rights violations classification CNNs. We also present extensive experiments refined to evaluate how well object-centric and scene-centric CNN features can be combined for the task of recognizing human rights abuses. With this, we show that the HRA database poses a challenge at a higher level for the well-studied representation learning methods and provide a benchmark in the task of human rights violations recognition in visual context. We expect that this dataset can help to open up new horizons on creating systems that are able to recognize rich information about human rights violations.
引用
收藏
页码:10045 / 10056
页数:12
相关论文
共 3 条
  • [1] Deep convolution neural network with scene-centric and object-centric information for object detection
    Shen, Zong-Ying
    Han, Shiang-Yu
    Fu, Li-Chen
    Hsiao, Pei-Yung
    Lau, Yo-Chung
    Chang, Sheng-Jen
    [J]. IMAGE AND VISION COMPUTING, 2019, 85 : 14 - 25
  • [2] Scene-Centric vs. Object-Centric Image-Text Cross-Modal Retrieval: A Reproducibility Study
    Hendriksen, Mariya
    Vakulenko, Svitlana
    Kuiper, Ernst
    de Rijke, Maarten
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 68 - 85
  • [3] Local Fast R-CNN Flow for Object-Centric Event Recognition in Complex Traffic Scenes
    Gu, Qin
    Yang, Jianyu
    Yan, Wei Qi
    Li, Yanqiang
    Klette, Reinhard
    [J]. IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10799 : 439 - 452