Weakly supervised detection with decoupled attention-based deep representation

Cited by: 2
Authors
Jiang, Wenhui [1]
Zhao, Zhicheng [1, 2]
Su, Fei [1, 2]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst & Network Culture, Beijing 100876, Peoples R China
Keywords
Weak supervision; Object detection; Deep learning; Attention model; Object localization
DOI
10.1007/s11042-017-5087-x
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Training object detectors with only image-level annotations is an important problem with a variety of applications. However, because objects are deformable, a target object delineated by a bounding box always includes irrelevant context and occlusions, which cause large intra-class variation and ambiguity in distinguishing objects from background. For this reason, identifying the object of interest amid substantial background clutter is very challenging. In this paper, we propose a decoupled attention-based deep model to optimize region-based object representation. Unlike existing approaches, which build the object representation in a single-tower model, our proposed network decouples object representation into two separate modules: image representation and attention localization. The image representation module captures content-based semantic representation, while the attention localization module regresses an attention map that simultaneously highlights the locations of the discriminative object parts and down-weights the irrelevant background present in the image. The combined representation alleviates the impact of noisy context and occlusions inside an object bounding box. As a result, object-background ambiguity can be largely reduced and background regions can be suppressed effectively. In addition, the proposed object representation model can be seamlessly integrated into a state-of-the-art weakly supervised detection framework, and the entire model can be trained end-to-end. We extensively evaluate the detection performance on the PASCAL VOC 2007, VOC 2010 and VOC 2012 datasets. Experimental results demonstrate that our approach effectively improves weakly supervised object detection.
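The abstract does not say how the attention map and the image representation are combined, so the following is only a minimal sketch of one plausible reading: a spatial softmax over attention scores followed by an attention-weighted sum of per-location features. The function names, the softmax normalization, and the toy values are all assumptions made for illustration, not the authors' implementation.

```python
import math

def spatial_softmax(logits):
    """Normalize a 2-D grid of attention scores so all weights sum to 1."""
    flat = [v for row in logits for v in row]
    peak = max(flat)  # subtract the maximum for numerical stability
    exps = [[math.exp(v - peak) for v in row] for row in logits]
    total = sum(sum(row) for row in exps)
    return [[e / total for e in row] for row in exps]

def attention_pool(features, logits):
    """Attention-weighted sum of per-location feature vectors.

    features: H x W x C nested lists (stand-in for the image representation module)
    logits:   H x W nested lists (stand-in for the attention localization module)
    Both inputs are hypothetical; the abstract gives no tensor shapes.
    """
    weights = spatial_softmax(logits)
    channels = len(features[0][0])
    pooled = [0.0] * channels
    for feat_row, w_row in zip(features, weights):
        for feat, w in zip(feat_row, w_row):
            for c in range(channels):
                pooled[c] += w * feat[c]
    return pooled

# Toy 2x2 region: the top-left cell holds the "object" feature [1, 0],
# the other three cells hold the "background" feature [0, 1].
features = [[[1.0, 0.0], [0.0, 1.0]],
            [[0.0, 1.0], [0.0, 1.0]]]
logits = [[4.0, 0.0],
          [0.0, 0.0]]  # high score on the object cell only
pooled = attention_pool(features, logits)
```

In this toy case a plain average over the four cells would be dominated by the background feature, whereas the attention-weighted sum is dominated by the object cell, which is the effect the abstract attributes to down-weighting irrelevant background inside a bounding box.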
Pages: 3261-3277
Page count: 17
Related papers
50 in total
  • [1] Weakly supervised detection with decoupled attention-based deep representation
    Wenhui Jiang
    Zhicheng Zhao
    Fei Su
    [J]. Multimedia Tools and Applications, 2018, 77 : 3261 - 3277
  • [2] Attention-based framework for weakly supervised video anomaly detection
    Hualin Ma
    Liyan Zhang
    [J]. The Journal of Supercomputing, 2022, 78 : 8409 - 8429
  • [3] Attention-based framework for weakly supervised video anomaly detection
    Ma, Hualin
    Zhang, Liyan
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (06): : 8409 - 8429
  • [4] Attention-based Selection Strategy for Weakly Supervised Object Localization
    Zhang, Zhenfei
    Bui, Tien D.
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10305 - 10311
  • [5] Attention-based Dropout Layer for Weakly Supervised Object Localization
    Choe, Junsuk
    Shim, Hyunjung
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2214 - 2223
  • [6] Attention-Based Deep Learning System for Classification of Breast Lesions-Multimodal, Weakly Supervised Approach
    Bobowicz, Maciej
    Rygusik, Marlena
    Buler, Jakub
    Buler, Rafal
    Ferlin, Maria
    Kwasigroch, Arkadiusz
    Szurowska, Edyta
    Grochowski, Michal
    [J]. CANCERS, 2023, 15 (10)
  • [7] A Weakly Supervised Text Detection Based on Attention Mechanism
    Dong, Lanfang
    Zhou, Diancheng
    Liu, Hanchao
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 406 - 417
  • [8] Weakly supervised target detection based on spatial attention
    Wenqing Zhao
    Lijiao Xu
    [J]. Visual Intelligence, 2 (1):
  • [9] TransCAM: Transformer attention-based CAM refinement for Weakly supervised semantic segmentation
    Li, Ruiwen
    Mai, Zheda
    Zhang, Zhibo
    Jang, Jongseong
    Sanner, Scott
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 92
  • [10] Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation
    Zhang, Tianyi
    Lin, Guosheng
    Cai, Jianfei
    Shen, Tong
    Shen, Chunhua
    Kot, Alex C.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (11) : 2930 - 2941