Online image augmentation via regional cross-attention

被引:0
|
作者
Yin, Chuan [1 ]
Xu, Yichen [1 ]
Zhang, Siyi [1 ]
Jin, Jingyuan [1 ]
Zhang, Pengquan [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Elect & Informat, Hangzhou 310018, Peoples R China
关键词
Data augmentation; Fine-grained image classification; Cross-attention mechanism;
D O I
10.1016/j.compeleceng.2024.109571
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at overcoming undesirable fine-grained image augmentation caused by inaccurate salient region location in challenging scenarios, we propose a cross-region attention-based automatic image augmentation method. Unlike existing offline methods that randomly choose the sub-regions for image mixing, we first propose a cross-region attention mechanism, which can simultaneously explore the intro and inter-image block relation through the collaboration of fix and shift window-based cross-attention. Secondly, an image mixing sub-network is designed based on the cross-region attention. This sub-network can be easily employed in various fine-grained classification networks. After integrating the image mixing sub-network and finegrained classification network into a union, we focus on solving the two-layer optimization problem caused by the inconsistency between heterogeneous network structures. The superiority of the proposed method for fine-grained image enhancement is verified through extensive experimental evaluations on six fine-grained datasets.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Image Caption with Synchronous Cross-Attention
    Wang, Yue
    Liu, Jinlai
    Wang, Xiaojie
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 433 - 441
  • [2] Semantic Image Synthesis via Class-Adaptive Cross-Attention
    Fontanini, Tomaso
    Ferrari, Claudio
    Lisanti, Giuseppe
    Bertozzi, Massimo
    Prati, Andrea
    IEEE ACCESS, 2025, 13 : 10326 - 10339
  • [3] Semantic Image Synthesis via Class-Adaptive Cross-Attention
    Fontanini, Tomaso
    Ferrari, Claudio
    Lisanti, Giuseppe
    Bertozzi, Massimo
    Prati, Andrea
    IEEE Access, 2025, 13 : 10326 - 10339
  • [4] Semantic Image Synthesis via Class-Adaptive Cross-Attention
    Fontanini, Tomaso
    Ferrari, Claudio
    Lisanti, Giuseppe
    Bertozzi, Massimo
    Prati, Andrea
    IEEE ACCESS, 2025, 13 : 10326 - 10339
  • [5] Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network
    Zhang, Bo
    Chen, Yaxiong
    Xiong, Shengwu
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 899 - 913
  • [6] Dual Cross-Attention for medical image segmentation
    Ates, Gorkem Can
    Mohan, Prasoon
    Celik, Emrah
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [7] Optical remote sensing image salient object detection via bidirectional cross-attention and attention restoration
    Gu, Yubin
    Chen, Siting
    Sun, Xiaoshuai
    Ji, Jiayi
    Zhou, Yiyi
    Ji, Rongrong
    PATTERN RECOGNITION, 2025, 164
  • [8] Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation
    Yang, Yinfei
    Jin, Ning
    Lin, Kuo
    Guo, Mandy
    Cer, Daniel
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 263 - 268
  • [9] Deformable Cross-Attention Transformer for Medical Image Registration
    Chen, Junyu
    Liu, Yihao
    He, Yufan
    Du, Yong
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 115 - 125
  • [10] CAFIN: cross-attention based face image repair network
    Li, Yaqian
    Li, Kairan
    Li, Haibin
    Zhang, Wenming
    MULTIMEDIA SYSTEMS, 2024, 30 (05)