INSTANCE-AWARE REMOTE SENSING IMAGE CAPTIONING WITH CROSS-HIERARCHY ATTENTION

被引:10
|
作者
Wang, Chengze
Jiang, Zhiyu [1 ]
Yuan, Yuan
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing image captioning; semantic understanding; visual attention;
D O I
10.1109/IGARSS39084.2020.9323213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spatial attention is a straightforward approach to enhance the performance for remote sensing image captioning. However, conventional spatial attention approaches consider only the attention distribution on one fixed coarse grid, resulting in the semantics of tiny objects can be easily ignored or disturbed during the visual feature extraction. Worse still, the fixed semantic level of conventional spatial attention limits the image understanding in different levels and perspectives, which is critical for tackling the huge diversity in remote sensing images. To address these issues, we propose a remote sensing image caption generator with instance-awareness and cross-hierarchy attention. 1) The instances awareness is achieved by introducing a multi-level feature architecture that contains the visual information of multi-level instance-possible regions and their surroundings. 2) Moreover, based on this multi-level feature extraction, a cross-hierarchy attention mechanism is proposed to prompt the decoder to dynamically focus on different semantic hierarchies and instances at each time step. The experimental results on public datasets demonstrate the superiority of proposed approach over existing methods.
引用
收藏
页码:980 / 983
页数:4
相关论文
共 50 条
  • [1] Instance-aware image dehazing
    Chao, Qingqing
    Yan, Jinqiang
    Sun, Tianmeng
    Li, Silong
    Chi, Jieru
    Yang, Guowei
    Chen, Chenglizhao
    Yu, Teng
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [2] Instance-Aware Distillation for Efficient Object Detection in Remote Sensing Images
    Li, Cong
    Cheng, Gong
    Wang, Guangxing
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [3] Rethinking Remote Sensing Pretrained Model: Instance-Aware Visual Prompting for Remote Sensing Scene Classification
    Fang, Leyuan
    Kuang, Yang
    Liu, Qiang
    Yang, Yi
    Yue, Jun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 13
  • [4] Image inpainting based on cross-hierarchy global and local aware network
    Jiang, Bin
    Huang, Wei
    Yang, Chao
    Huang, Yun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18747 - 18760
  • [5] Image inpainting based on cross-hierarchy global and local aware network
    Bin Jiang
    Wei Huang
    Chao Yang
    Yun Huang
    [J]. Multimedia Tools and Applications, 2023, 82 : 18747 - 18760
  • [6] Instance-Aware Contour Learning for Vectorized Building Extraction From Remote Sensing Imagery
    Huang, Xingliang
    Chen, Kaiqiang
    Wang, Zhirui
    Sun, Xian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 12745 - 12759
  • [7] Weakly supervised object detection from remote sensing images via self-attention distillation and instance-aware mining
    Peng Yang
    Shi Zhou
    Linlin Wang
    Guowei Yang
    [J]. Multimedia Tools and Applications, 2024, 83 : 39073 - 39095
  • [8] Sound Active Attention Framework for Remote Sensing Image Captioning
    Lu, Xiaoqiang
    Wang, Binqiang
    Zheng, Xiangtao
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (03): : 1985 - 2000
  • [9] InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
    Kim, Soohyun
    Baek, Jongbeom
    Park, Jihye
    Kim, Gyeongnyeon
    Kim, Seungryong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18300 - 18310
  • [10] Recurrent Attention and Semantic Gate for Remote Sensing Image Captioning
    Li, Yunpeng
    Zhang, Xiangrong
    Gu, Jing
    Li, Chen
    Wang, Xin
    Tang, Xu
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60