FPAN: Fine-grained and progressive attention localization network for data retrieval

被引:3
|
作者
Chen, Sijia [1 ]
Song, Bin [1 ]
Guo, Jie [1 ]
Zhang, Yanling [1 ]
Du, Xiaojiang [2 ]
Guizani, Mohsen [3 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Shaanxi, Peoples R China
[2] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
[3] Univ Idaho, Dept Elect & Comp Engn, Moscow, ID 83843 USA
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Query-based object localization; Fine-grained attention; Progressive attention; Unified framework; Fully convolution localization network; OBJECT TRACKING; MODEL;
D O I
10.1016/j.comnet.2018.07.011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The localization of the target object for data retrieval is a key issue in the Intelligent and Connected Transportation Systems (ICTS). However, due to the lack of intelligence in the traditional transportation system, it takes a lot of resources to manually retrieve and locate the queried objects from a large number of images. In order to solve this issue, we propose an effective method for query-based object localization, which uses artificial intelligence techniques to automatically locate the queried object in complex backgrounds. The proposed method is termed as Fine-grained and Progressive Attention Localization Network (FPAN), which uses an image and a queried object as input to accurately locate the target object in the image. Specifically, the fine-grained attention module is naturally embedded into each layer of a convolution neural network (CNN), thereby gradually suppressing the regions that are irrelevant to the queried object and eventually focusing attention on the target area. We further employ top-down attentions fusion algorithm operated by a learnable cascade up-sampling structure to establish the connection between the attention map and the exact location of the queried object in the original image. Furthermore, the FPAN is trained by multi-task learning with box segmentation loss and cosine loss. At last, we conduct comprehensive experiments on both queried-based digit localization and object tracking with synthetic and benchmark datasets. The experimental results show that our algorithm is far superior than other algorithms on the synthesis datasets and outperforms most existing trackers on the OTB and VOT datasets. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:98 / 111
页数:14
相关论文
共 50 条
  • [1] Progressive Co-Attention Network for Fine-Grained Visual Classification
    Zhang, Tian
    Chang, Dongliang
    Ma, Zhanyu
    Guo, Jun
    [J]. 2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [2] Fine-grained Iterative Attention Network for Temporal Language Localization in Videos
    Qu, Xiaoye
    Tang, Pengwei
    Zou, Zhikang
    Cheng, Yu
    Dong, Jianfeng
    Zhou, Pan
    Xu, Zichuan
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4280 - 4288
  • [3] A feature consistency driven attention erasing network for fine-grained image retrieval
    Zhao, Qi
    Wang, Xu
    Lyu, Shuchang
    Liu, Binghao
    Yang, Yifan
    [J]. PATTERN RECOGNITION, 2022, 128
  • [4] A fast residual attention network for fine-grained unsupervised anomaly detection and localization
    Nafti, Najeh
    Besbes, Olfa
    Ben Abdallah, Asma
    Vacavant, Antoine
    Bedoui, Mohamed Hedi
    [J]. APPLIED SOFT COMPUTING, 2024, 165
  • [5] Fine-Grained Image Retrieval via Object Localization
    Wang, Rong
    Zou, Wei
    Wang, Jiajun
    [J]. ELECTRONICS, 2023, 12 (10)
  • [6] Two-Level Progressive Attention Convolutional Network for Fine-Grained Image Recognition
    Wei, Hua
    Zhu, Ming
    Wang, Bo
    Wang, Jiarong
    Sun, Deyao
    [J]. IEEE ACCESS, 2020, 8 : 104985 - 104995
  • [7] A Progressive Gated Attention Model for Fine-Grained Visual Classification
    Zhu, Qiangxi
    Li, Zhixin
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2063 - 2068
  • [8] Dual attention interactive fine-grained classification network based on data augmentation
    Zhu, Qiangxi
    Kuang, Wenlan
    Li, Zhixin
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [9] Attribute-Guided Multi-Level Attention Network for Fine-Grained Fashion Retrieval
    Xiao, Ling
    Yamasaki, Toshihiko
    [J]. IEEE ACCESS, 2024, 12 (48068-48080) : 48068 - 48080
  • [10] Fine-Grained Retrieval with Autoencoders
    Portenier, Tiziano
    Hu, Qiyang
    Favaro, Paolo
    Zwicker, Matthias
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 85 - 95