Fusion-Attention Network for person search with free-form natural language

Cited by: 17
Authors
Ji, Zhong [1]
Li, Shengjia [1]
Pang, Yanwei [1]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Person search; Natural language description; Attention network
DOI
10.1016/j.patrec.2018.10.020
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the task of searching for persons in surveillance videos or large-scale image datasets, retrieving persons with free-form natural language is more challenging than using images or attributes. To deal with the challenges arising from the complexity of free-form natural language and of the visual-description mapping, we propose to strengthen the role of textual descriptions by means of fusion and attention mechanisms, making the discriminative words visually sensitive. Specifically, we develop an end-to-end fusion-attention structure, called the Description-Strengthened Fusion-Attention Network (DSFA-Net), to tackle this task. DSFA-Net has a fusion sub-network and an attention sub-network, where three attention mechanisms are applied. Extensive experiments on the large-scale CUHK-PEDES dataset demonstrate the superiority of DSFA-Net. (C) 2018 Elsevier B.V. All rights reserved.
Pages: 205-211
Number of pages: 7