Fusion-Attention Network for person search with free-form natural language

Cited by: 17
Authors
Ji, Zhong [1]
Li, Shengjia [1]
Pang, Yanwei [1]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Person search; Natural language description; Attention network
DOI
10.1016/j.patrec.2018.10.020
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In the task of searching for persons in surveillance videos or large-scale image datasets, retrieving persons with free-form natural language is more challenging than retrieving them with images and attributes. To deal with the challenges arising from the complexity of free-form natural language and of the visual-description mapping, we propose to strengthen the role of textual descriptions by means of fusion and attention mechanisms, making the discriminative words visually sensitive. Specifically, we develop an end-to-end fusion-attention structure, called the Description-Strengthened Fusion-Attention Network (DSFA-Net), to tackle this task. DSFA-Net has a fusion sub-network and an attention sub-network, in which three attention mechanisms are applied. Extensive experiments on the large-scale CUHK-PEDES dataset demonstrate the superiority of DSFA-Net. (C) 2018 Elsevier B.V. All rights reserved.
Pages: 205-211
Number of pages: 7
Related papers
50 in total
  • [21] Natural Local Approximation based Contouring Control for Free-form Contours
    Meng, Hao
    Lou, Yunjiang
    Zhou, Jiangpeng
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 4434 - 4439
  • [22] Scale Voting With Pyramidal Feature Fusion Network for Person Search
    Hong, Zheran
    Liu, Bin
    Lu, Yan
    Yin, Guojun
    Yu, Nenghai
    IEEE ACCESS, 2019, 7 : 139692 - 139702
  • [23] Augmenting and Structuring User Queries to Support Efficient Free-Form Code Search
    Sirres, Raphael
    Bissyande, Tegawende F.
    Kim, Dongsun
    Lo, David
    Klein, Jacques
    Kim, Kisub
    Le Traon, Yves
    PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, : 945 - 945
  • [24] Augmenting and structuring user queries to support efficient free-form code search
    Raphael Sirres
    Tegawendé F. Bissyandé
    Dongsun Kim
    David Lo
    Jacques Klein
    Kisub Kim
    Yves Le Traon
    Empirical Software Engineering, 2018, 23 : 2622 - 2654
  • [25] Augmenting and structuring user queries to support efficient free-form code search
    Sirres, Raphael
    Bissyande, Tegawende F.
    Kim, Dongsun
    Lo, David
    Klein, Jacques
    Kim, Kisub
    Le Traon, Yves
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (05) : 2622 - 2654
  • [26] Deep Free-Form Deformation Network for Object-Mask Registration
    Zhang, Haoyang
    He, Xuming
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4261 - 4269
  • [27] Hierarchical Gumbel Attention Network for Text-based Person Search
    Zheng, Kecheng
    Liu, Wu
    Liu, Jiawei
    Zha, Zheng-Jun
    Mei, Tao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3441 - 3449
  • [28] Feature attention fusion network for occluded person re-identification
    Zhuang, Xuyao
    Wei, Dan
    Liang, Danyang
    Jiang, Lei
    IMAGE AND VISION COMPUTING, 2024, 143
  • [29] Computing natural division lines on free-form surfaces based on measured data
    Lukacs, G
    Andor, L
    MATHEMATICAL METHODS FOR CURVES AND SURFACES II, 1998, : 319 - 326
  • [30] Inspection path planning of free-form surfaces based on improved cuckoo search algorithm
    Chen, Yueping
    Tan, Bo
    Zeng, Linan
    MEASUREMENT & CONTROL, 2023, 56 (7-8) : 1321 - 1332