Fusion-Attention Network for person search with free-form natural language

被引:17
|
作者
Ji, Zhong [1 ]
Li, Shengjia [1 ]
Pang, Yanwei [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Person search; Natural language description; Attention network;
D O I
10.1016/j.patrec.2018.10.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the task of searching persons from surveillance videos or large scale image dataset, it is more challenging to utilize free-form natural language to retrieve persons than using images and attributes. Thus, to deal with the challenges brought from the complexity of free-from natural language and visual-description mapping, we propose to strengthen the role of textual descriptions by means of fusion and attention mechanisms to make the discriminative words visually sensitive. Specifically, we develop an end-to-end fusion-attention structure, called Description-Strengthened Fusion-Attention Network (DSFA-Net) to tackle the challenging task. Specifically, DSFA-Net has a fusion sub-network and an attention sub-network, where three attention mechanisms are applied. Extensive experiments are performed on the large-scale CUHK-PEDES, which demonstrate the superiority of DSFA-Net. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [41] Adversarial Attribute-Text Embedding for Person Search With Natural Language Query
    Zha, Zheng-Jun
    Liu, Jiawei
    Chen, Di
    Wu, Feng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1836 - 1846
  • [44] Optimum cooling system design of a free-form injection mold using an abductive network
    Lin, JC
    JOURNAL OF MATERIALS PROCESSING TECHNOLOGY, 2002, 120 (1-3) : 226 - 236
  • [45] DAAPS: A Deformable-Attention-Based Anchor-Free Person Search Model
    Xin, Xiaoqi
    Han, Dezhi
    Cui, Mingming
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (02): : 2407 - 2425
  • [46] Multi-Attention-Guided Cascading Network for End-to-End Person Search
    Yang, Jianxi
    Wang, Xiaoyong
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [47] Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search
    Liu, Jiawei
    Zha, Zheng-Jun
    Hong, Richang
    Wang, Meng
    Zhang, Yongdong
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 665 - 673
  • [48] Attention-based mechanism and feature fusion network for person re-identification
    An, Mingshou
    He, Yunchuan
    Lim, Hye-Youn
    Kang, Dae-Seong
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2024, 20 (01)
  • [49] Person Re-identification Based on Multi-scale Network Attention Fusion
    Wang Fenhua
    Zhao Bo
    Huang Chao
    Yan Youqi
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (12) : 3045 - 3052
  • [50] From free-form structures to natural lighting - how engineering innovation pushes the limits of architecture
    Schmid, V.
    STRUCTURES AND ARCHITECTURE, 2010, : 723 - 730