Fusion-Attention Network for person search with free-form natural language

被引:17
|
作者
Ji, Zhong [1 ]
Li, Shengjia [1 ]
Pang, Yanwei [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Person search; Natural language description; Attention network;
D O I
10.1016/j.patrec.2018.10.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the task of searching persons from surveillance videos or large scale image dataset, it is more challenging to utilize free-form natural language to retrieve persons than using images and attributes. Thus, to deal with the challenges brought from the complexity of free-from natural language and visual-description mapping, we propose to strengthen the role of textual descriptions by means of fusion and attention mechanisms to make the discriminative words visually sensitive. Specifically, we develop an end-to-end fusion-attention structure, called Description-Strengthened Fusion-Attention Network (DSFA-Net) to tackle the challenging task. Specifically, DSFA-Net has a fusion sub-network and an attention sub-network, where three attention mechanisms are applied. Extensive experiments are performed on the large-scale CUHK-PEDES, which demonstrate the superiority of DSFA-Net. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [31] Feature-level fusion for free-form object tracking using laserscanner and video
    Kaempchen, N
    Buehler, M
    Dietmayer, K
    2005 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2005, : 453 - 458
  • [32] Real-time tracking of free-form objects by range and intensity image fusion
    Hiura, Shinsaku
    Yamaguchi, Akashi
    Sato, Kosuke
    Inokuchi, Seiji
    Systems and Computers in Japan, 1998, 29 (08): : 19 - 27
  • [33] Cuckoo Search-based Range Image Registration for Free-form Surface Inspection
    Li, Taifeng
    Li, Peigen
    Li, Wenlong
    Gao, Liang
    Yi, Jin
    Bai, Yuewei
    PROCEEDINGS OF THE 2015 IEEE 19TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2015, : 505 - 510
  • [34] Multi-domain Attention Fusion Network For Language Recognition
    Ju M.
    Xu Y.
    Ke D.
    Su K.
    SN Computer Science, 4 (1)
  • [35] Study of geometrical defects of free-form surface machined using neural network
    Bloul, Benattia
    Chanal, Helene
    Aour, Benaoumeur
    Chtioui, Nargess
    ADVANCES IN MECHANICAL ENGINEERING, 2021, 13 (11)
  • [36] AATSN: Anatomy Aware Tumor Segmentation Network for PET-CT volumes and images using a lightweight fusion-attention mechanism
    Ahmad, Ibtihaj
    Xia, Yung
    Cui, Hengfei
    Ul Islam, Zain
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [37] Reconstruction algorithm for free-form surfaces based on neural network in W space
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design & Computer Graphics, 2000, 12 (05): : 344 - 348
  • [38] A Method of LED Free-form Tilted Lens Rapid Modeling Based on Scheme Language
    Dai Yi-dan
    AOPC 2017: OPTICAL STORAGE AND DISPLAY TECHNOLOGY, 2017, 10459
  • [39] Feature Fusion Transformer Network for Natural Language Inference
    Sun, Lei
    Yan, Hengxin
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 1009 - 1014
  • [40] Cross-scale global attention feature pyramid network for person search
    Li, Yang
    Xu, Huahu
    Bian, Minjie
    Xiao, Junsheng
    IMAGE AND VISION COMPUTING, 2021, 116