Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search

被引:7
|
作者
Huang, Wenxin [1 ]
Jia, Xuemei [2 ]
Zhong, Xian [3 ,4 ]
Wang, Xiao [5 ]
Jiang, Kui [2 ]
Wang, Zheng [2 ]
机构
[1] Hubei Univ, Sch Comp Sci & Informat Engn, 368 Youyi Ave, Wuhan 430062, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, 299 Bayi Rd, Wuhan 430072, Peoples R China
[3] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, 21 Gongda Rd, Wuhan 430070, Peoples R China
[4] Peking Univ, Sch Elect Engn & Comp Sci, 5 Yiheyuan Rd, Beijing 100091, Peoples R China
[5] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, 2 West Huangjiahu Rd, Wuhan 430081, Peoples R China
基金
中国国家自然科学基金;
关键词
Person search; alignment representation learning; coarse-to-fine; Part-Attentional Progressive Module; Re-weighting Alignment Module;
D O I
10.1145/3565886
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person search is a time-consuming computer vision task that entails locating and recognizing query people in scenic pictures. Body components are commonly mismatched during matching due to position variation, occlusions, and partially absent body parts, resulting in unsatisfactory person search results. Existing approaches for extracting local characteristics of the human body using keypoint information are unable to handle the search job when distinct body parts are misaligned, ignoring to exploit multiple granularities, which is crucial in the person search process. Moreover, the alignment learning methods learn body part features with fixed and equal weights, ignoring the beneficial contextual information, e.g., the umbrella carried by the pedestrian, which supplements compelling clues for identifying the person. In this paper, we propose a Coarse-to-Fine Adaptive Alignment Representation (CFA(2)R) network for learning multiple granular features in misaligned person search in the coarse-to-fine perspective. To exploit more beneficial body parts and related context of the cropped pedestrians, we design a Part-Attentional Progressive Module (PAPM) to guide the network to focus on informative body parts and positive accessorial regions. Besides, we propose a Re-weighting Alignment Module (RAM) shedding light on more contributive parts instead of treating them equally. Specifically, adaptive re-weighted but not fixed part features are reconstructed by Re-weighting Reconstruction module, considering that different parts serve unequally during image matching. Extensive experiments conducted on CUHK-SYSU and PRW datasets demonstrate competitive performance of our proposed method.
引用
下载
收藏
页数:19
相关论文
共 50 条
  • [31] Time course of visual perception:: Coarse-to-fine processing and beyond
    Hegde, Jay
    PROGRESS IN NEUROBIOLOGY, 2008, 84 (04) : 405 - 439
  • [32] COARSE-TO-FINE PARTIAL DISTORTION SEARCH ALGORITHM FOR MOTION ESTIMATION
    Yeh, Chia-Hung
    Wu, Ming-Te
    Chern, Shiunn-Jang
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (09): : 2523 - 2530
  • [33] Active speech source localization by a dual coarse-to-fine search
    Duraiswami, R
    Dmitry, Z
    Davis, LS
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 3309 - 3312
  • [34] Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
    Tian, Kaibin
    Cheng, Yanhua
    Liu, Yi
    Hou, Xinglin
    Chen, Quan
    Li, Han
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5207 - 5214
  • [35] A global cuckoo optimization algorithm using coarse-to-fine search
    Ma, Wei
    Sun, Zheng-Xing
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2015, 43 (12): : 2429 - 2439
  • [36] Is correspondence search in human stereo vision a coarse-to-fine process?
    Mallot, HA
    Gillner, S
    Arndt, PA
    BIOLOGICAL CYBERNETICS, 1996, 74 (02) : 95 - 106
  • [37] Circles Detection in Images by Using of Coarse-to-Fine Search Technique
    Fu Hudai
    Wang Hua
    Gao Jingang
    ADVANCES IN MANUFACTURING TECHNOLOGY, PTS 1-4, 2012, 220-223 : 1385 - 1388
  • [38] Coarse-to-fine eye movement behavior during visual search
    Hayward J. Godwin
    Erik D. Reichle
    Tamaryn Menneer
    Psychonomic Bulletin & Review, 2014, 21 : 1244 - 1249
  • [39] A Face Alignment Accelerator Based on Optimized Coarse-to-Fine Shape Searching
    Liu, Leibo
    Wang, Qiang
    Zhu, Wenping
    Mo, Huiyu
    Wang, Tianchen
    Yin, Shouyi
    Shi, Yiyu
    Wei, Shaojun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2467 - 2481
  • [40] Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation
    Honari, Sina
    Yosinski, Jason
    Vincent, Pascal
    Pal, Christopher
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5743 - 5752