Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search

被引:7
|
作者
Huang, Wenxin [1 ]
Jia, Xuemei [2 ]
Zhong, Xian [3 ,4 ]
Wang, Xiao [5 ]
Jiang, Kui [2 ]
Wang, Zheng [2 ]
机构
[1] Hubei Univ, Sch Comp Sci & Informat Engn, 368 Youyi Ave, Wuhan 430062, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, 299 Bayi Rd, Wuhan 430072, Peoples R China
[3] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, 21 Gongda Rd, Wuhan 430070, Peoples R China
[4] Peking Univ, Sch Elect Engn & Comp Sci, 5 Yiheyuan Rd, Beijing 100091, Peoples R China
[5] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, 2 West Huangjiahu Rd, Wuhan 430081, Peoples R China
基金
中国国家自然科学基金;
关键词
Person search; alignment representation learning; coarse-to-fine; Part-Attentional Progressive Module; Re-weighting Alignment Module;
D O I
10.1145/3565886
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Person search is a time-consuming computer vision task that entails locating and recognizing query people in scenic pictures. Body components are commonly mismatched during matching due to position variation, occlusions, and partially absent body parts, resulting in unsatisfactory person search results. Existing approaches for extracting local characteristics of the human body using keypoint information are unable to handle the search job when distinct body parts are misaligned, ignoring to exploit multiple granularities, which is crucial in the person search process. Moreover, the alignment learning methods learn body part features with fixed and equal weights, ignoring the beneficial contextual information, e.g., the umbrella carried by the pedestrian, which supplements compelling clues for identifying the person. In this paper, we propose a Coarse-to-Fine Adaptive Alignment Representation (CFA(2)R) network for learning multiple granular features in misaligned person search in the coarse-to-fine perspective. To exploit more beneficial body parts and related context of the cropped pedestrians, we design a Part-Attentional Progressive Module (PAPM) to guide the network to focus on informative body parts and positive accessorial regions. Besides, we propose a Re-weighting Alignment Module (RAM) shedding light on more contributive parts instead of treating them equally. Specifically, adaptive re-weighted but not fixed part features are reconstructed by Re-weighting Reconstruction module, considering that different parts serve unequally during image matching. Extensive experiments conducted on CUHK-SYSU and PRW datasets demonstrate competitive performance of our proposed method.
引用
下载
收藏
页数:19
相关论文
共 50 条
  • [21] Coarse-to-fine eye movement strategy in visual search
    Over, E. A. B.
    Hooge, I. T. C.
    Vlaskamp, B. N. S.
    Erkelens, C. J.
    VISION RESEARCH, 2007, 47 (17) : 2272 - 2280
  • [22] Coarse-to-fine visual representation learning for medical images via class activation maps
    Yap B.P.
    Ng B.K.
    Computers in Biology and Medicine, 2024, 171
  • [23] Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval
    Zhu, Yunquan
    Gao, Xinkai
    Ke, Bo
    Qiao, Ruizhi
    Sun, Xing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11226 - 11235
  • [24] Unified Coarse-to-Fine Alignment for Video-Text Retrieval
    Wang, Ziyang
    Sung, Yi-Lin
    Cheng, Feng
    Bertasius, Gedas
    Bansal, Mohit
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2804 - 2815
  • [25] SANTO: a coarse-to-fine alignment and stitching method for spatial omics
    Li, Haoyang
    Lin, Yingxin
    He, Wenjia
    Han, Wenkai
    Xu, Xiaopeng
    Xu, Chencheng
    Gao, Elva
    Zhao, Hongyu
    Gao, Xin
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [26] Coarse-To-Fine Learning for Neural Machine Translation
    Zhang, Zhirui
    Liu, Shujie
    Li, Mu
    Zhou, Ming
    Chen, Enhong
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT I, 2018, 11108 : 316 - 328
  • [27] Coarse-to-Fine Search Technique to Detect Circles in Images
    M. Atiquzzaman
    The International Journal of Advanced Manufacturing Technology, 1999, 15 : 96 - 102
  • [28] Learning based coarse-to-fine image registration
    Jiang, Jiayan
    Zheng, Songfeng
    Toga, Arthur W.
    Tu, Zhuowen
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 429 - +
  • [29] Reinforcement learning based coarse-to-fine search for the maximum k-plex problem
    Jin, Yan
    Drake, John H.
    He, Kun
    Benlic, Una
    APPLIED SOFT COMPUTING, 2022, 131
  • [30] PALMPRINT RECOGNITION USING COARSE-TO-FINE STATISTICAL IMAGE REPRESENTATION
    Han, Yufei
    Sun, Zhenan
    Tan, Tieniu
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1969 - 1972