Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search

Cited by: 7
Authors
Huang, Wenxin [1]
Jia, Xuemei [2]
Zhong, Xian [3, 4]
Wang, Xiao [5]
Jiang, Kui [2]
Wang, Zheng [2]
Affiliations
[1] Hubei Univ, Sch Comp Sci & Informat Engn, 368 Youyi Ave, Wuhan 430062, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, 299 Bayi Rd, Wuhan 430072, Peoples R China
[3] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, 21 Gongda Rd, Wuhan 430070, Peoples R China
[4] Peking Univ, Sch Elect Engn & Comp Sci, 5 Yiheyuan Rd, Beijing 100091, Peoples R China
[5] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, 2 West Huangjiahu Rd, Wuhan 430081, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Person search; alignment representation learning; coarse-to-fine; Part-Attentional Progressive Module; Re-weighting Alignment Module;
DOI
10.1145/3565886
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Person search is a time-consuming computer vision task that entails locating and recognizing query persons in scene images. During matching, body parts are often mismatched because of position variation, occlusion, and partially missing body parts, which leads to unsatisfactory search results. Existing approaches that extract local features of the human body from keypoint information cannot handle the search task when distinct body parts are misaligned, and they neglect multiple granularities, which are crucial in person search. Moreover, alignment learning methods learn body part features with fixed and equal weights, ignoring beneficial contextual information, e.g., an umbrella carried by the pedestrian, which supplies compelling clues for identifying the person. In this paper, we propose a Coarse-to-Fine Adaptive Alignment Representation (CFA²R) network that learns multi-granularity features for misaligned person search from a coarse-to-fine perspective. To exploit more beneficial body parts and the related context of the cropped pedestrians, we design a Part-Attentional Progressive Module (PAPM) that guides the network to focus on informative body parts and positive accessory regions. In addition, we propose a Re-weighting Alignment Module (RAM) that highlights the more contributive parts instead of treating all parts equally. Specifically, a Re-weighting Reconstruction module reconstructs part features with adaptive rather than fixed weights, since different parts contribute unequally during image matching. Extensive experiments on the CUHK-SYSU and PRW datasets demonstrate the competitive performance of the proposed method.
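The contrast the abstract draws between fixed, equal part weights and adaptive re-weighting can be illustrated with a small sketch. This is not the paper's RAM or its Re-weighting Reconstruction module, whose details are not given in this record; it only assumes that each body part has a feature vector and a scalar relevance score, and that the scores are turned into weights with a softmax. The names `fuse_parts`, `part_features`, and `part_scores` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_parts(part_features, part_scores=None):
    """Fuse per-part feature vectors into one vector by a weighted sum.

    part_features: list of equal-length lists, one per body part.
    part_scores:   optional list of relevance scores (hypothetical input).
                   None reproduces the fixed, equal-weight baseline.
    Returns (fused_vector, weights).
    """
    count = len(part_features)
    if part_scores is None:
        weights = [1.0 / count] * count      # fixed and equal weights
    else:
        weights = softmax(part_scores)       # adaptive weights (assumed form)
    dim = len(part_features[0])
    fused = [
        sum(w * feat[d] for w, feat in zip(weights, part_features))
        for d in range(dim)
    ]
    return fused, weights


if __name__ == "__main__":
    # Two toy parts: a visible torso and an occluded leg region.
    parts = [[1.0, 0.0], [0.0, 1.0]]
    print(fuse_parts(parts))                 # equal weighting
    print(fuse_parts(parts, [2.0, 0.0]))     # torso weighted more heavily
```

With equal weights every part contributes the same amount to the fused descriptor; with scores, a part judged more reliable (for example, an unoccluded one) dominates the sum.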
Pages: 19
Related papers
50 in total
  • [41] Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning
    Luo, Yadan
    Wang, Ziwei
    Huang, Zi
    Yang, Yang
    Zhao, Cong
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 237 - 246
  • [42] Coarse-to-Fine Entity Alignment for Chinese Heterogeneous Encyclopedia Knowledge Base
    Wu, Meng
    Jiang, Tingting
    Bu, Chenyang
    Zhu, Bin
    FUTURE INTERNET, 2022, 14 (02):
  • [43] WWN: Integration with Coarse-to-fine, Supervised and Reinforcement Learning
    Zheng, Zejia
    Weng, Juyang
    Zhang, Zhengyou
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1517 - 1524
  • [44] Coarse-to-Fine Semantic Alignment for Cross-Modal Moment Localization
    Hu, Yupeng
    Nie, Liqiang
    Liu, Meng
    Wang, Kun
    Wang, Yinglong
    Hua, Xian-Sheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5933 - 5943
  • [45] Coarse-To-Fine Incremental Few-Shot Learning
    Xiang, Xiang
    Tan, Yuwen
    Wan, Qian
    Ma, Jing
    Yuille, Alan
    Hager, Gregory D.
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 205 - 222
  • [46] Coarse-to-fine eye movement behavior during visual search
    Godwin, Hayward J.
    Reichle, Erik D.
    Menneer, Tamaryn
    PSYCHONOMIC BULLETIN & REVIEW, 2014, 21 (05) : 1244 - 1249
  • [47] ReFlowNet: Revisiting Coarse-to-fine Learning of Optical Flow
    Xu, Leyang
    Lu, Zongqing
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 429 - 442
  • [48] Federated Unsupervised Cluster-Contrastive learning for person Re-identification: A coarse-to-fine approach
    Weng, Jianfeng
    Hu, Kun
    Yao, Tingting
    Wang, Jingya
    Wang, Zhiyong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 237
  • [49] Coarse-to-fine adaptive masks for appearance matching of occluded scenes
    NTT Basic Research Lab, Kanagawa, Japan
    Mach Vision Appl, 5-6 (232-242):
  • [50] Coarse-to-fine adaptive masks for appearance matching of occluded scenes
    Edwards, JL
    Murase, H
    MACHINE VISION AND APPLICATIONS, 1998, 10 (5-6) : 232 - 242