Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning

Cited by: 38
Authors
Shi, Yuxuan [1]
Wei, Zhen [2]
Ling, Hefei [1]
Wang, Ziyang [1]
Shen, Jialie [3]
Li, Ping [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
[3] Queens Univ Belfast, Belfast BT7 1NN, Antrim, North Ireland
Keywords
Cognition; Feature extraction; Hair; Semantics; Training; Robustness; Convolution; Person retrieval; person re-identification; human attribute; graph convolutional network; NEURAL-NETWORK; REIDENTIFICATION; IDENTIFICATION;
DOI
10.1109/TMM.2020.3042068
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Person retrieval relies largely on the appearance features of pedestrians. The task is more difficult in surveillance videos, where cross-view and cross-camera data with low image resolution, motion blur, occlusion and other kinds of image degradation limit the extraction of robust appearance features. To build a more reliable person retrieval system, recent works have introduced appearance attribute models that describe and distinguish different persons with high-level semantic concepts. Despite this progress, the value of appearance attributes remains under-explored. On the one hand, existing methods lack concise and precise attribute representations that are specific to each attribute category and that can also filter out noisy information from irrelevant spatial locations and useless patterns. On the other hand, they neglect correlation and reasoning between different attributes, which could generate more useful information and make the retrieval system more robust. In this paper, we propose an Attribute Mining and Reasoning (AMR) framework that is capable of handling both issues. The AMR makes better use of appearance attributes through two main components. First, the AMR disentangles the representations of different attributes by localizing their spatial positions and identifying their effective patterns in a weakly supervised manner. To achieve more reliable localization, we propose the Attribute Localization Ensemble (ALE) module, which consists of multiple localization heads and a voting mechanism. Second, we introduce the Attribute Reasoning (AR) module, which correlates different attributes with the global appearance features and discovers their latent relations to generate more comprehensive descriptions of pedestrians.
Extensive experiments on the DukeMTMC-ReID and Market-1501 datasets demonstrate the effectiveness of the proposed AMR framework as well as its superiority over existing state-of-the-art methods. The AMR model also shows strong generalization ability on the unseen CUHK03 dataset when trained only on the Market-1501 dataset.
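The abstract gives no implementation details for the AR module, but the keywords list a graph convolutional network. As a rough illustration only, the sketch below shows one generic graph-convolution step in which a global appearance node and two attribute nodes exchange information. The node names, the toy adjacency, the 2-dimensional features, the identity weight matrix, and the helper names `matmul`, `normalize_adjacency` and `gcn_layer` are all assumptions made for this example; none of them comes from the paper.

```python
# Generic graph-convolution step: H' = ReLU(A_hat @ H @ W).
# Illustrative sketch only; NOT the authors' AR module.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def normalize_adjacency(adj):
    """Add self-loops, then row-normalize so each row sums to 1."""
    n = len(adj)
    with_loops = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
                  for i in range(n)]
    return [[v / sum(row) for v in row] for row in with_loops]

def gcn_layer(adj, features, weight):
    """Average each node with its neighbours, project, apply ReLU."""
    propagated = matmul(normalize_adjacency(adj), features)
    projected = matmul(propagated, weight)
    return [[max(0.0, v) for v in row] for row in projected]

# Hypothetical graph: node 0 = global appearance, node 1 = "hair",
# node 2 = "upper clothing". Each attribute node links to the global node.
adj = [[0, 1, 1],
       [1, 0, 0],
       [1, 0, 0]]
features = [[1.0, 0.0],   # global feature (toy values)
            [0.0, 1.0],   # hair attribute feature (toy values)
            [1.0, 1.0]]   # upper-clothing attribute feature (toy values)
weight = [[1.0, 0.0],
          [0.0, 1.0]]     # identity projection, assumed for readability

out = gcn_layer(adj, features, weight)
```

After this step each attribute node carries a blend of its own feature and the global feature, and the global node carries a blend of all three, which is the kind of cross-attribute mixing the abstract describes at a high level. A trained model would learn `weight` and likely the edge strengths as well.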
Pages: 4376-4387
Page count: 12