Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning

Cited by: 38

Authors:

Shi, Yuxuan [1]

Wei, Zhen [2]

Ling, Hefei [1]

Wang, Ziyang [1]

Shen, Jialie [3]

Li, Ping [1]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China

[2] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland

[3] Queens Univ Belfast, Belfast BT7 1NN, Antrim, North Ireland

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2021 / Vol. 23

Keywords:

Cognition; Feature extraction; Hair; Semantics; Training; Robustness; Convolution; Person retrieval; person re-identification; human attribute; graph convolutional network; NEURAL-NETWORK; REIDENTIFICATION; IDENTIFICATION;

DOI:

10.1109/TMM.2020.3042068

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Person retrieval largely relies on the appearance features of pedestrians. The task is considerably harder in surveillance videos, where cross-view and cross-camera data with low image resolution, motion blur, occlusion, and other kinds of image degradation make robust appearance features difficult to extract. To build a more reliable person retrieval system, recent works have introduced appearance attribute models to describe and distinguish different persons with high-level semantic concepts. Despite this progress, the value of appearance attributes remains under-explored. On the one hand, existing methods lack concise and precise attribute representations that are specific to each attribute category and, at the same time, able to filter out noisy information from irrelevant spatial locations and useless patterns. On the other hand, correlation and reasoning between different attributes, which could generate more useful information and add robustness to the retrieval system, are neglected. In this paper, we propose an Attribute Mining and Reasoning (AMR) framework capable of handling these issues. The AMR makes better use of appearance attributes with two main components. First, the AMR disentangles the representations of different attributes by localizing their spatial positions and identifying their effective patterns in a weakly supervised manner. To achieve more reliable localization, we propose the Attribute Localization Ensemble (ALE) module, which consists of multiple localization heads and a voting mechanism. Second, we introduce the Attribute Reasoning (AR) module to correlate different attributes together with the global appearance features and discover their latent relations to generate more comprehensive descriptions of pedestrians.
Extensive experiments on the DukeMTMC-ReID and Market-1501 datasets demonstrate the effectiveness of the proposed AMR framework as well as its superiority over existing state-of-the-art methods. The AMR model also shows strong generalization ability on the unseen CUHK03 dataset when trained only on the Market-1501 dataset.

Citation

Pages: 4376-4387

Number of pages: 12

