Pose Calibrated Feature Aggregation for Video Face Set Recognition in Unconstrained Environments

被引：0

作者：

Ali Hasani, Ibrahim ^{[1
]}

Arif, Omar ^{[1
,2
]}

机构：

[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad 44000, Pakistan

[2] Amer Univ Sharjah, Dept Comp Sci & Engn, Sharjah, U Arab Emirates

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Face recognition; Metadata; Vectors; Feature extraction; Streaming media; Accuracy; Training; Fans; Three-dimensional displays; Video face recognition; feature aggregation; frame selection; open sets; multi-stream networks;

D O I：

10.1109/ACCESS.2024.3481636

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents Pose Calibrated Feature Aggregation Network (PCFAN), an architecture for set/video face recognition. Using stacked attention blocks and a multi-modal architecture, it automatically assigns adaptive weights to every instance in the set, based on both the recognition embeddings and the associated face metadata. It uses these weights to produce a single, compact feature vector for the set. The model automatically learns to advocate for features from images with more favourable qualities and poses, which inherently hold more information. Our block can be inserted on top of any standard recognition model for set prediction and improved performance, particularly in unconstrained scenarios where subject pose and image quality vary considerably between frames. We test our approach on three challenging video face-recognition datasets, IJB-A, IJB-B, and YTF, and report state-of-the-art results. Moreover, a comparison with top aggregation methods as our baselines demonstrates that PCFAN is the superior approach.

引用

页码：156337 / 156346

页数：10

共 50 条

[1] POSE CALIBRATED FEATURE AGGREGATION FOR FACE SET RECOGNITION
Hasani, Ibrahim
Arif, Omar
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 161 - 165
[2] A feature map aggregation network for unconstrained video face recognition
Zhang, Luyang
Wang, Huaibin
Wang, Haitao
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2413 - 2425
[3] Still-to-video face recognition in unconstrained environments
Wang, Haoyu
Liu, Changsong
Ding, Xiaoqing
IMAGE PROCESSING: MACHINE VISION APPLICATIONS VIII, 2015, 9405
[4] Feature Aggregation Network for Video Face Recognition
Liu, Zhaoxiang
Hu, Huan
Bai, Jinqiang
Li, Shaohua
Lian, Shiguo
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 990 - 998
[5] Pose and occlusion invariant face recognition system for video surveillance using extensive feature set
Yoganand, A. Vivek
Kavida, A. Celine
Devi, D. Rukmani
INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2020, 33 (03) : 222 - 239
[6] Face Recognition in Unconstrained Environments
Kim, Dong-Ju
Lee, Sang-Heon
Sohn, Myoung-Kyu
Kim, Byungmin
Kim, Hyunduk
2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 143 - 144
[7] Geometry Guided Feature Aggregation in Video Face Recognition
Peng, Baoyun
Jin, Xiao
Wu, Yichao
Li, Dongsheng
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2670 - 2677
[8] Face Recognition in Unconstrained Environments A Deep Architecture on A Small Training Set
Saffar, Mohammad Taghi
Rekabdar, Banafsheh
Louis, Sushil
Nicolescu, Mircea
2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
[9] Robust real-time video face recognition system for unconstrained environments
Rajak, Amir
Dailey, Matthew N.
Ekpanyapong, Mongkol
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
[10] Face Recognition in a Video by pose variations
Bichwe, Madhavi R.
Shende, Ranjana
2015 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONTROL (IC4), 2015,

← 1 2 3 4 5 →