Pose Calibrated Feature Aggregation for Video Face Set Recognition in Unconstrained Environments

Cited by: 0

Authors:

Ali Hasani, Ibrahim [1]

Arif, Omar [1,2]

Affiliations:

[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad 44000, Pakistan

[2] Amer Univ Sharjah, Dept Comp Sci & Engn, Sharjah, U Arab Emirates

Source:

IEEE Access | 2024 / Vol. 12

Keywords:

Face recognition; Metadata; Vectors; Feature extraction; Streaming media; Accuracy; Training; Fans; Three-dimensional displays; Video face recognition; feature aggregation; frame selection; open sets; multi-stream networks;

DOI:

10.1109/ACCESS.2024.3481636

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

This paper presents the Pose Calibrated Feature Aggregation Network (PCFAN), an architecture for set/video face recognition. Using stacked attention blocks and a multi-modal architecture, it automatically assigns adaptive weights to every instance in the set, based on both the recognition embeddings and the associated face metadata. It uses these weights to produce a single, compact feature vector for the set. The model automatically learns to favour features from images with better quality and more favourable poses, which inherently hold more information. Our block can be inserted on top of any standard recognition model for set prediction and improved performance, particularly in unconstrained scenarios where subject pose and image quality vary considerably between frames. We test our approach on three challenging video face-recognition datasets, IJB-A, IJB-B, and YTF, and report state-of-the-art results. Moreover, a comparison against leading aggregation methods as baselines shows that PCFAN outperforms them.
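The weighting idea in the abstract can be illustrated with a minimal sketch. This is not the PCFAN architecture: the paper describes stacked attention blocks, whereas the code below substitutes a single linear scoring layer followed by a softmax. The function names, the two metadata fields (a quality score and a normalised absolute yaw), and all weight values are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_set(embeddings, metadata, w_embed, w_meta):
    """Pool per-frame embeddings into one unit-length set vector.

    Each frame is scored from its embedding and its metadata with a
    linear layer (an assumption made for this sketch), the scores are
    normalised with a softmax, and the embeddings are averaged using
    the resulting weights.
    """
    scores = [
        sum(a * b for a, b in zip(emb, w_embed))
        + sum(a * b for a, b in zip(meta, w_meta))
        for emb, meta in zip(embeddings, metadata)
    ]
    weights = softmax(scores)
    dim = len(embeddings[0])
    pooled = [
        sum(w * emb[k] for w, emb in zip(weights, embeddings))
        for k in range(dim)
    ]
    norm = math.sqrt(sum(v * v for v in pooled))
    return [v / norm for v in pooled], weights


# Three frames with 4-dimensional embeddings (toy values).
embeddings = [
    [1.0, 0.0, 0.0, 0.0],
    [0.8, 0.6, 0.0, 0.0],
    [0.6, 0.0, 0.8, 0.0],
]
# Hypothetical metadata per frame: [quality score, normalised |yaw|].
metadata = [
    [0.9, 0.1],  # sharp, near-frontal
    [0.2, 0.8],  # blurry, near-profile
    [0.5, 0.5],
]
w_embed = [0.0, 0.0, 0.0, 0.0]  # hypothetical: ignore embedding content here
w_meta = [2.0, -1.0]            # hypothetical: reward quality, penalise yaw

set_vector, weights = aggregate_set(embeddings, metadata, w_embed, w_meta)
print(weights)
print(set_vector)
```

With these toy weights the sharp, near-frontal frame receives the largest share of the pooled vector. In a trained model the scoring parameters would be learned jointly with the recognition objective rather than set by hand.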


Pages: 156337-156346

Number of pages: 10
