Pose Calibrated Feature Aggregation for Video Face Set Recognition in Unconstrained Environments

被引:0
|
作者
Ali Hasani, Ibrahim [1 ]
Arif, Omar [1 ,2 ]
机构
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad 44000, Pakistan
[2] Amer Univ Sharjah, Dept Comp Sci & Engn, Sharjah, U Arab Emirates
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Face recognition; Metadata; Vectors; Feature extraction; Streaming media; Accuracy; Training; Fans; Three-dimensional displays; Video face recognition; feature aggregation; frame selection; open sets; multi-stream networks;
D O I
10.1109/ACCESS.2024.3481636
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents Pose Calibrated Feature Aggregation Network (PCFAN), an architecture for set/video face recognition. Using stacked attention blocks and a multi-modal architecture, it automatically assigns adaptive weights to every instance in the set, based on both the recognition embeddings and the associated face metadata. It uses these weights to produce a single, compact feature vector for the set. The model automatically learns to advocate for features from images with more favourable qualities and poses, which inherently hold more information. Our block can be inserted on top of any standard recognition model for set prediction and improved performance, particularly in unconstrained scenarios where subject pose and image quality vary considerably between frames. We test our approach on three challenging video face-recognition datasets, IJB-A, IJB-B, and YTF, and report state-of-the-art results. Moreover, a comparison with top aggregation methods as our baselines demonstrates that PCFAN is the superior approach.
引用
收藏
页码:156337 / 156346
页数:10
相关论文
共 50 条
  • [31] Automatic face region tracking for highly accurate face recognition in unconstrained environments
    Kim, YO
    Paik, J
    Heo, J
    Koschan, A
    Abidi, B
    Abidi, M
    IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2003, : 29 - 36
  • [32] A comparative study of thermal face recognition methods in unconstrained environments
    Hermosilla, Gabriel
    Ruiz-del-Solar, Javier
    Verschae, Rodrigo
    Correa, Mauricio
    PATTERN RECOGNITION, 2012, 45 (07) : 2445 - 2459
  • [33] Robust Video Face Recognition Under Pose Variation
    Su, Ya
    NEURAL PROCESSING LETTERS, 2018, 47 (01) : 277 - 291
  • [34] Robust Video Face Recognition Under Pose Variation
    Ya Su
    Neural Processing Letters, 2018, 47 : 277 - 291
  • [35] Efficient face recognition with variant pose and illumination in video
    Dai, Yi
    Xiao, Guoqiang
    Qiu, Kaijin
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 18 - 22
  • [36] Set-to-set face recognition under variations in pose and illumination
    Chang, Jen-Mei
    Kirby, Michael
    Peterson, Chris
    2007 BIOMETRICS SYMPOSIUM, 2007, : 150 - 155
  • [37] An automatic system for unconstrained video-based face recognition
    Zheng J.
    Ranjan R.
    Chen C.-H.
    Chen J.-C.
    Castillo C.D.
    Chellappa R.
    IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020, 2 (03): : 194 - 209
  • [38] CoNAN: Conditional Neural Aggregation Network For Unconstrained Face Feature Fusion
    Jawade, Bhavin
    Mohan, Deen Dayal
    Fedorishin, Dennis
    Setlur, Srirangaraj
    Govindaraju, Venu
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [39] Audio and video feature fusion for activity recognition in unconstrained videos
    Lopes, Jose
    Singh, Sameer
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS, 2006, 4224 : 823 - 831
  • [40] Neural Aggregation Network for Video Face Recognition
    Yang, Jiaolong
    Ren, Peiran
    Zhang, Dongqing
    Chen, Dong
    Wen, Fang
    Li, Hongdong
    Hua, Gang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5216 - 5225