Pose Calibrated Feature Aggregation for Video Face Set Recognition in Unconstrained Environments

被引:0
|
作者
Ali Hasani, Ibrahim [1 ]
Arif, Omar [1 ,2 ]
机构
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad 44000, Pakistan
[2] Amer Univ Sharjah, Dept Comp Sci & Engn, Sharjah, U Arab Emirates
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Face recognition; Metadata; Vectors; Feature extraction; Streaming media; Accuracy; Training; Fans; Three-dimensional displays; Video face recognition; feature aggregation; frame selection; open sets; multi-stream networks;
D O I
10.1109/ACCESS.2024.3481636
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents Pose Calibrated Feature Aggregation Network (PCFAN), an architecture for set/video face recognition. Using stacked attention blocks and a multi-modal architecture, it automatically assigns adaptive weights to every instance in the set, based on both the recognition embeddings and the associated face metadata. It uses these weights to produce a single, compact feature vector for the set. The model automatically learns to advocate for features from images with more favourable qualities and poses, which inherently hold more information. Our block can be inserted on top of any standard recognition model for set prediction and improved performance, particularly in unconstrained scenarios where subject pose and image quality vary considerably between frames. We test our approach on three challenging video face-recognition datasets, IJB-A, IJB-B, and YTF, and report state-of-the-art results. Moreover, a comparison with top aggregation methods as our baselines demonstrates that PCFAN is the superior approach.
引用
收藏
页码:156337 / 156346
页数:10
相关论文
共 50 条
  • [1] POSE CALIBRATED FEATURE AGGREGATION FOR FACE SET RECOGNITION
    Hasani, Ibrahim
    Arif, Omar
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 161 - 165
  • [2] A feature map aggregation network for unconstrained video face recognition
    Zhang, Luyang
    Wang, Huaibin
    Wang, Haitao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 2413 - 2425
  • [3] Still-to-video face recognition in unconstrained environments
    Wang, Haoyu
    Liu, Changsong
    Ding, Xiaoqing
    IMAGE PROCESSING: MACHINE VISION APPLICATIONS VIII, 2015, 9405
  • [4] Feature Aggregation Network for Video Face Recognition
    Liu, Zhaoxiang
    Hu, Huan
    Bai, Jinqiang
    Li, Shaohua
    Lian, Shiguo
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 990 - 998
  • [5] Pose and occlusion invariant face recognition system for video surveillance using extensive feature set
    Yoganand, A. Vivek
    Kavida, A. Celine
    Devi, D. Rukmani
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2020, 33 (03) : 222 - 239
  • [6] Face Recognition in Unconstrained Environments
    Kim, Dong-Ju
    Lee, Sang-Heon
    Sohn, Myoung-Kyu
    Kim, Byungmin
    Kim, Hyunduk
    2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 143 - 144
  • [7] Geometry Guided Feature Aggregation in Video Face Recognition
    Peng, Baoyun
    Jin, Xiao
    Wu, Yichao
    Li, Dongsheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2670 - 2677
  • [8] Face Recognition in Unconstrained Environments A Deep Architecture on A Small Training Set
    Saffar, Mohammad Taghi
    Rekabdar, Banafsheh
    Louis, Sushil
    Nicolescu, Mircea
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [9] Robust real-time video face recognition system for unconstrained environments
    Rajak, Amir
    Dailey, Matthew N.
    Ekpanyapong, Mongkol
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [10] Face Recognition in a Video by pose variations
    Bichwe, Madhavi R.
    Shende, Ranjana
    2015 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONTROL (IC4), 2015,