Audiovisual Biometric Network with Deep Feature Fusion for Identification and Text Prompted Verification

被引:1
|
作者
Atenco, Juan Carlos [1 ]
Moreno, Juan Carlos [1 ]
Ramirez, Juan Manuel [1 ]
机构
[1] Sta Maria Tonanzintla, Natl Inst Astrophys Opt & Elect, Dept Elect, Luis Enrique Erro 1, Cholula 72840, Puebla, Mexico
关键词
multimodal biometrics; text prompted verification; multitask learning; deep feature fusion; speaker recognition; face recognition; RECOGNITION;
D O I
10.3390/a16020066
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we present a bimodal multitask network for audiovisual biometric recognition. The proposed network performs the fusion of features extracted from face and speech data through a weighted sum to jointly optimize the contribution of each modality, aiming for the identification of a client. The extracted speech features are simultaneously used in a speech recognition task with random digit sequences. Text prompted verification is performed by fusing the scores obtained from the matching of bimodal embeddings with the Word Error Rate (WER) metric calculated from the accuracy of the transcriptions. The score fusion outputs a value that can be compared with a threshold to accept or reject the identity of a client. Training and evaluation was carried out by using our proprietary database BIOMEX-DB and VidTIMIT audiovisual database. Our network achieved an accuracy of 100% and an Equal Error Rate (EER) of 0.44% for identification and verification, respectively, in the best case. To the best of our knowledge, this is the first system that combines the mutually related tasks previously described for biometric recognition.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Text-Independent Speaker Identification Through Feature Fusion and Deep Neural Network
    Jahangir, Rashid
    TEh, Ying Wah
    Memon, Nisar Ahmed
    Mujtaba, Ghulam
    Zareei, Mahdi
    Ishtiaq, Uzair
    Akhtar, Muhammad Zaheer
    Ali, Ihsan
    IEEE ACCESS, 2020, 8 : 32187 - 32202
  • [2] Bimodal biometric recognition system using Convolutional Neural Networks and fusion of deep audiovisual feature vectors
    Atenco-Vazquez, Juan Carlos
    Moreno-Rodriguez, Juan Carlos
    Ramirez-Cortes, Juan Manuel
    Arechiga-Martine, Rene
    Gomez-Gil, Pilar
    Fonseca-Delgado, Rigoberto
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (03): : 4 - 14
  • [3] Feature Level Fusion in Multimodal Biometric Identification
    Belhia, S.
    Gafour, A.
    2012 SECOND INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2012, : 418 - 423
  • [4] Deep Learning Convolutional Network for Bimodal Biometric Recognition with Information Fusion at Feature Level
    Atenco Vazquez, Juan Carlos
    Moreno Rodriguez, Juan Carlos
    Ramirez Cortes, Juan Manuel
    IEEE LATIN AMERICA TRANSACTIONS, 2023, 21 (05) : 652 - 661
  • [5] Biometric Identification Based on Feature Fusion with PCA and SVM
    Lefkovits, Laszlo
    Lefkovits, Szidonia
    Emerich, Simina
    TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696
  • [6] DEEP CNN BASED FEATURE EXTRACTOR FOR TEXT-PROMPTED SPEAKER RECOGNITION
    Novoselov, Sergey
    Kudashev, Oleg
    Shchemelinin, Vadim
    Kremnev, Ivan
    Lavrentyeva, Galina
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5334 - 5338
  • [7] Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities
    Mporas, Iosif
    Safavi, Saeid
    Sotudeh, Reza
    SPEECH AND COMPUTER, 2016, 9811 : 378 - 385
  • [8] Deep feature for text-dependent speaker verification
    Liu, Yuan
    Qian, Yanmin
    Chen, Nanxin
    Fu, Tianfan
    Zhang, Ya
    Yu, Kai
    SPEECH COMMUNICATION, 2015, 73 : 1 - 13
  • [9] Deep Multi-biometric Fusion for Audio-Visual User Re-Identification and Verification
    Marras, Mirko
    Marin-Reyes, Pedro A.
    Lorenzo-Navarro, Javier
    Castrillon-Santana, Modesto
    Fenu, Gianni
    PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2019), 2020, 11996 : 136 - 157
  • [10] Deep Feature Fusion for IRIS Based on Industrial Biometric Engineering
    Surendra, I.
    Sashank, T. Sai
    Praveena, M. D. Anto
    Manoj, R. Joseph
    PROCEEDINGS OF THE 2019 1ST INTERNATIONAL CONFERENCE ON SUSTAINABLE MANUFACTURING, MATERIALS AND TECHNOLOGIES, 2020, 2207