Audiovisual Biometric Network with Deep Feature Fusion for Identification and Text Prompted Verification

Cited by: 1
Authors
Atenco, Juan Carlos [1]
Moreno, Juan Carlos [1]
Ramirez, Juan Manuel [1]
Affiliations
[1] Sta Maria Tonanzintla, Natl Inst Astrophys Opt & Elect, Dept Elect, Luis Enrique Erro 1, Cholula 72840, Puebla, Mexico
Keywords
multimodal biometrics; text prompted verification; multitask learning; deep feature fusion; speaker recognition; face recognition; RECOGNITION
DOI
10.3390/a16020066
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we present a bimodal multitask network for audiovisual biometric recognition. The proposed network fuses features extracted from face and speech data through a weighted sum to jointly optimize the contribution of each modality for client identification. The extracted speech features are simultaneously used in a speech recognition task with random digit sequences. Text-prompted verification is performed by fusing the scores obtained from matching bimodal embeddings with the Word Error Rate (WER) computed from the transcriptions. The score fusion outputs a value that is compared with a threshold to accept or reject the claimed identity of a client. Training and evaluation were carried out using our proprietary database BIOMEX-DB and the VidTIMIT audiovisual database. In the best case, our network achieved an accuracy of 100% for identification and an Equal Error Rate (EER) of 0.44% for verification. To the best of our knowledge, this is the first system that combines the mutually related tasks described above for biometric recognition.
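The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the fusion formulas, so the linear score blend, the `alpha` weight, the `threshold` value, and all function names below are assumptions made for illustration. In the paper the modality weights are optimized during training, whereas here `w_face` is a fixed hypothetical constant.

```python
import math
from typing import List, Sequence


def word_error_rate(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """WER = word-level edit distance divided by the number of reference words."""
    if not reference:
        raise ValueError("reference must not be empty")
    prev = list(range(len(hypothesis) + 1))  # distances for an empty reference prefix
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(reference)


def fuse_features(face: Sequence[float], speech: Sequence[float],
                  w_face: float) -> List[float]:
    """Weighted sum of two equal-length feature vectors (w_face is hypothetical)."""
    if len(face) != len(speech):
        raise ValueError("feature vectors must have the same length")
    return [w_face * f + (1.0 - w_face) * s for f, s in zip(face, speech)]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Matching score between an enrolled embedding and a probe embedding."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def verification_score(similarity: float, wer: float, alpha: float = 0.5) -> float:
    """Assumed fusion rule: linear blend of similarity and (1 - WER)."""
    transcript_score = max(0.0, 1.0 - wer)  # WER can exceed 1, so clamp at 0
    return alpha * similarity + (1.0 - alpha) * transcript_score


def accept(score: float, threshold: float = 0.7) -> bool:
    """Accept the claimed identity when the fused score reaches the threshold."""
    return score >= threshold


if __name__ == "__main__":
    prompted = "3 1 4 1 5".split()    # digit sequence shown to the client
    transcribed = "3 1 4 5".split()   # hypothetical recognizer output
    wer = word_error_rate(prompted, transcribed)
    enrolled = fuse_features([1.0, 0.0], [0.0, 1.0], w_face=0.5)
    probe = fuse_features([0.9, 0.1], [0.1, 0.9], w_face=0.5)
    score = verification_score(cosine_similarity(enrolled, probe), wer)
    print(f"WER={wer:.2f} score={score:.3f} accepted={accept(score)}")
```

In practice the threshold would be tuned on held-out data, for example at the operating point where false acceptance and false rejection rates are equal, which is how an EER is obtained.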
Pages: 17