Text-Independent Speaker Verification Using Lightweight 3D Convolutional Neural Networks

被引:0
|
作者
Chen, Jyun-Yan [1 ]
Jeng, Jin-Tsong [1 ]
机构
[1] Natl Formosa Univ, Huwei Township, Taiwan
关键词
lightweight; 3D-CNN; speaker verification system; cosine similarity; text-independent; RECOGNITION;
D O I
10.1109/ICSSE61472.2024.10608879
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a text-independent speaker verification system leveraging lightweight 3D Convolutional Neural Networks (3D-CNN). Our system independently operates of text and focuses on classifying speakers based on their extracted features. We employ lightweight 3D-CNN to capture the nuances within speech samples from the same speaker. Initially, speaker speech data is used for enrollment, generating corresponding speaker features that form the basis of the speaker model, also referred to as the identity discriminator. Subsequently, speech data requiring verification is utilized as evaluation data, and the resulting speaker features are compared with the speaker model using cosine similarity. Experimental findings show that our system achieves 14.3% on Equal Error Rate (EER). Additionally, the performance of the lightweight 3D-CNN system remains consistent compared to the 3D-CNN system. At the same time, the proposed highlighting system can effective to reduce on computational cost.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] TEXT-INDEPENDENT SPEAKER VERIFICATION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Toifi, Amirsina
    Dawson, Jeremy
    Nasrabadi, Nasser M.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [2] Text-independent speaker verification using predictive neural networks
    Finan, RA
    Sapeluk, AT
    Damper, RI
    [J]. FIFTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1997, (440): : 274 - 279
  • [3] Text-Independent Speaker Identification Using Formants and Convolutional Neural Networks
    Camarena-Ibarrola, Antonio
    Reynoso, Miguel
    Figueroa, Karina
    [J]. ADVANCES IN SOFT COMPUTING (MICAI 2021), PT II, 2021, 13068 : 108 - 119
  • [5] Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings
    Zhang, Chunlei
    Koishida, Kazuhito
    Hansen, John H. L.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1633 - 1644
  • [6] Neural Embedding Extractors for Text-Independent Speaker Verification
    Alam, Jahangir
    Kang, Woohyun
    Fathan, Abderrahim
    [J]. SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 10 - 23
  • [7] Text-Independent Speaker Identification with Glottal Flow and 1D Convolutional Neural Networks
    Camarena-Ibarrola, Antonio
    Ruiz-Gaona, Erick
    Figueroa, Karina
    [J]. PATTERN RECOGNITION, MCPR 2024, 2024, 14755 : 287 - 296
  • [8] TEMPORAL DYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR TEXT-INDEPENDENT SPEAKER VERIFICATION AND PHONEMIC ANALYSIS
    Kim, Seong-Hu
    Nam, Hyeonuk
    Park, Yong-Hwa
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6742 - 6746
  • [9] Generalized locally recurrent probabilistic neural networks for text-independent speaker verification
    Ganchev, T
    Fakotakis, N
    Tasoulis, DK
    Vrahatis, MN
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 41 - 44
  • [10] Automatic text-independent speaker verification using convolutional deep belief network
    Rakhmanenko, I. A.
    Shelupanov, A. A.
    Kostyuchenko, E. Y.
    [J]. COMPUTER OPTICS, 2020, 44 (04) : 596 - +