A deep learning approach for speaker recognition

Cited by: 2
Authors
Soufiane Hourri
Jamal Kharroubi
Affiliations
[1] Université Sidi Mohamed Ben Abdellah
[2] Faculté des Sciences et Techniques
[3] Laboratoire des Systèmes Intelligents et Applications
Keywords
Speaker recognition; Speaker verification; MFCC; DeepSF; Deep Neural Network - DNN; Deep Belief Network - DBN; Restricted Boltzmann Machine - RBM; Deep learning; K-means; Clustering
DOI
Not available
Abstract
Speaker verification (SV) is an important branch of speaker recognition. Several approaches have been investigated over the last few decades. In this context, deep learning has attracted growing interest from speech processing researchers and was recently introduced to speaker recognition. In most cases, deep learning models are adapted from speech recognition applications and applied to speaker recognition, and they have proven competitive with state-of-the-art approaches. Nevertheless, the use of deep learning in speaker recognition remains tied to speech recognition. In this study, we propose a new way to use deep neural networks (DNNs) in speaker recognition, with the aim of making it easier for the DNN to learn the feature distribution. We were motivated by our previous work, in which we proposed a novel scoring method that works well with clean speech but needs improvement under noisy conditions. For this reason, we aim to transform the extracted feature vectors (MFCCs) into enhanced feature vectors, which we denote Deep Speaker Features (DeepSFs). Experiments were conducted on the THUYG-20 SRE corpus, and significant results were achieved. Moreover, the new method outperformed both i-vector/PLDA and our baseline system in both clean and noisy conditions.
Pages: 123 - 131
Number of pages: 8
Related papers
50 in total
  • [1] A deep learning approach for speaker recognition
    Hourri, Soufiane
    Kharroubi, Jamal
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (01) : 123 - 131
  • [2] Automatic Speaker Recognition using Transfer Learning Approach of Deep Learning Models
    Ganvir, Sonal
    Lal, Nidhi
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021 : 595 - 601
  • [3] A deep learning approach to integrate convolutional neural networks in speaker recognition
    Soufiane Hourri
    Nikola S. Nikolov
    Jamal Kharroubi
    [J]. International Journal of Speech Technology, 2020, 23 : 615 - 623
  • [4] A deep learning approach to integrate convolutional neural networks in speaker recognition
    Hourri, Soufiane
    Nikolov, Nikola S.
    Kharroubi, Jamal
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 615 - 623
  • [5] Deep learning methods in speaker recognition: A review
    Sztahó, Dávid
    Szaszák, György
    Beke, András
    [J]. Periodica polytechnica Electrical engineering and computer science, 2021, 65 (04) : 310 - 328
  • [6] A deep learning approach for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    [J]. Multimedia Tools and Applications, 2023, 82 : 33111 - 33133
  • [7] Speaker recognition based on deep learning: An overview
    Bai, Zhongxin
    Zhang, Xiao-Lei
    [J]. NEURAL NETWORKS, 2021, 140 : 65 - 99
  • [8] A deep learning approach for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) : 33111 - 33133
  • [9] An extreme learning machine approach for speaker recognition
    Lan, Yuan
    Hu, Zongjiang
    Soh, Yeng Chai
    Huang, Guang-Bin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 22 (3-4) : 417 - 425
  • [10] An extreme learning machine approach for speaker recognition
    Yuan Lan
    Zongjiang Hu
    Yeng Chai Soh
    Guang-Bin Huang
    [J]. Neural Computing and Applications, 2013, 22 : 417 - 425