A deep learning approach to integrate convolutional neural networks in speaker recognition

被引:0
|
作者
Soufiane Hourri
Nikola S. Nikolov
Jamal Kharroubi
机构
[1] Université Sidi Mohamed Ben Abdellah,Faculté des Sciences et Techniques, Laboratoire des Systèmes Intelligents et Applications
[2] University of Limerick,undefined
关键词
Speaker recognition; MFCC; Convolutional neural network; Restricted Boltzmann Machine; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a novel usage of convolutional neural networks (CNNs) for the problem of speaker recognition. While being particularly designed for computer vision problems, CNNs have recently been applied for speaker recognition by using spectrograms as input images. We believe that this approach is not optimal as it may result in two cumulative errors in solving both a computer vision and a speaker recognition problem. In this work, we aim at integrating CNNs in speaker recognition without relying on images. We use Restricted Boltzmann Machines (RBMs) to extract speakers models as matrices and introduce a new way to model target and non-target speakers, in order to perform speaker verification. Thus, we use a CNN to discriminate between target and non-target matrices. Experiments were conducted with the THUYG-20 SRE corpus under three noise conditions: clean, 9 db, and 0 db. The results demonstrate that our method outperforms the state-of-the-art approaches by decreasing the error rate by up to 60%.
引用
收藏
页码:615 / 623
页数:8
相关论文
共 50 条
  • [1] A deep learning approach to integrate convolutional neural networks in speaker recognition
    Hourri, Soufiane
    Nikolov, Nikola S.
    Kharroubi, Jamal
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 615 - 623
  • [2] A Deep Learning Approach for Automatic Ionogram Parameters Recognition With Convolutional Neural Networks
    Sherstyukov, Ruslan
    Moges, Samson
    Kozlovsky, Alexander
    Ulich, Thomas
    [J]. EARTH AND SPACE SCIENCE, 2024, 11 (10)
  • [3] Deep Learning based on Image Recognition Convolutional Neural Networks
    Alamri, Salah
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (04): : 559 - 566
  • [4] A Novel Approach of Deep Convolutional Neural Networks for Sketch Recognition
    Sadouk, Lamyaa
    Gadi, Taoufiq
    Essoufi, El Hassan
    [J]. PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 99 - 112
  • [5] Speaker recognition using convolutional siamese neural networks
    Jung H.
    Yoon S.
    Park N.
    [J]. Transactions of the Korean Institute of Electrical Engineers, 2020, 69 (01): : 164 - 169
  • [6] A deep learning approach for speaker recognition
    Hourri, Soufiane
    Kharroubi, Jamal
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (01) : 123 - 131
  • [7] A deep learning approach for speaker recognition
    Soufiane Hourri
    Jamal Kharroubi
    [J]. International Journal of Speech Technology, 2020, 23 : 123 - 131
  • [8] Insights into Deep Neural Networks for Speaker Recognition
    Garcia-Romero, Daniel
    McCree, Alan
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1141 - 1145
  • [9] A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition
    School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta
    GA
    30332, United States
    不详
    Sicily, Italy
    [J]. Neurocomputing, (448-459):
  • [10] A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition
    Huang, Zhen
    Siniscalchi, Sabato Marco
    Lee, Chin-Hui
    [J]. NEUROCOMPUTING, 2016, 218 : 448 - 459