Utilizing Tandem Features for Text-Independent Speaker Recognition on Short Utterances

被引:0
|
作者
Alvarez, Arvin Kenneth [1 ]
Pelipas, Mary Tricia Ann [1 ]
Rayos del Sol, Carl Ivan [1 ]
Tomas, John Paul [1 ]
机构
[1] Mapua Univ, Makati, Philippines
关键词
Speaker Recognition; Tandem Feature; Mel Frequency Cepstral Coefficients; Short Utterance Speaker Recognition;
D O I
10.1145/3366650.3366677
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study focuses on the application of Deep Neural Networks trained to discriminate amongst senones to improve speaker recognition performance when dealing with text-independent Short Utterance Speaker Recognition (SUSR). The features derived from the said network are theorized to be more robust given that they can eliminate any unnecessary information in the final representation of the speaker. The efficacy of these features is evaluated using the test subset of the LibriSpeech speech corpus. It is found that the system's performance, especially when dealing with SUSR is greatly improved when these above-mentioned features are concatenated with traditional, more widely used Mel Frequency Cepstral Coefficients (MFCC) as measured in terms of Equal Error Rate (EER).
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [1] Robust features for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    [J]. Neural Computing and Applications, 2020, 32 : 13863 - 13883
  • [2] Robust features for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (17): : 13863 - 13883
  • [3] A deep learning approach for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    [J]. Multimedia Tools and Applications, 2023, 82 : 33111 - 33133
  • [4] A deep learning approach for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) : 33111 - 33133
  • [5] TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES
    Liu, Kai
    Zhou, Huan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6569 - 6573
  • [6] Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments
    Chakroun, Rania
    Frikha, Mondher
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (29-30) : 21279 - 21298
  • [7] Robust Text-independent Speaker recognition with Short Utterances using Gaussian Mixture Models
    Chakroun, Rania
    Frikha, Mondher
    [J]. 2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 2204 - 2209
  • [8] Efficient text-independent speaker recognition with short utterances in both clean and uncontrolled environments
    Rania Chakroun
    Mondher Frikha
    [J]. Multimedia Tools and Applications, 2020, 79 : 21279 - 21298
  • [9] End-to-end DNN based text-independent speaker recognition for long and short utterances
    Rohdin, Johan
    Silnova, Anna
    Diez, Mireia
    Plchot, Oldrich
    Matejka, Pavel
    Burget, Lukas
    Glembek, Ondrej
    [J]. COMPUTER SPEECH AND LANGUAGE, 2020, 59 : 22 - 35
  • [10] TEXT-INDEPENDENT SPEAKER RECOGNITION
    ATAL, BS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 181 - &