EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION

被引:0
|
作者
Chen, Liping [1 ]
Zhao, Yong [2 ]
Zhang, Shi-Xiong [2 ]
Li, Jie [1 ]
Ye, Guoli [2 ]
Soong, Frank [3 ]
机构
[1] Microsoft Search Technol Ctr Asia, Beijing, Peoples R China
[2] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
[3] Microsoft Res Asia, Beijing, Peoples R China
关键词
Text-dependent speaker verification; sequential speaker characteristics; speaker supervector; dynamic time warping; VARIABILITY;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, given the speaker bottleneck feature vectors extracted with speaker discriminant neural networks, we focus on using the sequential speaker characteristics for text-dependent speaker verification. In each evaluation trial, speaker supervectors are used as the representations of the sequential speaker characteristics rendered in the compared speech utterances. To this end, dynamic time warping is used to warp the variable-length speaker feature vector sequences of the utterances to the same length. Thereafter for every utterance, a speaker supervector can be obtained as the concatenation of its speaker feature vectors. We use Euclidean distance and support vector machine (SVM) to compute the decision score on the speaker supervectors. Our experiments on a Microsoft internal keyword-spotting database showed the effectiveness of the proposed speaker supervector for text-dependent speaker verification. Moreover, when SVM backend was used in scoring, the speaker supervector achieved the best EER performance 1.627%, better than the combination of i-vector and probabilistic linear discriminant analysis.
引用
收藏
页码:5364 / 5368
页数:5
相关论文
共 50 条
  • [31] ATTENTION-BASED MODELS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Chowdhury, F. A. Rezaur Rahman
    Wang, Quan
    Moreno, Ignacio Lopez
    Wan, Li
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5359 - 5363
  • [32] Cohort Selection for Text-dependent Speaker Verification Score Normalization
    Khemiri, Houssemeddine
    Petrovska-Delacretaz, Dijana
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 689 - 692
  • [33] BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020
    Lozano-Diez, Alicia
    Silnova, Anna
    Pulugundla, Bhargav
    Rohdin, Johan
    Vesely, Karel
    Burget, Lukas
    Plchot, Oldrich
    Glembek, Ondrej
    Novotny, Ondvrej
    Matejka, Pavel
    [J]. INTERSPEECH 2020, 2020, : 761 - 765
  • [34] Multi-Task Learning for Text-dependent Speaker Verification
    Chen, Nanxin
    Qian, Yanmin
    Yu, Kai
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 185 - 189
  • [35] Tandem Features for Text-dependent Speaker Verification on the RedDots Corpus
    Alam, Md Jahangir
    Kenny, Patrick
    Gupta, Vishwa
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 420 - 424
  • [36] Sub-band based text-dependent speaker verification
    Sivakumaran, P
    Ariyaeeinia, AM
    Loomes, MJ
    [J]. SPEECH COMMUNICATION, 2003, 41 (2-3) : 485 - 509
  • [37] Unsupervised Learning of HMM Topology for Text-dependent Speaker Verification
    Liu, Ming
    Huang, Thomas
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 921 - 924
  • [38] A study of the relative importance of temporal characteristics in text-dependent and text-constrained speaker verification
    Nealand, JH
    Pelecanos, JW
    Zilca, RD
    Ramaswamy, GN
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 653 - 656
  • [39] ON INSTANTANEOUS AND TRANSITIONAL SPECTRAL INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    BERNASCONI, C
    [J]. SPEECH COMMUNICATION, 1990, 9 (02) : 129 - 139
  • [40] Addressing Text-Dependent Speaker Verification Using Singing Speech
    Shi, Yan
    Zhou, Juanjuan
    Long, Yanhua
    Li, Yijie
    Mao, Hongwei
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (13):