The role of dynamic features in text dependent and -independent speaker verification

被引:0
|
作者
Liu, Yang [1 ]
Russell, Martin [1 ]
Carey, Michael [1 ]
机构
[1] Univ Birmingham, Dept Elect Elect & Comp Engn, Birmingham, W Midlands, England
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A segmental hidden Markov model (SHMM) is a hidden Markov model (HMM) whose states are associated with sequences of acoustic feature vectors (or segments), rather than individual vectors. By treating segments as homogeneous units it is possible, for example, to develop better models of speech dynamics. This paper considers the potential benefits of a trajectory-based segmental HMM for speaker recognition. Text-dependent speaker verification (TDSV) results obtained on YOHO and text-independent speaker verification (TI-SV) results on Switchboard are presented. The YOHO results show a 44% reduction in false acceptances using the segmental model compared with a conventional HMM, while the Switchboard results do not show any improvement relative to a conventional Gausian Mixture Model (GMM) system. Further experiments were conducted to explain these results. They indicate that the priority of a "segmental GMM" is to model stationary regions and shed light on the role of delta parameters in conventional TI-SV.
引用
收藏
页码:669 / 672
页数:4
相关论文
共 50 条
  • [21] Graphical models for text-independent speaker verification
    Sánchez-Soto, E
    Sigelle, M
    Chollet, G
    NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 410 - 415
  • [22] Optimization of GMM for text independent speaker verification system
    Varchol, Peter
    Levicky, Dusan
    Juhar, Jozef
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA 2008, 2008, : 45 - 48
  • [23] Language dependency in text-independent speaker verification
    Auckenthaler, R
    Carey, MJ
    Mason, JSD
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 441 - 444
  • [25] Text-independent speaker verification in embedded environments
    Tydlitat, Borivoj
    Navratil, Jiri
    Pelecanos, Jason W.
    Ramaswamy, Ganesh N.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 293 - +
  • [26] ORTHOGONAL TRAINING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Zhu, Yingke
    Mak, Brian
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6584 - 6588
  • [27] Articulatory movement features for short-duration text-dependent speaker verification
    Zhang Y.
    Long Y.
    Shen X.
    Wei H.
    Yang M.
    Ye H.
    Mao H.
    International Journal of Speech Technology, 2017, 20 (4) : 753 - 759
  • [28] The Role of 'Delta' Features in Speaker Verification
    Liu, Ying
    Russell, Martin J.
    Carey, Michael J.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1425 - 1428
  • [29] DEEP BOTTLENECK FEATURES FOR I-VECTOR BASED TEXT-INDEPENDENT SPEAKER VERIFICATION
    Ghalehjegh, Sina Hamidi
    Rose, Richard C.
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 555 - 560
  • [30] Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities
    Mporas, Iosif
    Safavi, Saeid
    Sotudeh, Reza
    SPEECH AND COMPUTER, 2016, 9811 : 378 - 385