Articulatory movement features for short-duration text-dependent speaker verification

被引:5
|
作者
Zhang Y. [2 ]
Long Y. [2 ]
Shen X. [1 ]
Wei H. [2 ]
Yang M. [2 ]
Ye H. [2 ]
Mao H. [2 ]
机构
[1] College of Humanities and Communications, Shanghai Normal University, Shanghai
[2] Department of Electronical and Information Engineering, Shanghai Normal University, Shanghai
关键词
Articulatory movement features; Dynamic time warping; Speaker verification; Text-dependent;
D O I
10.1007/s10772-017-9447-8
中图分类号
学科分类号
摘要
During our pronunciation process, the position and movement properties of articulators such as tongue, jaw, lips, etc are mainly captured by the articulatory movement features (AMFs). This paper investigates to use the AMFs for short-duration text-dependent speaker verification. The AMFs can characterize the relative motion trajectory of articulators of individual speakers directly, which is rarely affected by the external environment. Therefore, we expect that, the AMFs are superior to the traditional acoustic features, such as mel-frequency cepstral coefficients (MFCC), to characterize the speaker identity differences between speakers. The speaker similarity scores measured by the dynamic time warping (DTW) algorithm are used to make the speaker verification decisions. Experimental results show that the AMFs can bring significant performance gains over the traditional MFCC features for short-duration text-dependent speaker verification task. © 2017, Springer Science+Business Media, LLC.
引用
收藏
页码:753 / 759
页数:6
相关论文
共 50 条
  • [31] Covariance Based Deep Feature for Text-Dependent Speaker Verification
    Wang, Shuai
    Dinkel, Heinrich
    Qian, Yanmin
    Yu, Kai
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 231 - 242
  • [32] Transfer Learning for Speaker Verification with Short-Duration Audio
    Fathima, Noor
    Simha, J. B.
    Abhi, Shinu
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 5, SMARTCOM 2024, 2024, 949 : 195 - 205
  • [33] PHONE ADAPTIVE TRAINING FOR SHORT-DURATION SPEAKER VERIFICATION
    Soldi, Giovanni
    Bozonnet, Simon
    Beaugeant, Christophe
    Evans, Nicholas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2107 - 2111
  • [34] BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020
    Lozano-Diez, Alicia
    Silnova, Anna
    Pulugundla, Bhargav
    Rohdin, Johan
    Vesely, Karel
    Burget, Lukas
    Plchot, Oldrich
    Glembek, Ondrej
    Novotny, Ondvrej
    Matejka, Pavel
    INTERSPEECH 2020, 2020, : 761 - 765
  • [35] Cohort Selection for Text-dependent Speaker Verification Score Normalization
    Khemiri, Houssemeddine
    Petrovska-Delacretaz, Dijana
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 689 - 692
  • [36] Multi-Task Learning for Text-dependent Speaker Verification
    Chen, Nanxin
    Qian, Yanmin
    Yu, Kai
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 185 - 189
  • [37] Sub-band based text-dependent speaker verification
    Sivakumaran, P
    Ariyaeeinia, AM
    Loomes, MJ
    SPEECH COMMUNICATION, 2003, 41 (2-3) : 485 - 509
  • [38] Unsupervised Learning of HMM Topology for Text-dependent Speaker Verification
    Liu, Ming
    Huang, Thomas
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 921 - 924
  • [39] EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Chen, Liping
    Zhao, Yong
    Zhang, Shi-Xiong
    Li, Jie
    Ye, Guoli
    Soong, Frank
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5364 - 5368
  • [40] ON INSTANTANEOUS AND TRANSITIONAL SPECTRAL INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    BERNASCONI, C
    SPEECH COMMUNICATION, 1990, 9 (02) : 129 - 139