Pitch prosodic feature for speaker verification

被引:0
|
作者
Xu, Dongxing [1 ]
Dai, Beiqian [1 ]
Xu, Minqiang [1 ]
liu, Qingsong [1 ]
机构
[1] Univ Sci & Technol China, Dept Elect Sci & Technol, Hefei 230027, Anhui, Peoples R China
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-level information, such as speaking rate, idiolect, artificial feature, though proved useful for speaker verification and complementary to short-term spectral features, is difficult to extract from speech. Pitch is the frequency of vibration of vocal cord and pitch contour is its variation as a function of time, conveying much speaker specific information. We propose a method to exact pitch prosodic feature (PPF) from pitch contour by means of wavelet analysis. Approximation coefficients and detail coefficients from multiscale analysis with Harr wavelet compose prosodic feature, which describes both general shape and dynamics of pitch contour. Experimental results on NIST06 SRE 8side-1side task show that we can achieve an equal error rate (EER) of 18.42% with PPF, a 23.66% relative reduction comparing with pitch baseline system. Simple fusion of the prosodic system with the MFCC system can further reduce the EER by 5% and minimum DCF by 7% comparing with single MFCC system.
引用
收藏
页码:388 / 392
页数:5
相关论文
共 50 条
  • [1] Prosodic Features for Speaker Verification
    Mary, Leena
    Yegnanarayana, B.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 917 - 920
  • [2] RECENT PROGRESS IN PROSODIC SPEAKER VERIFICATION
    Kockmann, Marcel
    Ferrer, Luciana
    Burget, Lukas
    Shriberg, Elizabeth
    Cernocky, Jan 'Honza'
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4556 - 4559
  • [3] SPEAKER VERIFICATION USING VARIOUS PROSODIC KERNELS
    Drgas, Szymon
    Zamorski, Dariusz
    Dabrowski, Adam
    [J]. SPA 2011: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2011, : 169 - 173
  • [4] Pitch synchronous based feature extraction for noise-robust speaker verification
    Gong Wei-Guo
    Yang Li-Ping
    Chen Di
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 295 - 298
  • [5] Effect of VoIP on Prosodic Features for Speaker Verification
    Cherian, Athira Jess
    Antony, Anil P.
    Mary, Leena
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 487 - 490
  • [6] Modeling prosodic feature sequences for speaker recognition
    Shriberg, E
    Ferrer, L
    Kajarekar, S
    Venkataraman, A
    Stolcke, A
    [J]. SPEECH COMMUNICATION, 2005, 46 (3-4) : 455 - 472
  • [7] A Text-Constrained Prosodic System for Speaker Verification
    Shriberg, Elizabeth
    Ferrer, Luciana
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2636 - +
  • [8] iVector Fusion of Prosodic and Cepstral Features for Speaker Verification
    Kockmann, Marcel
    Ferrer, Luciana
    Burget, Lukas
    Cernocky, Jan Honza
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 272 - 275
  • [9] Robust Speaker Verification with Principal Pitch Components
    Robert M. Nickel
    Sachin P. Oswal
    Ananth N. Iyer
    [J]. International Journal of Speech Technology, 2005, 8 (4) : 323 - 339
  • [10] Robust Speaker Verification with Principal Pitch Components
    Nickel, Robert M.
    Oswal, Sachin P.
    Iyer, Ananth N.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2005, 8 (04) : 323 - 339