Pitch prosodic feature for speaker verification

被引：0

作者：

Xu, Dongxing ^{[1
]}

Dai, Beiqian ^{[1
]}

Xu, Minqiang ^{[1
]}

liu, Qingsong ^{[1
]}

机构：

[1] Univ Sci & Technol China, Dept Elect Sci & Technol, Hefei 230027, Anhui, Peoples R China

来源：

ADVANCING SCIENCE THROUGH COMPUTATION | 2008年

关键词：

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

High-level information, such as speaking rate, idiolect, artificial feature, though proved useful for speaker verification and complementary to short-term spectral features, is difficult to extract from speech. Pitch is the frequency of vibration of vocal cord and pitch contour is its variation as a function of time, conveying much speaker specific information. We propose a method to exact pitch prosodic feature (PPF) from pitch contour by means of wavelet analysis. Approximation coefficients and detail coefficients from multiscale analysis with Harr wavelet compose prosodic feature, which describes both general shape and dynamics of pitch contour. Experimental results on NIST06 SRE 8side-1side task show that we can achieve an equal error rate (EER) of 18.42% with PPF, a 23.66% relative reduction comparing with pitch baseline system. Simple fusion of the prosodic system with the MFCC system can further reduce the EER by 5% and minimum DCF by 7% comparing with single MFCC system.

引用

页码：388 / 392

页数：5

共 50 条

[1] Prosodic Features for Speaker Verification
Mary, Leena
Yegnanarayana, B.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 917 - 920
[2] RECENT PROGRESS IN PROSODIC SPEAKER VERIFICATION
Kockmann, Marcel
Ferrer, Luciana
Burget, Lukas
Shriberg, Elizabeth
Cernocky, Jan 'Honza'
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4556 - 4559
[3] SPEAKER VERIFICATION USING VARIOUS PROSODIC KERNELS
Drgas, Szymon
Zamorski, Dariusz
Dabrowski, Adam
[J]. SPA 2011: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2011, : 169 - 173
[4] Pitch synchronous based feature extraction for noise-robust speaker verification
Gong Wei-Guo
Yang Li-Ping
Chen Di
[J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 295 - 298
[5] Effect of VoIP on Prosodic Features for Speaker Verification
Cherian, Athira Jess
Antony, Anil P.
Mary, Leena
[J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 487 - 490
[6] Modeling prosodic feature sequences for speaker recognition
Shriberg, E
Ferrer, L
Kajarekar, S
Venkataraman, A
Stolcke, A
[J]. SPEECH COMMUNICATION, 2005, 46 (3-4) : 455 - 472
[7] A Text-Constrained Prosodic System for Speaker Verification
Shriberg, Elizabeth
Ferrer, Luciana
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2636 - +
[8] iVector Fusion of Prosodic and Cepstral Features for Speaker Verification
Kockmann, Marcel
Ferrer, Luciana
Burget, Lukas
Cernocky, Jan Honza
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 272 - 275
[9] Robust Speaker Verification with Principal Pitch Components
Robert M. Nickel
Sachin P. Oswal
Ananth N. Iyer
[J]. International Journal of Speech Technology, 2005, 8 (4) : 323 - 339
[10] Robust Speaker Verification with Principal Pitch Components
Nickel, Robert M.
Oswal, Sachin P.
Iyer, Ananth N.
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2005, 8 (04) : 323 - 339

← 1 2 3 4 5 →