Pertinent Prosodic Features for Speaker Identification by Voice

Cited by: 0

Authors:

Sayoud, Halim [1,2,3,4,5,6]

Ouamour, Siham [1]

Affiliations:

[1] USTHB Univ, Bab Ezzouar, Algeria

[2] USTHB Univ, FEI, Bab Ezzouar, Algeria

[3] IRIT, Toulouse, France

[4] LIA, Avignon, France

[5] ENST, Paris, France

[6] ILSP, Athens, Greece

Source:

INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS | 2010 / Vol. 2 / No. 2

Keywords:

LVQ; Prosodic Features; Security & Multimedia Applications; Speaker Identification; Speech Processing;

DOI:

10.4018/jmcmc.2010040102

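Chinese Library Classification (CLC) number:

TN [Electronic technology, communication technology];

Discipline classification code:

0809;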

Abstract:

Most existing speaker recognition systems use "state of the art" acoustic features. However, a speaker can often be recognized only by his or her prosodic features, especially the accent. For this reason, the authors investigate some pertinent prosodic features that can be combined with other classic acoustic features in order to improve recognition accuracy. The authors have developed a new prosodic model using a modified LVQ (Learning Vector Quantization) algorithm, called MLVQ (Modified LVQ). This model is composed of three reduced prosodic features: the mean of the pitch, original duration, and low-frequency energy. Since these features are heterogeneous, a new optimized metric is proposed, called Optimized Distance for Heterogeneous Features (ODHEF). Speaker identification tests are carried out on an Arabic corpus because the NIST evaluations showed that speaker verification scores depend on the spoken language and that some of the worst scores were obtained for the Arabic language. Experimental results show good performance of the new prosodic approach.
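
The abstract does not give the details of MLVQ or ODHEF, so neither can be reproduced from this record. As a rough illustration of the general idea only, the sketch below uses textbook LVQ1 with one prototype per speaker and a per-feature scaled Euclidean distance as a stand-in for a metric over heterogeneous features. The function names, the scaling by standard deviation, the learning-rate schedule, and the toy feature values (mean pitch in Hz, duration in s, low-frequency energy in dB) are all assumptions made for this example, not the authors' method or data.

```python
import math
import random

def feature_scales(samples):
    """Per-feature standard deviation (1.0 if a feature is constant)."""
    n = len(samples)
    scales = []
    for column in zip(*samples):
        mean = sum(column) / n
        std = math.sqrt(sum((v - mean) ** 2 for v in column) / n)
        scales.append(std if std > 0 else 1.0)
    return scales

def scaled_distance(x, w, scales):
    """Euclidean distance after dividing each feature difference by its scale.
    Assumed stand-in for a heterogeneous-feature metric; this is NOT ODHEF."""
    return math.sqrt(sum(((a - b) / s) ** 2 for a, b, s in zip(x, w, scales)))

def train_lvq1(samples, labels, epochs=30, lr=0.1, seed=0):
    """Textbook LVQ1 (not MLVQ): one prototype per speaker, started at the class mean."""
    rng = random.Random(seed)
    scales = feature_scales(samples)
    prototypes = {}
    for speaker in sorted(set(labels)):
        rows = [x for x, y in zip(samples, labels) if y == speaker]
        prototypes[speaker] = [sum(c) / len(rows) for c in zip(*rows)]
    order = list(range(len(samples)))
    for epoch in range(epochs):
        rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
        rng.shuffle(order)
        for i in order:
            x, y = samples[i], labels[i]
            winner = min(prototypes,
                         key=lambda s: scaled_distance(x, prototypes[s], scales))
            # Attract the winning prototype if its label is correct, repel it otherwise.
            sign = 1.0 if winner == y else -1.0
            prototypes[winner] = [w + sign * rate * (a - w)
                                  for a, w in zip(x, prototypes[winner])]
    return prototypes, scales

def identify(x, prototypes, scales):
    """Return the speaker whose prototype is nearest under the scaled distance."""
    return min(prototypes, key=lambda s: scaled_distance(x, prototypes[s], scales))

# Made-up toy vectors: (mean pitch in Hz, duration in s, low-frequency energy in dB).
samples = [(110.0, 1.9, 62.0), (115.0, 2.1, 60.0), (108.0, 2.0, 63.0),
           (205.0, 1.4, 55.0), (210.0, 1.5, 54.0), (198.0, 1.3, 56.0)]
labels = ["spk_a"] * 3 + ["spk_b"] * 3

prototypes, scales = train_lvq1(samples, labels)
print(identify((112.0, 2.0, 61.0), prototypes, scales))
```

Scaling each feature by its own spread keeps the pitch values, which are numerically much larger than the durations, from dominating the distance; an optimized metric such as the one the abstract describes would presumably choose this weighting more carefully.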


Pages: 18-33

Page count: 16
