Pertinent Prosodic Features for Speaker Identification by Voice

被引:0
|
作者
Sayoud, Halim [1 ,2 ,3 ,4 ,5 ,6 ]
Ouamour, Siham [1 ]
机构
[1] USTHB Univ, Bab Ezzouar, Algeria
[2] USTHB Univ, FEI, Bab Ezzouar, Algeria
[3] IRIT, Toulouse, France
[4] LIA, Avignon, France
[5] ENST, Paris, France
[6] ILSP, Athens, Greece
关键词
LVQ; Prosodic Features; Security & Multimedia Applications; Speaker Identification; Speech Processing;
D O I
10.4018/jmcmc.2010040102
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Most existing systems of speaker recognition use "state of the art" acoustic features. However, many times one can only recognize a speaker by his or her prosodic features, especially by the accent. For this reason, the authors investigate some pertinent prosodic features that can be associated with other classic acoustic features, in order to improve the recognition accuracy. The authors have developed a new prosodic model using a modified LVQ (Learning Vector Quantization) algorithm, which is called MLVQ (Modified LVQ). This model is composed of three reduced prosodic features: the mean of the pitch, original duration, and low-frequency energy. Since these features are heterogeneous, a new optimized metric has been proposed that is called Optimized Distance for Heterogeneous Features (ODHEF). Tests of speaker identification are done on Arabic corpus because the NIST evaluations showed that speaker verification scores depend on the spoken language and that some of the worst scores were got for the Arabic language. Experimental results show good performances of the new prosodic approach.
引用
收藏
页码:18 / 33
页数:16
相关论文
共 50 条
  • [1] Robust prosodic features for speaker identification
    Carey, MJ
    Parris, ES
    LloydThomas, H
    Bennett, S
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1800 - 1803
  • [2] Improvement of speaker identification by combining prosodic features with acoustic features
    Zheng, R
    Zhang, SW
    Xu, B
    [J]. ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 569 - 576
  • [3] Prosodic and Voice Quality Features for Speaker Verification Over Coded Channel
    Polacky, Jozef
    Chmulik, Michal
    Jarina, Roman
    [J]. 2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 327 - 330
  • [4] Using Voice-quality Measurements with Prosodic and Spectral Features for Speaker Diarization
    Woubie, Abraham
    Luque, Jordi
    Hernando, Javier
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3100 - 3104
  • [5] Prosodic Features for Speaker Verification
    Mary, Leena
    Yegnanarayana, B.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 917 - 920
  • [6] Prosodic variables in speaker identification
    Dorta, Josefa
    Diaz, Chaxiraxi
    [J]. QUADERNS DE FILOLOGIA-ESTUDIS LINGUISTICS, 2014, 19 : 113 - 133
  • [7] Effect of Voice Features Cancellation in Speaker Identification System
    Mostafa, Alzharaa
    Soliman, Naglaa F.
    Abdalluh, Mohamoud
    Abd El-Samie, Fathi E.
    [J]. 2016 FOURTH INTERNATIONAL JAPAN-EGYPT CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (JEC-ECC), 2016, : 139 - 142
  • [8] Speaker overlap detection with prosodic features for speaker diarisation
    Zelenak, M.
    Hernando, J.
    [J]. IET SIGNAL PROCESSING, 2012, 6 (08) : 798 - 804
  • [9] Prosodic and Phonetic Features for Speaker Clustering in Speaker Diarization Systems
    Zibert, Janez
    Mihelic, France
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1040 - +
  • [10] A pertinent learning machine input feature for speaker discrimination by voice
    S. Ouamour
    H. Sayoud
    [J]. International Journal of Speech Technology, 2012, 15 (2) : 181 - 190