Speaker Modeling Using Emotional Speech for More Robust Speaker Identification

被引:1
|
作者
Milosevic, M. [1 ]
Nedeljkovic, Z. [1 ]
Glavitsch, U. [2 ]
Durovic, Z. [1 ]
机构
[1] Univ Belgrade, Sch Elect Engn, Belgrade 11000, Serbia
[2] GlavitschEggler Software, CH-5400 Baden, Switzerland
关键词
emotion recognition; Gaussian mixture models; i-vectors; human voice; identification of persons; speaker recognition; RECOGNITION; MACHINES; VECTOR;
D O I
10.1134/S1064226919110184
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic identity recognition in fast, reliable and non-intrusive way is one of the most challenging topics in digital world of today. A possible approach to identity recognition is the identification by voice. Characteristics of speech relevant for automatic speaker recognition can be affected by external factors such as noise and channel distortions, but also by speaker-specific conditions-emotional or health states. The improvement of a speaker recognition system by different model training strategies are addressed in this paper in order to obtain the best performance of the system with only a limited amount of neutral and emotional speech data. The models adopted are a Gaussian Mixture Model and i-vectors whose inputs are Mel Frequency Cepstral Coefficients, and the experiments have been conducted on the Russian Language Affective speech database. The results show that the appropriate use of emotional speech in speaker model training improves the robustness of a speaker recognition system - both when tested on neutral and emotional speech.
引用
收藏
页码:1256 / 1265
页数:10
相关论文
共 50 条
  • [1] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    M. Milošević
    Ž. Nedeljković
    U. Glavitsch
    Ž. Đurović
    [J]. Journal of Communications Technology and Electronics, 2019, 64 : 1256 - 1265
  • [2] Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech
    Sarma, Biswajit Dev
    Das, Rohan Kumar
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 610 - 615
  • [3] Speaker Identification Using Robust Speech Detection and Neural Network
    Ouzounov, Atanas
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2007, 7 (03) : 48 - 54
  • [4] Speaker Identification using Whispered Speech
    Jawarkar, Naresh P.
    Holambe, Raghunath S.
    Basu, Tapan Kumar
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
  • [5] Emotional Speech Clustering based Robust Speaker Recognition System
    Li, Dongdong
    Yang, Yingchun
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4576 - +
  • [6] Noise robust speaker identification for spontaneous Arabic speech
    Graciarena, Martin
    Kajarekar, Sachin
    Stolcke, Andreas
    Shriberg, Elizabeth
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 245 - +
  • [7] An Integrated Approach to Robust Speaker Identification and Speech Recognition
    Kwan, C.
    Yin, J.
    Ayhan, B.
    Chu, S.
    Liu, X.
    Puckett, K.
    Zhao, Y.
    Ho, K. C.
    Kruger, M.
    Sityar, I.
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1635 - +
  • [8] Speaker identification from emotional and noisy speech using learned voice segregation and speech VGG
    Hamsa, Shibani
    Shahin, Ismail
    Iraqi, Youssef
    Damiani, Ernesto
    Nassif, Ali Bou
    Werghi, Naoufel
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
  • [9] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
    Shih, Po-Yi
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Lin, Yuan-Ning
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
  • [10] Speaker Identification Enhancement Using Emotional Features
    Jabnoun, Jihed
    Zrigui, Ahmed
    Slimi, Anwer
    Ringeval, Fabien
    Schwab, Didier
    Zrigui, Mounir
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 526 - 539