Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender

Authors
Weninger, Felix [1]
Marchi, Erik [1]
Schuller, Bjoern [1]
Affiliations
[1] Tech Univ Munich, Inst Human Machine Commun, D-80290 Munich, Germany
Keywords
ACCIDENT
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We address the fully automatic recognition of intoxication, sleepiness, age and gender from speech in medium-term observation intervals of up to several minutes. Because these speaker states and traits are medium-term or long-term, as opposed to short-term states such as emotion, it is possible to collect cumulative evidence in the form of utterance-level decisions; we show that fusing these decisions along the time axis yields increasingly accurate decisions. In extensive test runs on three official INTERSPEECH Challenge corpora, we show that the average recall can be improved by up to 5%, 6%, 10% and 11% absolute by longer-term observation of speaker sleepiness, gender, intoxication, and age, respectively, compared to the accuracy of a decision from a single utterance.
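The idea of fusing utterance-level decisions along the time axis can be illustrated with a minimal sketch. This record does not state which fusion rule the authors use, so the running majority vote below is an assumption chosen for illustration; the function name and the class labels are hypothetical as well.

```python
from collections import Counter
from typing import Hashable, List, Sequence


def cumulative_majority_vote(decisions: Sequence[Hashable]) -> List[Hashable]:
    """Return the fused decision after each utterance in the sequence.

    Assumption: fusion is a plain running majority vote over all
    utterance-level decisions seen so far. On a tie, the previous fused
    decision is kept if it is still among the leaders.
    """
    counts: Counter = Counter()
    fused: List[Hashable] = []
    for label in decisions:
        counts[label] += 1
        top = max(counts.values())
        if fused and counts[fused[-1]] == top:
            # Previous fused decision is still (jointly) in the lead.
            fused.append(fused[-1])
        else:
            # Otherwise take the first label that reached the top count.
            fused.append(next(l for l, c in counts.items() if c == top))
    return fused


# Hypothetical per-utterance classifier outputs for one speaker.
utterance_decisions = ["sober", "intoxicated", "intoxicated", "sober", "intoxicated"]
fused_decisions = cumulative_majority_vote(utterance_decisions)
print(fused_decisions)
```

Each entry of the returned list is the decision available after observing that many utterances, so a single noisy utterance-level decision has less influence the longer the speaker is observed.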
Pages: 1158-1161
Page count: 4
Related papers
17 in total
  • [1] Medium-term speaker states-A review on intoxication, sleepiness and the first challenge
    Schuller, Bjoern
    Steidl, Stefan
    Batliner, Anton
    Schiele, Florian
    Krajewski, Jarek
    Weninger, Felix
    Eyben, Florian
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (02): : 346 - 374
  • [2] Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks
    Markitantov, Maxim
    Verkholyak, Oxana
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 327 - 336
  • [3] COMBINING REGRESSION AND CLASSIFICATION METHODS FOR IMPROVING AUTOMATIC SPEAKER AGE RECOGNITION
    van Heerden, Charl
    Barnard, Etienne
    Davel, Marelie
    van der Walt, Christiaan
    van Dyk, Ewald
    Feld, Michael
    Mueller, Christian
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5174 - 5177
  • [4] Effect of age and gender on sleepiness and sleep traits in the general population of Lausanne - HypnoLaus study
    Luca, G.
    Andries, D.
    Tobback, N.
    Haba-Rubio, J.
    Heinzer, R.
    Tafti, M.
    JOURNAL OF SLEEP RESEARCH, 2012, 21 : 366 - 367
  • [5] A Paralinguistic Approach To Speaker Diarisation Using Age, Gender, Voice Likability and Personality Traits
    Zhang, Yue
    Weninger, Felix
    Liu, Boqing
    Schmitt, Maximilian
    Eyben, Florian
    Schuller, Bjorn
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 387 - 392
  • [6] Age and Gender Recognition of a Speaker from Short-duration Phone Conversations
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 751 - 754
  • [7] Automatic Speaker Age and Gender Recognition in the Car for Tailoring Dialog and Mobile Services
    Feld, Michael
    Burkhardt, Felix
    Mueller, Christian
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2838 - 2841
  • [8] Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition
    Shepstone, Sven Ewan
    Tang, Zheng-Hua
    Jensen, Soren Holdt
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2826 - 2830
  • [9] Automatic speaker age and gender recognition using acoustic and prosodic level information fusion
    Li, Ming
    Han, Kyu J.
    Narayanan, Shrikanth
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (01): : 151 - 167
  • [10] Could speaker, gender or age awareness be beneficial in speech-based emotion recognition?
    Sidorov, Maxim
    Schmitt, Alexander
    Semenkin, Eugene
    Minker, Wolfgang
    Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, 2016, : 61 - 68