Drink and Speak: On the automatic classification of alcohol intoxication by acoustic, prosodic and text-based features

被引:0
|
作者
Bocklet, Tobias [1 ]
Riedhammer, Korbinian [1 ]
Noeth, Elmar [1 ]
机构
[1] Univ Erlangen Nurnberg, Chair Pattern Recognit, Erlangen, Germany
关键词
GMM; alcohol intoxication; system fusion; SPEECH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on the automatic detection of a person's blood level alcohol based on automatic speech processing approaches. We compare 5 different feature types with different ways of modeling. Experiments are based on the ALC corpus of IS2011 Speaker State Challenge. The classification task is restricted to the detection of a blood alcohol level above 0.5 parts per thousand. Three feature sets are based on spectral observations: MFCCs, PLPs, TRAPS. These are modeled by GMMs. Classification is either done by a Gaussian classifier or by SVMs. In the later case classification is based on GMM-based supervectors, i.e. concatenation of GMM mean vectors. A prosodic system extracts a 292-dimensional feature vector based on a voiced-unvoiced decision. A transcription-based system makes use of text transcriptions related to phoneme durations and textual structure. We compare the stand-alone performances of these systems and combine them on score level by logistic regression. The best stand-alone performance is the transcription-based system which outperforms the baseline by 4.8 % on the development set. A Combination on score level gave a huge boost when the spectral-based systems were added (73.6 %). This is a relative improvement of 12.7 % to the baseline. On the test-set we achieved an UA of 68.6 % which is a significant improvement of 4.1 % to the baseline system.
引用
收藏
页码:3220 / 3223
页数:4
相关论文
共 50 条
  • [1] Automatic Rating of Hoarseness by Text-based Cepstral and Prosodic Evaluation
    Haderlein, Tino
    Moers, Cornelia
    Moebius, Bernd
    Noeth, Elmar
    [J]. TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 573 - 580
  • [2] Does it Groove or Does it Stumble - Automatic Classification of Alcoholic Intoxication Using Prosodic Features
    Hoenig, Florian
    Batliner, Anton
    Noeth, Elmar
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3232 - 3235
  • [3] Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
    Haderlein, Tino
    Schwemmle, Cornelia
    Doellinger, Michael
    Matousek, Vaclav
    Ptok, Martin
    Noeth, Elmar
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [4] Automatic image and text-based description for colorectal polyps using BASIC classification
    Fonolla, Roger
    van der Zander, Quirine E. W.
    Schreuder, Ramon M.
    Subramaniam, Sharmila
    Bhandari, Pradeep
    Masclee, Ad A. M.
    Schoon, Erik J.
    van Der Sommen, Fons
    de With, Peter H. N.
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 121
  • [5] Deep Learning for Asphyxiated Infant Cry Classification Based on Acoustic Features and Weighted Prosodic Features
    Ji, Chunyan
    Xiao, Xueli
    Basodi, Sunitha
    Pan, Yi
    [J]. 2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 1233 - 1240
  • [6] AUTOMATIC PROSODIC EVENTS DETECTION USING SYLLABLE-BASED ACOUSTIC AND SYNTACTIC FEATURES
    Jeon, Je Hun
    Liu, Yang
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4565 - 4568
  • [7] An Automatic Voice Pleasantness Classification System based on Prosodic and Acoustic Patterns of Voice Preference
    Coelho, Luis
    Braga, Daniela
    Sales-Dias, Miguel
    Garcia-Mateo, Carmen
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2468 - +
  • [8] FUSION APPROACHES FOR EMOTION RECOGNITION FROM SPEECH USING ACOUSTIC AND TEXT-BASED FEATURES
    Pepino, Leonardo
    Riera, Pablo
    Ferrer, Luciana
    Gravano, Agustin
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6484 - 6488
  • [9] Age and Gender Classification using Fusion of Acoustic and Prosodic Features
    Meinedo, Hugo
    Trancoso, Isabel
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2822 - 2825
  • [10] Hierarchical approaches to Text-based Offense Classification
    Choi, Jay
    Kilmer, David
    Mueller-Smith, Michael
    Taheri, Sema A.
    [J]. SCIENCE ADVANCES, 2023, 9 (09)