Automatic discrimination between laughter and speech

Cited by: 101
Authors
Truong, Khiet P. [1 ]
van Leeuwen, David A. [1 ]
Affiliations
[1] TNO Human Factors, Dept Human Interfaces, NL-3769 ZG Soesterberg, Netherlands
Keywords
automatic detection laughter; automatic detection emotion;
DOI
10.1016/j.specom.2007.01.001
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Emotions can be recognized by audible paralinguistic cues in speech. Detecting these cues, which can include laughter, a trembling voice, coughs, and changes in the intonation contour, can reveal information about the speaker's state and emotion. This paper describes the development of a gender-independent laugh detector with the aim of enabling automatic emotion recognition. Different types of features (spectral, prosodic) for laughter detection were investigated using different classification techniques (Gaussian Mixture Models, Support Vector Machines, Multi Layer Perceptron) often used in language and speaker recognition. Classification experiments were carried out with short pre-segmented speech and laughter segments (with a mean duration of approximately 2 s) extracted from the ICSI Meeting Recorder Corpus. Equal error rates of around 3% were obtained when tested on speaker-independent speech data. We found that a fusion between classifiers based on Gaussian Mixture Models and classifiers based on Support Vector Machines increases discriminative power. We also found that a fusion between classifiers that use spectral features and classifiers that use prosodic information usually increases the performance for discrimination between laughter and speech. Our acoustic measurements showed differences between laughter and speech in mean pitch and in the ratio of the durations of unvoiced to voiced portions, which indicates that these prosodic features are indeed useful for discrimination between laughter and speech. © 2007 Published by Elsevier B.V.
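The abstract reports results as equal error rates (EER) and describes fusing the scores of different classifiers. As a minimal sketch of what those two terms mean, the Python below computes an approximate EER from detection scores and combines two systems with a weighted sum. The function names, the linear fusion rule, and all score values are hypothetical illustrations; the abstract does not state how the authors fused their classifiers or computed the EER.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER by sweeping a threshold over all observed scores.

    target_scores: detector scores for true laughter segments.
    nontarget_scores: detector scores for speech segments.
    Returns the mean of the false-rejection and false-acceptance rates
    at the threshold where the two rates are closest.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = None, None
    for t in thresholds:
        # A segment is accepted as laughter when its score is >= t.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(frr - far)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (frr + far) / 2
    return eer


def fuse(scores_a, scores_b, weight=0.5):
    """Score-level fusion as a weighted sum (an assumed, illustrative rule)."""
    return [weight * a + (1 - weight) * b for a, b in zip(scores_a, scores_b)]


# Hypothetical toy scores for two systems on the same four segments.
laugh_a, speech_a = [0.9, 0.4], [0.5, 0.1]   # e.g. a spectral-feature system
laugh_b, speech_b = [0.3, 0.8], [0.2, 0.5]   # e.g. a prosodic-feature system

eer_a = equal_error_rate(laugh_a, speech_a)
eer_b = equal_error_rate(laugh_b, speech_b)
eer_fused = equal_error_rate(fuse(laugh_a, laugh_b), fuse(speech_a, speech_b))
```

The toy scores are constructed so that each system misranks a different segment, which is why the fused scores separate the two classes better than either system alone. Real detection scores would typically need calibration or normalization before being summed.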
Pages: 144-158
Number of pages: 15
Related papers
50 in total
  • [1] Audiovisual discrimination between laughter and speech
    Petridis, Stavros
    Pantic, Maja
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5117 - 5120
  • [2] AUTOMATIC AND PERCEPTUAL DISCRIMINATION BETWEEN DYSARTHRIA, APRAXIA OF SPEECH, AND NEUROTYPICAL SPEECH
    Kodrasi, Ina
    Pernon, Michaela
    Laganaro, Marina
    Bourlard, Herve
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7308 - 7312
  • [3] Automatic discrimination of laughter using distributed sEMG
    Cosentino, S.
    Sessa, S.
    Kong, W.
    Zhang, D.
    Takanishi, A.
    Bianchi-Berthouze, N.
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 691 - 697
  • [4] Audiovisual Discrimination Between Speech and Laughter: Why and When Visual Information Might Help
    Petridis, Stavros
    Pantic, Maja
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) : 216 - 234
  • [5] Automatic discrimination of several types of speech pathologies
    Sztaho, David
    Kiss, Gabor
    Tulics, Miklos Gabriel
    Hajduska-Der, Balint
    Vicsi, Klara
    2019 10TH INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2019,
  • [6] Automatic Laughter Detection in Spontaneous Speech Using GMM-SVM Method
    Neuberger, Tilda
    Beke, Andras
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 113 - 120
  • [7] Detecting laughter in spontaneous speech by constructing laughter bouts
    Li, Yan-Xiong
    He, Qian-Hua
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (03) : 211 - 225
  • [8] Automatic Discrimination between Cognates and Borrowings
    Ciobanu, Alina Maria
    Dinu, Liviu P.
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 431 - 437
  • [9] OATHS AND LAUGHTER AND INDECENT SPEECH
    GRAY, P
    LANGUAGE & COMMUNICATION, 1993, 13 (04) : 311 - 325
  • [10] LAUGHTER PUNCTUATES SPEECH - LINGUISTIC, SOCIAL AND GENDER CONTEXTS OF LAUGHTER
    PROVINE, RR
    ETHOLOGY, 1993, 95 (04) : 291 - 298