Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms

被引:6
|
作者
Patro, Badri N. [1 ,3 ]
Lunayach, Mayank [1 ]
Srivastava, Deepankar [1 ]
Sarvesh, Sarvesh [1 ]
Singh, Hunar [1 ]
Namboodiri, Vinay P. [2 ]
机构
[1] IIT Kanpur, Kanpur, Uttar Pradesh, India
[2] Univ Bath, Bath, Avon, England
[3] Google, Mountain View, CA 94043 USA
关键词
D O I
10.1109/WACV48630.2021.00062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A great number of situational comedies (sitcoms) are being regularly made and the task of adding laughter tracks to these is a critical task. Providing an ability to be able to predict whether something will be humorous to the audience is also crucial. In this project, we aim to automate this task. Towards doing so, we annotate an existing sitcom ('Big Bang Theory') and use the laughter cues present to obtain a manual annotation for this show. We provide detailed analysis for the dataset design and further evaluate various state of the art baselines for solving this task. We observe that existing LSTM and BERT based networks on the text alone do not perform as well as joint text and video or only video-based networks. Moreover, it is challenging to ascertain that the words attended to while predicting laughter are indeed humorous. Our dataset and analysis provided through this paper is a valuable resource towards solving this interesting semantic and practical task. As an additional contribution, we have developed a novel model for solving this task that is a multi-modal self-attention based model that outperforms currently prevalent models for solving this task. The project page for our paper is https://delta-lab-iitk.github.io/Multimodal-Humor-Dataser/.
引用
收藏
页码:576 / 585
页数:10
相关论文
共 50 条
  • [1] PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS
    Bertero, Dario
    Fung, Pascale
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5780 - 5784
  • [2] MULTIMODAL DEEP NEURAL NETS FOR DETECTING HUMOR IN TV SITCOMS
    Bertero, Dario
    Fung, Pascale
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 383 - 390
  • [3] IN PRAISE OF CANNED LAUGHTER + TELEVISION SITCOMS
    BARTH, J
    [J]. FILM COMMENT, 1986, 22 (02) : 73 - 75
  • [4] A humor typology to identify humor styles used in sitcoms
    Juckel, Jennifer
    Bellman, Steven
    Varan, Duane
    [J]. HUMOR-INTERNATIONAL JOURNAL OF HUMOR RESEARCH, 2016, 29 (04): : 583 - 603
  • [5] Memeplate: A Chinese Multimodal Dataset for Humor Understanding in Meme Templates
    Li, Zefeng
    Lin, Hongfei
    Yang, Liang
    Xu, Bo
    Zhang, Shaowu
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 527 - 538
  • [6] UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
    Hasan, Md Kamrul
    Rahman, Wasifur
    Zadeh, Amir
    Zhong, Jianyuan
    Tanveer, Md Iftekhar
    Morency, Louis-Philippe
    Hoque, Mohammed
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2046 - 2056
  • [7] Easy laughter, difficult laughter (Humor)
    Mizzau, M
    [J]. AUT AUT, 1997, (282): : 95 - 101
  • [8] A Study of Linguistic Humor Translation of British Sitcoms
    孟瑾
    [J]. 海外英语, 2015, (04) : 103 - 104
  • [9] THE SOCIOLOGY OF HUMOR AND LAUGHTER
    ZIJDERVELD, AC
    [J]. CURRENT SOCIOLOGY-SOCIOLOGIE CONTEMPORAINE, 1983, 31 (03): : 1 - 103
  • [10] CHILDRENS HUMOR AND LAUGHTER
    CHAPMAN, T
    [J]. BULLETIN OF THE BRITISH PSYCHOLOGICAL SOCIETY, 1983, 36 (FEB): : A26 - A26