Memeplate: A Chinese Multimodal Dataset for Humor Understanding in Meme Templates

Authors
Li, Zefeng [1]
Lin, Hongfei [1]
Yang, Liang [1]
Xu, Bo [1]
Zhang, Shaowu [1]
Affiliations
[1] Dalian University of Technology, Dalian 116024, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Multimodality; Sentiment analysis; Humor recognition
DOI
10.1007/978-3-031-17120-8_41
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Humor plays an important role in human communication. Besides language, multimodal information is also of great significance in expressing and understanding humor, which has promoted the development of multimodal humor research. However, in existing datasets, images and text often have a one-to-one relationship, making it difficult to control the image modality as a variable. This leads to low correlation and weak mutual enhancement between the two modalities in humor recognition tasks. Moreover, with the development of Vision Transformers (ViTs), the generalization ability of visual models has been greatly enhanced. Using ViTs alone can achieve impressive performance, but the results are difficult to explain. In this paper, we introduce Memeplate (our dataset is available at https://github.com/chineselzf/memeplate), a novel multimodal humor dataset containing 203 templates, 5,184 memes, and manually annotated humor levels. The templates turn the relationship between images and text into a one-to-many relationship, which makes it easier for researchers to approach multimodal humor from a linguistic perspective. The dataset also provides examples closer to human behavior for generation research. In addition, we provide multiple baseline results on the humor recognition task, which demonstrate the effectiveness of our control over the image modality and the importance of introducing multimodal cues.
Pages: 527-538
Page count: 12