Memeplate: A Chinese Multimodal Dataset for Humor Understanding in Meme Templates

被引：0

作者：

Li, Zefeng ^{[1
]}

Lin, Hongfei ^{[1
]}

Yang, Liang ^{[1
]}

Xu, Bo ^{[1
]}

Zhang, Shaowu ^{[1
]}

机构：

[1] Dalian Univ Technol, Dalian 116024, Peoples R China

来源：

NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I | 2022年 / 13551卷

基金：

中国国家自然科学基金;

关键词：

Multimodality; Sentiment analysis; Humor recognition;

D O I：

10.1007/978-3-031-17120-8_41

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Humor plays an important role in human communication. Besides language, multimodal information is also of great significance in humor expression and understanding, which promotes the development of multimodal humor research. However, in existing datasets, images and text often have a one-to-one relationship, making it difficult to control image modality variables. It causes the low correlation and low enhancement between the two modalities in humor recognition tasks. Moreover, with the development of Vision Transformers (ViTs), the generalization ability of visual models has been greatly enhanced. Using ViTs alone can achieve impressive performance, but is difficult to explain. In this paper, we introduce Memeplate (Our dataset is available at https:// github.com/chineselzf/memeplate.), a novel multimodal humor dataset containing 203 templates, 5,184 memes and manually annotated humor levels. The template transfers images and text into a one-to-many relationship, which can make it easier for researchers to cut through the linguistic lens to multimodal humor. And it provides examples closer to human behavior for generation research. In addition, we provide multiple baseline results on the humor recognition task, which demonstrate the effectiveness of our control over image modality and the importance of introducing multimodal cues.

引用

页码：527 / 538

页数：12

共 31 条

[1] UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
Hasan, Md Kamrul
Rahman, Wasifur
Zadeh, Amir
Zhong, Jianyuan
Tanveer, Md Iftekhar
Morency, Louis-Philippe
Hoque, Mohammed
[J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2046 - 2056
[2] MET-Meme: a Multimodal Meme Dataset Rich in Metaphors
Xu, Bo
Li, Tingting
Zheng, Junzhe
Naseriparsa, Mehdi
Zhao, Zhehuan
Lin, Hongfei
Xia, Feng
[J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2887 - 2899
[3] Humor Knowledge Enriched Transformer for Understanding Multimodal Humor
Hasan, Md Kamrul
Lee, Sangwu
Rahman, Wasifur
Zadeh, Amir
Mihalcea, Rada
Morency, Louis-Philippe
Hoque, Ehsan
[J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12972 - 12980
[4] Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms
Patro, Badri N.
Lunayach, Mayank
Srivastava, Deepankar
Sarvesh, Sarvesh
Singh, Hunar
Namboodiri, Vinay P.
[J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 576 - 585
[5] "Cats be outside, how about meow": Multimodal humor and creativity in an internet meme
Vasquez, Camilla
Aslan, Erhan
[J]. JOURNAL OF PRAGMATICS, 2021, 171 : 101 - 117
[6] MultiMET: A Multimodal Dataset for Metaphor Understanding
Zhang, Dongyu
Zhang, Minghao
Zhang, Heting
Yang, Liang
Lin, Hongfei
[J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3214 - 3225
[7] Chinese Whispers: A Multimodal Dataset for Embodied Language Grounding
Kontogiorgos, Dimosthenis
Sibirtseva, Elena
Gustafson, Joakim
[J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 743 - 749
[8] Telling the Whole Story: A Manually Annotated Chinese Dataset for the Analysis of Humor in Jokes
Zhang, Dongyu
Zhang, Heting
Liu, Xikai
Lin, Hongfei
Xia, Feng
[J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6402 - 6407
[9] Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos
Lee, Dong Won
Ahuja, Chaitanya
Liang, Paul Pu
Natu, Sanika
Morency, Louis-Philippe
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20030 - 20041
[10] A Large-Scale Chinese Multimodal NER Dataset with Speech Clues
Sui, Dianbo
Tian, Zhengkun
Chen, Yubo
Liu, Kang
Zhao, Jun
[J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 2807 - 2818

← 1 2 3 4 →