MultiMET: A Multimodal Dataset for Metaphor Understanding

被引:0
|
作者
Zhang, Dongyu [1 ]
Zhang, Minghao [1 ]
Zhang, Heting [1 ]
Yang, Liang [2 ]
Lin, Hongfei [2 ]
机构
[1] Dalian Univ Technol, Sch Software, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Metaphor involves not only a linguistic phenomenon, but also a cognitive phenomenon structuring human thought, which makes understanding it challenging. As a means of cognition, metaphor is rendered by more than texts alone, and multimodal information in which vision/audio content is integrated with the text can play an important role in expressing and understanding metaphor. However, previous metaphor processing and understanding has focused on texts, partly due to the unavailability of large-scale datasets with ground truth labels of multimodal metaphor. In this paper, we introduce MultiMET, a novel multimodal metaphor dataset to facilitate understanding metaphorical information from multimodal text and image. It contains 10,437 text-image pairs from a range of sources with multimodal annotations of the occurrence of metaphors, domain relations, sentiments metaphors convey, and author intents. MultiMET opens the door to automatic metaphor understanding by investigating multimodal cues and their interplay. Moreover, we propose a range of strong baselines and show the importance of combining multimodal cues for metaphor understanding. MultiMET will be released publicly for research.
引用
收藏
页码:3214 / 3225
页数:12
相关论文
共 50 条
  • [1] Multimodal Metaphor
    Stoeeckl, Hartmut
    [J]. VISUAL COMMUNICATION, 2012, 11 (03) : 383 - 388
  • [2] Multimodal Metaphor
    Gibbons, Alison
    [J]. LANGUAGE AND LITERATURE, 2011, 20 (01) : 78 - 81
  • [3] Multimodal Metaphor
    Johnson, Mark
    [J]. JOURNAL OF PRAGMATICS, 2010, 42 (10) : 2848 - 2850
  • [4] Memeplate: A Chinese Multimodal Dataset for Humor Understanding in Meme Templates
    Li, Zefeng
    Lin, Hongfei
    Yang, Liang
    Xu, Bo
    Zhang, Shaowu
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 527 - 538
  • [5] UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
    Hasan, Md Kamrul
    Rahman, Wasifur
    Zadeh, Amir
    Zhong, Jianyuan
    Tanveer, Md Iftekhar
    Morency, Louis-Philippe
    Hoque, Mohammed
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2046 - 2056
  • [6] From the computer metaphor to multimodal metaphor
    Gabriel Rodriguez, Fernando
    [J]. DESIGNIS, 2021, (35): : 165 - 172
  • [7] Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos
    Lee, Dong Won
    Ahuja, Chaitanya
    Liang, Paul Pu
    Natu, Sanika
    Morency, Louis-Philippe
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20030 - 20041
  • [8] Multimodal blends and metaphor
    Bernardo, Sandra
    Rejala, Ruth
    Barbosa, Tamires
    Gustavo, Luanda
    [J]. ANTARES-LETRAS E HUMANIDADES, 2015, 7 (14): : 156 - 186
  • [9] ON UNDERSTANDING METAPHOR
    Bazzanella, Carla
    Morra, Lucia
    [J]. LINGUE E LINGUAGGIO, 2007, 6 (01) : 65 - 84
  • [10] Multimodal Metaphor and Metonymy in Advertising
    Bolognesi, Marianna
    [J]. LANGUAGE AND COGNITION, 2018, 10 (03) : 552 - 559