Building a Multimodal Entity Linking Dataset From Tweets

被引:0
|
作者
Adjali, Omar [1 ]
Besancon, Romaric [1 ]
Ferret, Olivier [1 ]
Le Borgne, Herve [1 ]
Grau, Brigitte [2 ]
机构
[1] CEA, LIST, F-91191 Gif Sur Yvette, France
[2] Univ Paris Saclay, LIMSI, CNRS, F-91405 Orsay, France
关键词
Entity linking; social media; multimodality; multimedia entity linking;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The task of Entity linking, which aims at associating an entity mention with a unique entity in a knowledge base (KB), is useful for advanced Information Extraction tasks such as relation extraction or event detection. Most of the studies that address this problem rely only on textual documents while an increasing number of sources are multimedia, in particular in the context of social media where messages are often illustrated with images. In this article, we address the Multimodal Entity Linking (MEL) task, and more particularly the problem of its evaluation. To this end, we propose a novel method to quasi-automatically build annotated datasets to evaluate methods on the MEL task. The method collects text and images to jointly build a corpus of tweets with ambiguous mentions along with a Twitter KB defining the entities. We release a new annotated dataset of Twitter posts associated with images. We study the key characteristics of the proposed dataset and evaluate the performance of several MEL approaches on it.
引用
收藏
页码:4285 / 4292
页数:8
相关论文
共 50 条
  • [1] Multimodal Entity Linking: A New Dataset and A Baseline
    Gan, Jingru
    Luo, Jinchang
    Wang, Haiwei
    Wang, Shuhui
    He, Wei
    Huang, Qingming
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 993 - 1001
  • [2] WIKIDiverse: A Multimodal Entity Linking Dataset with Diversified Contextual Topics and Entity Types
    Wang, Xuwu
    Tian, Junfeng
    Gui, Min
    Li, Zhixu
    Wang, Rui
    Yang, Ming
    Chen, Lihan
    Xiao, Yanghua
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4785 - 4797
  • [3] Implicit Entity Linking in Tweets
    Perera, Sujan
    Mendes, Pablo N.
    Alex, Adarsh
    Sheth, Amit P.
    Thirunarayan, Krishnaprasad
    [J]. SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 118 - 132
  • [4] Entity linking of tweets based on dominant entity candidates
    Feng, Yue
    Zarrinkalam, Fattane
    Bagheri, Ebrahim
    Fani, Hossein
    Al-Obeidat, Feras
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2018, 8 (01)
  • [5] Reddit entity linking dataset
    Botzer, Nicholas
    Ding, Yifan
    Weninger, Tim
    [J]. Weninger, Tim (tweninger@nd.edu), 1600, Elsevier Ltd (58):
  • [6] Reddit entity linking dataset
    Botzer, Nicholas
    Ding, Yifan
    Weninger, Tim
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (03)
  • [7] Analysis of named entity recognition and linking for tweets
    Derczynski, Leon
    Maynard, Diana
    Rizzo, Giuseppe
    van Erp, Marieke
    Gorrell, Genevieve
    Troncy, Raphael
    Petrak, Johann
    Bontcheva, Kalina
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (02) : 32 - 49
  • [8] Implicit Entity Recognition, Classification and Linking in Tweets
    Hosseini, Hawre
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1448 - 1448
  • [9] Collective Entity Linking in Tweets Over Space and Time
    Chong, Wen-Haw
    Lim, Ee-Peng
    Cohen, William
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 82 - 94
  • [10] TweetNERD - End to End Entity Linking Benchmark for Tweets
    Mishra, Shubhanshu
    Saini, Aman
    Makki, Raheleh
    Mehta, Sneha
    Haghighi, Aria
    Mollahosseini, Ali
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,