Lightweight Multilingual Entity Extraction and Linking

Cited by: 38
Authors
Pappu, Aasish [1]
Blanco, Roi [2]
Mehdad, Yashar [3]
Stent, Amanda [4]
Thadani, Kapil [1]
Affiliations
[1] Yahoo Res, New York, NY USA
[2] Univ A Coruna, La Coruna, Spain
[3] AirBnB, San Francisco, CA USA
[4] Bloomberg LP, New York, NY USA
DOI
10.1145/3018661.3018724
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Text analytics systems often rely heavily on detecting entity mentions in documents and linking them to knowledge bases for downstream applications such as sentiment analysis, question answering and recommender systems. A major challenge for this task is accurately detecting entities in new languages with limited labeled resources. In this paper we present an accurate and lightweight multilingual named entity recognition (NER) and linking (NEL) system. The contributions of this paper are three-fold: 1) lightweight named entity recognition with competitive accuracy; 2) candidate entity retrieval that uses search click log data and entity embeddings to achieve high precision with a low memory footprint; and 3) efficient entity disambiguation. Our system achieves state-of-the-art performance on TAC KBP 2013 multilingual data and on English AIDA-CONLL data.
Pages: 365-374
Number of pages: 10
Related papers
50 in total
  • [21] Entity and Event Topic Extraction from Podcast Episode Title and Description Using Entity Linking
    Siagian, Christian
    Shabbeer, Amina
    [J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 768 - 772
  • [22] Lightweight Named Entity Extraction for Korean Short Message Service Text
    Seon, Choong-Nyoung
    Yoo, JinHwan
    Kim, Harksoo
    Kim, Ji-Hwan
    Seo, Jungyun
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2011, 5 (03): : 560 - 574
  • [23] Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
    Ehrmann, Maud
    Romanello, Matteo
    Doucet, Antoine
    Clematide, Simon
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 : 347 - 354
  • [24] Extraction of Context Information from Web Content Using Entity Linking
    Hirata, Norifumi
    Shiramatsu, Shun
    Ozono, Tadachika
    Shintani, Toramatsu
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (02): : 18 - 23
  • [25] Entity Summarization Based on Entity Grouping in Multilingual Projected Entity Space
    Kim, Eun-kyung
    Choi, Key-Sun
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (09): : 2138 - 2146
  • [26] Lightweight Multilingual Software Analysis
    Lyons, Damian M.
    Bogar, Anne Marie
    Baird, David
    [J]. ICSOFT: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2017, : 201 - 207
  • [27] Chinese Named Entity Recognition Based on BERT and Lightweight Feature Extraction Model
    Yang, Ruisen
    Gan, Yong
    Zhang, Chenfang
    [J]. INFORMATION, 2022, 13 (11)
  • [28] Joint Entity Linking and Relation Extraction with Neural Networks for Knowledge Base Population
    Zhang, Zhenyu
    Shu, Xiaobo
    Liu, Tingwen
    Fang, Zheng
    Li, Quangang
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [30] Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction
    Schone, Patrick
    Allison, Tim
    Giannella, Chris
    Pfeifer, Craig
    [J]. 2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 944 - 951