Lightweight Multilingual Entity Extraction and Linking

Cited by: 38
Authors
Pappu, Aasish [1]
Blanco, Roi [2]
Mehdad, Yashar [3]
Stent, Amanda [4]
Thadani, Kapil [1]
Affiliations
[1] Yahoo Res, New York, NY USA
[2] Univ A Coruna, La Coruna, Spain
[3] AirBnB, San Francisco, CA USA
[4] Bloomberg LP, New York, NY USA
DOI
10.1145/3018661.3018724
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Text analytics systems often rely heavily on detecting entity mentions in documents and linking them to knowledge bases for downstream applications such as sentiment analysis, question answering, and recommender systems. A major challenge for this task is accurately detecting entities in new languages with limited labeled resources. In this paper we present an accurate and lightweight multilingual named entity recognition (NER) and linking (NEL) system. The contributions of this paper are threefold: 1) lightweight named entity recognition with competitive accuracy; 2) candidate entity retrieval that uses search click log data and entity embeddings to achieve high precision with a low memory footprint; and 3) efficient entity disambiguation. Our system achieves state-of-the-art performance on TAC KBP 2013 multilingual data and on English AIDA-CoNLL data.
Pages: 365-374
Page count: 10
Related papers
50 in total
  • [41] Lightweight Liquid Metal Entity
    Yuan, Bo
    Zhao, Chenjia
    Sun, Xuyang
    Liu, Jing
    ADVANCED FUNCTIONAL MATERIALS, 2020, 30 (14)
  • [42] Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach
    Gattani, Abhishek
    Lamba, Digvijay S.
    Garera, Nikesh
    Tiwari, Mitul
    Chai, Xiaoyong
    Das, Sanjib
    Subramaniam, Sri
    Rajaraman, Anand
    Harinarayan, Venky
    Doan, Anhai
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (11): 1126 - 1137
  • [43] ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
    Orlandi, Riccardo
    Cabot, Pere-Lluis Huguet
    Barbat, Edoardo
    Navigli, Roberto
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 14114 - 14132
  • [44] Headword-Oriented Entity Linking: A Special Entity Linking Task with Dataset and Baseline
    Yang, Mu
    Chen, Chi-Yen
    Lee, Yi-Hui
    Zeng, Qian-Hui
    Ma, Wei-Yun
    Shih, Chen-Yang
    Chen, Wei-Jhih
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1910 - 1917
  • [45] Joint Learning of Named Entity Recognition and Entity Linking
    Martins, Pedro Henrique
    Marinho, Zita
    Martins, Andre F. T.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019): STUDENT RESEARCH WORKSHOP, 2019, : 190 - 196
  • [46] Personal Entity, Concept, and Named Entity Linking in Conversations
    Joko, Hideaki
    Hasibi, Faegheh
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4099 - 4103
  • [47] Entity Extraction within Plain-Text Collections WISE 2013 Challenge-T1: Entity Linking Track
    Abreu, Carolina
    Costa, Flavio
    Santos, Laecio
    Monteiro, Lucas
    Peres de Oliveira, Luiz Fernando
    Lustosa, Patricia
    Li Weigang
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT I, 2013, 8180 : 491 - 496
  • [48] A statistical model for multilingual entity detection and tracking
    Florian, R
    Hassan, H
    Ittycheriah, A
    Jing, H
    Kambhatla, N
    Luo, X
    Nicolov, N
    Roukos, S
    HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 1 - 8
  • [49] Entity linking of tweets based on dominant entity candidates
    Feng, Yue
    Zarrinkalam, Fattane
    Bagheri, Ebrahim
    Fani, Hossein
    Al-Obeidat, Feras
    SOCIAL NETWORK ANALYSIS AND MINING, 2018, 8 (01)
  • [50] Exploiting anonymous entity mentions for named entity linking
    Feng Hou
    Ruili Wang
    See-Kiong Ng
    Michael Witbrock
    Fangyi Zhu
    Xiaoyun Jia
    Knowledge and Information Systems, 2023, 65 : 1221 - 1242