Lightweight Multilingual Entity Extraction and Linking

被引:38
|
作者
Pappu, Aasish [1 ]
Blanco, Roi [2 ]
Mehdad, Yashar [3 ]
Stent, Amanda [4 ]
Thadani, Kapil [1 ]
机构
[1] Yahoo Res, New York, NY USA
[2] Univ A Coruna, La Coruna, Spain
[3] AirBnB, San Francisco, CA USA
[4] Bloomberg LP, New York, NY USA
关键词
D O I
10.1145/3018661.3018724
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Text analytics systems often rely heavily on detecting and linking entity mentions in documents to knowledge bases for downstream applications such as sentiment analysis, question answering and recommender systems. A major challenge for this task is to be able to accurately detect entities in new languages with limited labeled resources. In this paper we present an accurate and lightweight(1) multilingual named entity recognition (NER) and linking (NEL) system. The contributions of this paper are three-fold: 1) Lightweight named entity recognition with competitive accuracy; 2) Candidate entity retrieval that uses search click log data and entity embeddings to achieve high precision with a low memory footprint; and 3) efficient entity disambiguation. Our system achieves state-of-the-art performance on TAC KBP 2013 multilingual data and on English AIDA-CONLL data.
引用
收藏
页码:365 / 374
页数:10
相关论文
共 50 条
  • [1] Multilingual Autoregressive Entity Linking
    De Cao, Nicola
    Wu, Ledell
    Popat, Kashyap
    Artetxe, Mikel
    Goyal, Naman
    Plekhanov, Mikhail
    Zettlemoyer, Luke
    Cancedda, Nicola
    Riedel, Sebastian
    Petroni, Fabio
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 274 - 290
  • [2] VoxEL: A Benchmark Dataset for Multilingual Entity Linking
    Rosales-Mendez, Henry
    Hogan, Aidan
    Poblete, Barbara
    SEMANTIC WEB - ISWC 2018, PT II, 2018, 11137 : 170 - 186
  • [3] Controllable Contrastive Generation for Multilingual Biomedical Entity Linking
    Zhu, Tiantian
    Qin, Yang
    Chen, Qingcai
    Mu, Xin
    Yu, Changlong
    Xiang, Yang
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 5742 - 5753
  • [4] A Lightweight Neural Model for Biomedical Entity Linking
    Chen, Lihu
    Varoquaux, Gael
    Suchanek, Fabian M.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12657 - 12665
  • [5] Multilingual Entity and Relation Extraction Dataset and Model
    Seganti, Alessandro
    Firlag, Klaudia
    Skowronska, Helena
    Satlawa, Michal
    Andruszkiewicz, Piotr
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1946 - 1955
  • [6] Multilingual bi-encoder models for biomedical entity linking
    Guven, Zekeriya Anil
    Lamurias, Andre
    EXPERT SYSTEMS, 2023, 40 (09)
  • [7] DeepType: Multilingual Entity Linking by Neural Type System Evolution
    Raiman, Jonathan
    Raiman, Olivier
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5406 - 5413
  • [8] Joint Multilingual Supervision for Cross-lingual Entity Linking
    Upadhyay, Shyam
    Gupta, Nitish
    Roth, Dan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2486 - 2495
  • [9] MELHISSA: a multilingual entity linking architecture for historical press articles
    Pontes, Elvys Linhares
    Cabrera-Diego, Luis Adrian
    Moreno, Jose G.
    Boros, Emanuela
    Hamdi, Ahmed
    Doucet, Antoine
    Sidere, Nicolas
    Coustaty, Mickael
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2022, 23 (02) : 133 - 160
  • [10] Adaptive Multilingual Representations for Cross-Lingual Entity Linking with Attention on Entity Descriptions
    Wang, Chenhao
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 1 - 12