Lightweight Multilingual Entity Extraction and Linking

被引:38
|
作者
Pappu, Aasish [1 ]
Blanco, Roi [2 ]
Mehdad, Yashar [3 ]
Stent, Amanda [4 ]
Thadani, Kapil [1 ]
机构
[1] Yahoo Res, New York, NY USA
[2] Univ A Coruna, La Coruna, Spain
[3] AirBnB, San Francisco, CA USA
[4] Bloomberg LP, New York, NY USA
关键词
D O I
10.1145/3018661.3018724
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Text analytics systems often rely heavily on detecting and linking entity mentions in documents to knowledge bases for downstream applications such as sentiment analysis, question answering and recommender systems. A major challenge for this task is to be able to accurately detect entities in new languages with limited labeled resources. In this paper we present an accurate and lightweight(1) multilingual named entity recognition (NER) and linking (NEL) system. The contributions of this paper are three-fold: 1) Lightweight named entity recognition with competitive accuracy; 2) Candidate entity retrieval that uses search click log data and entity embeddings to achieve high precision with a low memory footprint; and 3) efficient entity disambiguation. Our system achieves state-of-the-art performance on TAC KBP 2013 multilingual data and on English AIDA-CONLL data.
引用
收藏
页码:365 / 374
页数:10
相关论文
共 50 条
  • [1] Multilingual Autoregressive Entity Linking
    De Cao, Nicola
    Wu, Ledell
    Popat, Kashyap
    Artetxe, Mikel
    Goyal, Naman
    Plekhanov, Mikhail
    Zettlemoyer, Luke
    Cancedda, Nicola
    Riedel, Sebastian
    Petroni, Fabio
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 274 - 290
  • [2] VoxEL: A Benchmark Dataset for Multilingual Entity Linking
    Rosales-Mendez, Henry
    Hogan, Aidan
    Poblete, Barbara
    [J]. SEMANTIC WEB - ISWC 2018, PT II, 2018, 11137 : 170 - 186
  • [3] A Lightweight Neural Model for Biomedical Entity Linking
    Chen, Lihu
    Varoquaux, Gael
    Suchanek, Fabian M.
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12657 - 12665
  • [4] Multilingual Entity and Relation Extraction Dataset and Model
    Seganti, Alessandro
    Firlag, Klaudia
    Skowronska, Helena
    Satlawa, Michal
    Andruszkiewicz, Piotr
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1946 - 1955
  • [5] Multilingual bi-encoder models for biomedical entity linking
    Guven, Zekeriya Anil
    Lamurias, Andre
    [J]. EXPERT SYSTEMS, 2023, 40 (09)
  • [6] DeepType: Multilingual Entity Linking by Neural Type System Evolution
    Raiman, Jonathan
    Raiman, Olivier
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5406 - 5413
  • [7] Joint Multilingual Supervision for Cross-lingual Entity Linking
    Upadhyay, Shyam
    Gupta, Nitish
    Roth, Dan
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2486 - 2495
  • [8] Adaptive Multilingual Representations for Cross-Lingual Entity Linking with Attention on Entity Descriptions
    Wang, Chenhao
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    [J]. KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 1 - 12
  • [9] A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
    Hamdi, Ahmed
    Pontes, Elvys Linhares
    Boros, Emanuela
    Thi Tuyet Hai Nguyen
    Hackl, Guenter
    Moreno, Jose G.
    Doucet, Antoine
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2328 - 2334
  • [10] Evaluation of Incremental Entity Extraction with Background Knowledge and Entity Linking
    Pozzi, Riccardo
    Moiraghi, Federico
    Lodi, Fausto
    Palmonari, Matteo
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE GRAPHS, IJCKG 2022, 2022, : 30 - 38