Multilingual Entity and Relation Extraction Dataset and Model

被引:0
|
作者
Seganti, Alessandro [1 ,2 ]
Firlag, Klaudia [1 ]
Skowronska, Helena [1 ,3 ]
Satlawa, Michal [1 ]
Andruszkiewicz, Piotr [1 ,4 ]
机构
[1] Samsung R&D Inst Poland, Warsaw, Poland
[2] Equinix, Redwood City, CA 94065 USA
[3] NextSell, ODC Grp, Warsaw, Poland
[4] Warsaw Univ Technol, Warsaw, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel dataset and model for a multilingual setting to approach the task of Joint Entity and Relation Extraction. The SMi-LER dataset consists of 1.1 M annotated sentences, representing 36 relations, and 14 languages. To the best of our knowledge, this is currently both the largest and the most comprehensive dataset of this type. We introduce HERBERTa, a pipeline that combines two independent BERT models: one for sequence classification, and the other for entity tagging. The model achieves micro F-1 81.49 for English on this dataset, which is close to the current SOTA on CoNLL, SpERT.
引用
收藏
页码:1946 / 1955
页数:10
相关论文
共 50 条
  • [41] A neural joint model for entity and relation extraction from biomedical text
    Fei Li
    Meishan Zhang
    Guohong Fu
    Donghong Ji
    [J]. BMC Bioinformatics, 18
  • [42] The Overview of Entity Relation Extraction Methods
    Cheng, Xian-Yi
    Chen, Xiao-hong
    Hua, Jin
    [J]. INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT I, 2011, 134 (0I): : 749 - 754
  • [43] Review of Chinese Entity Relation Extraction
    Wang Zirui
    Miao Fang
    Jin Libiao
    [J]. CONFERENCE PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON CONTROL SCIENCE AND SYSTEMS ENGINEERING (ICCSSE), 2017, : 633 - 637
  • [44] Entity Relation Extraction to Free Text
    Zhang, Suxiang
    [J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 524 - 528
  • [45] Research of unsupervised entity relation extraction
    Liu, Yun
    Li, Mingxin
    Liu, Hui
    Cheng, Junjun
    Fu, Yanping
    [J]. Journal of Computers (Taiwan), 2019, 30 (01) : 31 - 41
  • [46] REFinD: Relation Extraction Financial Dataset
    Kaur, Simerjot
    Smiley, Charese
    Gupta, Akshat
    Sain, Joy
    Wang, Dongsheng
    Siddagangappa, Suchetha
    Aguda, Toyin
    Shah, Sameena
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3054 - 3063
  • [47] A Triple Relation Network for Joint Entity and Relation Extraction
    Wang, Zixiang
    Yang, Liqun
    Yang, Jian
    Li, Tongliang
    He, Longtao
    Li, Zhoujun
    [J]. ELECTRONICS, 2022, 11 (10)
  • [48] ENPAR:Enhancing Entity and Entity Pair Representations for Joint Entity Relation Extraction
    Wang, Yijun
    Sun, Changzhi
    Wu, Yuanbin
    Zhou, Hao
    Li, Lei
    Yan, Junchi
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2877 - 2887
  • [49] WeDGeM: A Domain-Specific Evaluation Dataset Generator for Multilingual Entity Linking Systems
    Inan, Emrah
    Dikenelli, Oguz
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT II, 2017, 10570 : 221 - 228
  • [50] Joint Entity and Relation Extraction with a Hybrid Transformer and Reinforcement Learning Based Model
    Xiao, Ya
    Tan, Chengxiang
    Fan, Zhijie
    Xu, Qian
    Zhu, Wenye
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9314 - 9321