A large-scale dataset for Korean document-level relation extraction from encyclopedia texts

Authors
Son, Suhyune [1]
Lim, Jungwoo [1]
Koo, Seonmin [1]
Kim, Jinsung [1]
Kim, Younghoon [2]
Lim, Youngsik [2]
Hyun, Dongseok [2]
Lim, Heuiseok [1]
Affiliations
[1] Korea Univ, Comp Sci & Engn, 1 5-ka, Anam Dong, Seoul 02841, South Korea
[2] NAVER, 5 Jeongjail ro, Buljeong ro, Seongnam 13561, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Natural Language Processing; Information Extraction; Document-level Relation Extraction; Korean Relation Extraction; ENTITY;
DOI
10.1007/s10489-024-05605-9
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Document-level relation extraction (RE) aims to predict the relational facts between two given entities from a document. Unlike the widespread research on document-level RE in English, Korean document-level RE research is still at a very early stage due to the absence of a dataset. To accelerate such studies, we present the TREK (Toward Document-Level Relation Extraction in Korean) dataset, constructed from Korean encyclopedia documents written by domain experts. We provide detailed statistical analyses of our large-scale dataset, and human evaluation results support the quality of TREK. We also introduce a document-level RE model that considers named-entity types while accounting for the properties of the Korean language. In the experiments, we demonstrate that our proposed model outperforms the baselines, and we conduct a qualitative analysis.
Pages: 8681 - 8701
Number of pages: 21
Related papers
50 in total
  • [21] DLEE: a dataset for Chinese document-level legal event extraction
    Xian G.
    Du S.
    Tang X.
    Shi Y.
    Jia B.
    Tang B.
    Leng Z.
    Li L.
    Neural Computing and Applications, 2024, 36 (25) : 15581 - 15597
  • [22] Document-level relation extraction with global and path dependencies
    Jia, Wei
    Ma, Ruizhe
    Yan, Li
    Niu, Weinan
    Ma, Zongmin
    KNOWLEDGE-BASED SYSTEMS, 2024, 289
  • [23] Inter span learning for document-level relation extraction
    Liao, Tao
    Sun, Haojie
    Zhang, Shunxiang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9965 - 9977
  • [24] Entity and Evidence Guided Document-Level Relation Extraction
    Huang, Kevin
    Qi, Peng
    Wang, Guangtao
    Ma, Tengyu
    Huang, Jing
    REPL4NLP 2021: PROCEEDINGS OF THE 6TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2021, : 307 - 315
  • [25] Exploiting Ubiquitous Mentions for Document-Level Relation Extraction
    Zhang, Ruoyu
    Li, Yanzeng
    Zhang, Minhao
    Zou, Lei
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1986 - 1990
  • [26] Few-Shot Document-Level Relation Extraction
    Popovic, Nicholas
    Faerber, Michael
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5733 - 5746
  • [27] Document-level Relation Extraction With Entity and Context Information
    Huang, He-Yan
    Yuan, Chang-Sen
    Feng, Chong
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (10) : 1953 - 1962
  • [28] Learning Logic Rules for Document-level Relation Extraction
    Ru, Dongyu
    Sun, Changzhi
    Feng, Jiangtao
    Qiu, Lin
    Zhou, Hao
    Zhang, Weinan
    Yu, Yong
    Li, Lei
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 1239 - 1250
  • [29] Automatic Graph Generation for Document-Level Relation Extraction
    Yu, Yanhua
    Shen, Fangting
    Yang, Shengli
    Li, Jie
    Wang, Yuling
    Ma, Ang
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
  • [30] Evidence-aware Document-level Relation Extraction
    Xu, Tianyu
    Hua, Wen
    Qu, Jianfeng
    Li, Zhixu
    Xu, Jiajie
    Liu, An
    Zhao, Lei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2311 - 2320