A large-scale dataset for korean document-level relation extraction from encyclopedia texts

被引:0
|
作者
Son, Suhyune [1 ]
Lim, Jungwoo [1 ]
Koo, Seonmin [1 ]
Kim, Jinsung [1 ]
Kim, Younghoon [2 ]
Lim, Youngsik [2 ]
Hyun, Dongseok [2 ]
Lim, Heuiseok [1 ]
机构
[1] Korea Univ, Comp Sci & Engn, 1 5-ka,Anam Dong, Seoul 02841, South Korea
[2] NAVER, 5 Jeongjail ro,Buljeong ro, Seongnam 13561, South Korea
基金
新加坡国家研究基金会;
关键词
Natural Language Processing; Information Extraction; Document-level Relation Extraction; Korean Relation Extraction; ENTITY;
D O I
10.1007/s10489-024-05605-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document-level relation extraction (RE) aims to predict the relational facts between two given entities from a document. Unlike widespread research on document-level RE in English, Korean document-level RE research is still at the very beginning due to the absence of a dataset. To accelerate the studies, we present TREK (Toward Document-Level Relation Extraction in Korean) dataset constructed from Korean encyclopedia documents written by the domain experts. We provide detailed statistical analyses for our large-scale dataset and human evaluation results suggest the assured quality of TREK . Also, we introduce the document-level RE model that considers the named entity-type while considering the Korean language's properties. In the experiments, we demonstrate that our proposed model outperforms the baselines and conduct qualitative analysis.
引用
收藏
页码:8681 / 8701
页数:21
相关论文
共 50 条
  • [41] A Personalized Federated Framework for Document-level Biomedical Relation Extraction
    Xiao, Yan
    Jin, Yaochu
    Zhang, Haoyu
    Huo, Xu
    Liu, Qiqi
    Zheng, Zeqi
    2024 6TH INTERNATIONAL CONFERENCE ON DATA-DRIVEN OPTIMIZATION OF COMPLEX SYSTEMS, DOCS 2024, 2024, : 457 - 461
  • [42] Denoising Graph Inference Network for Document-Level Relation Extraction
    Wang, Hailin
    Qin, Ke
    Duan, Guiduo
    Luo, Guangchun
    BIG DATA MINING AND ANALYTICS, 2023, 6 (02) : 248 - 262
  • [43] Towards Integration of Discriminability and Robustness for Document-Level Relation Extraction
    Guo, Jia
    Kok, Stanley
    Bing, Lidong
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2606 - 2617
  • [44] CorefDRE: Coref-Aware Document-Level Relation Extraction
    Xue, Zhongxuan
    Zhong, Jiang
    Dai, Qizhu
    Li, Rongzhen
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 116 - 128
  • [45] Document-level relation extraction with Entity-Selection Attention
    Yuan, Changsen
    Huang, Heyan
    Feng, Chong
    Shi, Ge
    Wei, Xiaochi
    INFORMATION SCIENCES, 2021, 568 : 163 - 174
  • [46] Document-level Relation Extraction with Progressive Self-distillation
    Wang, Quan
    Mao, Zhendong
    Gao, Jie
    Zhang, Yongdong
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (06)
  • [47] DoreBer: Document-Level Relation Extraction Method Based on BernNet
    Yuan, Boya
    Xu, Liwen
    IEEE ACCESS, 2023, 11 : 136468 - 136477
  • [48] Document-Level Event Temporal Relation Extraction with Context Information
    Wang J.
    Shi C.
    Zhang J.
    Yu X.
    Liu Y.
    Cheng X.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (11): : 2475 - 2484
  • [49] Document-Level Relation Extraction with Structure Enhanced Transformer Encoder
    Liu, Wanlong
    Zhou, Li
    Zeng, Dingyi
    Qu, Hong
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [50] Document-level Relation Extraction with Entity Interaction and Commonsense Knowledge
    Liu, Shen
    Shen, Xinshu
    Liu, Tingting
    Lan, Man
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,