LTACL: long-tail awareness contrastive learning for distantly supervised relation extraction

被引:4
|
作者
Yan, Tianwei [1 ]
Zhang, Xiang [1 ]
Luo, Zhigang [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp Sci, Changsha, Peoples R China
关键词
Distantly supervised learning; Information extraction; Relation extraction; Contrastive learning;
D O I
10.1007/s40747-023-01226-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distantly supervised relation extraction is an automatically annotating method for large corpora by classifying a bound of sentences with two same entities and the relation. Recent works exploit sound performance by adopting contrastive learning to efficiently obtain instance representations under the multi-instance learning framework. Though these methods weaken the impact of noisy labels, it ignores the long-tail distribution problem in distantly supervised sets and fails to capture the mutual information of different parts. We are thus motivated to tackle these issues and establishing a long-tail awareness contrastive learning method for efficiently utilizing the long-tail data. Our model treats major and tail parts differently by adopting hyper-augmentation strategies. Moreover, the model provides various views by constructing novel positive and negative pairs in contrastive learning for gaining a better representation between different parts. The experimental results on the NYT10 dataset demonstrate our model surpasses the existing SOTA by more than 2.61% AUC score on relation extraction. In manual evaluation datasets including NYT10m and Wiki20m, our method obtains competitive results by achieving 59.42% and 79.19% AUC scores on relation extraction, respectively. Extensive discussions further confirm the effectiveness of our approach.
引用
收藏
页码:1551 / 1563
页数:13
相关论文
共 50 条
  • [1] LTACL: long-tail awareness contrastive learning for distantly supervised relation extraction
    Tianwei Yan
    Xiang Zhang
    Zhigang Luo
    Complex & Intelligent Systems, 2024, 10 (1) : 1551 - 1563
  • [2] Exploring Long Tail Data in Distantly Supervised Relation Extraction
    Gui, Yaocheng
    Liu, Qian
    Zhu, Man
    Gao, Zhiqiang
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 514 - 522
  • [3] A Multi-granularity Contrastive Learning for Distantly Supervised Relation Extraction
    Jian, Zhaorui
    Liu, Shenquan
    Yin, Huixin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 352 - 364
  • [4] HiCLRE: A Hierarchical Contrastive Learning Framework for Distantly Supervised Relation Extraction
    Li, Dongyang
    Zhang, Taolin
    Hu, Nan
    Wang, Chengyu
    He, Xiaofeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2567 - 2578
  • [5] Distantly supervised relation extraction with a Meta-Relation enhanced Contrastive learning framework
    Chen, Chuanshu
    Hao, Shuang
    Liu, Jian
    NEUROCOMPUTING, 2025, 617
  • [6] Learning Relation Prototype From Unlabeled Texts for Long-Tail Relation Extraction
    Cao, Yixin
    Kuang, Jun
    Gao, Ming
    Zhou, Aoying
    Wen, Yonggang
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1761 - 1774
  • [7] Reinforcement Learning for Distantly Supervised Relation Extraction
    Sun, Tingting
    Zhang, Chunhong
    Ji, Yang
    Hu, Zheng
    IEEE ACCESS, 2019, 7 : 98023 - 98033
  • [8] Long-Tail Augmented Graph Contrastive Learning for Recommendation
    Zhao, Qian
    Wu, Zhengwei
    Zhang, Zhiqiang
    Zhou, Jun
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 387 - 403
  • [9] Learning labeling functions in distantly supervised relation extraction
    Gui, Yaocheng
    Liu, Qian
    Gao, Zhiqiang
    INTELLIGENT DATA ANALYSIS, 2020, 24 (02) : 427 - 443
  • [10] GACRec: Generative adversarial contrastive learning for improved long-tail item recommendation
    Qin, Bingjun
    Huang, Zhenhua
    Tian, Xing
    Chen, Yunwen
    Wang, Wenguang
    KNOWLEDGE-BASED SYSTEMS, 2024, 300