Counterfactual Generator: A Weakly-Supervised Method for Named Entity Recognition

被引:0
|
作者
Zeng, Xiangji [1 ]
Li, Yunliang [1 ]
Zhai, Yuchen [1 ]
Zhang, Yin [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Past progress on neural models has proven that named entity recognition is no longer a problem if we have enough labeled data. However, collecting enough data and annotating them are labor-intensive, time-consuming, and expensive. In this paper, we decompose the sentence into two parts: entity and context, and rethink the relationship between them and model performance from a causal perspective. Based on this, we propose the Counterfactual Generator, which generates counterfactual examples by the interventions on the existing observational examples to enhance the original dataset. Experiments across three datasets show that our method improves the generalization ability of models under limited observational examples. Besides, we provide a theoretical foundation by using a structural causal model to explore the spurious correlations between input features and output labels. We investigate the causal effects of entity or context on model performance under both conditions: the non-augmented and the augmented. Interestingly, we find that the non-spurious correlations are more located in entity representation rather than context representation. As a result, our method eliminates part of the spurious correlations between context representation and output labels. The code is available at https://github.com/xijiz/cfgen.
引用
收藏
页码:7270 / 7280
页数:11
相关论文
共 50 条
  • [21] Weakly Supervised Attention Networks for Entity Recognition
    Patra, Barun
    Moniz, Joel Ruben Antony
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6268 - 6273
  • [22] Noise Detection for Distant Supervised Named Entity Recognition
    Wang J.
    Wang K.
    Wang H.
    Du W.
    He Z.
    Ruan T.
    Liu J.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 916 - 928
  • [23] Analysis of Different Supervised Techniques for Named Entity Recognition
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT I, 2019, 1075 : 184 - 195
  • [24] Weakly Supervised Cross-Lingual Named Entity Recognition via Effective Annotation and Representation Projection
    Ni, Jian
    Dinu, Georgiana
    Florian, Radu
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1470 - 1480
  • [25] External Knowledge-Based Weakly Supervised Learning Approach on Chinese Clinical Named Entity Recognition
    Duan, Yeheng
    Ma, Long-Long
    Han, Xianpei
    Sun, Le
    Dong, Bin
    Jiang, Shanshan
    SEMANTIC TECHNOLOGY, JIST 2019: PROCEEDINGS, 2020, 12032 : 336 - 352
  • [26] A Method of Named Entity Recognition for Tigrinya
    Yohannes, Hailemariam Mehari
    Amagasa, Toshiyuki
    APPLIED COMPUTING REVIEW, 2022, 22 (03): : 56 - 68
  • [27] Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data
    Zhang, Feifei
    Xu, Mingliang
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1800 - 1814
  • [28] Weakly-Supervised Symptom Recognition for Rare Diseases in Biomedical Text
    Holat, Pierre
    Tomeh, Nadi
    Charnois, Thierry
    Battistelli, Delphine
    Jaulent, Marie-Christine
    Metivier, Jean-Philippe
    ADVANCES IN INTELLIGENT DATA ANALYSIS XV, 2016, 9897 : 192 - 203
  • [29] WEAKLY-SUPERVISED DEEP SELF-LEARNING FOR FACE RECOGNITION
    Chen, Binghui
    Deng, Weihong
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [30] Learning to select pseudo labels: a semi-supervised method for named entity recognition
    Zhen-zhen Li
    Da-wei Feng
    Dong-sheng Li
    Xi-cheng Lu
    Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 903 - 916