Pattern-enhanced Named Entity Recognition with Distant Supervision

Cited by: 6
Authors
Wang, Xuan [1]
Guan, Yingjun [1]
Zhang, Yu [1]
Li, Qi [2]
Han, Jiawei [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Iowa State Univ, Dept Comp Sci, Ames, IA USA
Keywords
named entity recognition; distant supervision; pattern mining; neural network
DOI
10.1109/BigData50022.2020.9378052
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Supervised deep learning methods have achieved state-of-the-art performance on the task of named entity recognition (NER). However, such methods suffer from high cost and low efficiency in training data annotation, leading to highly specialized NER models that cannot be easily adapted to new domains. Recently, distant supervision has been applied to replace human annotation, thanks to the fast development of domain-specific knowledge bases. However, the generated noisy labels pose significant challenges in learning effective neural models with distant supervision. We propose PATNER, a distantly supervised NER model that effectively deals with noisy distant supervision from domain-specific dictionaries. PATNER does not require human-annotated training data but only relies on unlabeled data and incomplete domain-specific dictionaries for distant supervision. It incorporates the distant labeling uncertainty into the neural model training to enhance distant supervision. We go beyond the traditional sequence labeling framework and propose a more effective fuzzy neural model using the tie-or-break tagging scheme for the NER task. Extensive experiments on three benchmark datasets in two domains demonstrate the power of PATNER. Case studies on two additional real-world datasets demonstrate that PATNER improves the distant NER performance in both entity boundary detection and entity type recognition. The results show great promise in supporting high-quality named entity recognition with domain-specific dictionaries on a wide variety of entity types.
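As an illustration of the two ideas the abstract names (dictionary-based distant labeling and a tie-or-break tagging scheme), the following is a minimal sketch and not PATNER's actual implementation. It assumes exact, case-insensitive, greedy longest-match lookup against a toy dictionary, and it reads "tie-or-break" as one tag per gap between adjacent tokens: "Tie" when both tokens fall inside the same matched mention, "Break" otherwise. The function names, the `max_len` parameter, and the example dictionary are all hypothetical, and the sketch does not model the labeling uncertainty that the abstract says is incorporated into training.

```python
from typing import Dict, List, Tuple

Span = Tuple[int, int, str]  # (start, end_exclusive, entity_type)


def distant_spans(tokens: List[str], dictionary: Dict[str, str],
                  max_len: int = 5) -> List[Span]:
    """Greedy longest-match lookup of token n-grams in a dictionary.

    Assumption: dictionary keys are lowercased, space-joined phrases.
    """
    spans: List[Span] = []
    i = 0
    while i < len(tokens):
        match = None
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n]).lower()
            if phrase in dictionary:
                match = (i, i + n, dictionary[phrase])
                break
        if match is not None:
            spans.append(match)
            i = match[1]  # continue after the matched mention
        else:
            i += 1
    return spans


def tie_or_break(tokens: List[str], spans: List[Span]) -> List[str]:
    """Return one tag per gap between adjacent tokens.

    Gap k sits between tokens[k] and tokens[k + 1].
    """
    tags = ["Break"] * max(len(tokens) - 1, 0)
    for start, end, _ in spans:
        for gap in range(start, end - 1):
            tags[gap] = "Tie"
    return tags


# Hypothetical toy dictionary and sentence, for illustration only.
toy_dictionary = {"breast cancer": "Disease", "brca1": "Gene"}
tokens = "Mutations in breast cancer involve BRCA1".split()
spans = distant_spans(tokens, toy_dictionary)
tags = tie_or_break(tokens, spans)
```

In this toy run the only "Tie" is the gap inside "breast cancer"; a single-token match such as "BRCA1" produces no "Tie" and is delimited by the "Break" tags around it. Because the dictionary is incomplete, any mention it lacks is silently tagged "Break", which is the kind of label noise the abstract refers to.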
Pages: 818-827
Number of pages: 10
Related papers
50 in total
  • [1] PENNER: Pattern-enhanced Nested Named Entity Recognition in Biomedical Literature
    Wang, Xuan
    Zhang, Yu
    Li, Qi
    Wu, Cathy H.
    Han, Jiawei
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 540 - 547
  • [2] Named Entity Recognition for Cancer Immunology Research Using Distant Supervision
    Hai-Long Trieu
    Miwa, Makoto
    Ananiadou, Sophia
    [J]. PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 171 - 177
  • [3] Named Entity Recognition for Open Domain Data Based on Distant Supervision
    Wu, Junshuang
    Zhang, Richong
    Deng, Ting
    Huai, Jinpeng
    [J]. KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 185 - 197
  • [4] A template augmented distant supervision framework for Chinese named entity recognition
    Qi, Chengwen
    Laili, Yuanjun
    Ren, Lei
    Zhang, Lin
    Li, Bowen
    [J]. INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2024, 15 (01)
  • [5] Adaptive Named Entity Recognition Using Distant Supervision for Contemporary Written Texts
    Kim, Juae
    Kim, Yejin
    Kang, Sangwoo
    Seo, Jungyun
    [J]. IEEE ACCESS, 2021, 9 : 80405 - 80414
  • [6] Research on the Named Entity Recognition for Rail Fault Text Based on Distant Supervision
    Cai, Yi
    Su, Shuai
    Li, Zheng
    Han, Qinglong
    Zhang, Jianxia
    [J]. 2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 939 - 944
  • [7] Bagging-Based Active Learning Model for Named Entity Recognition with Distant Supervision
    Lee, Sunghee
    Song, Yeongkil
    Choi, Maengsik
    Kim, Harksoo
    [J]. 2016 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2016, : 321 - 324
  • [8] Biomedical Named Entity Recognition with Less Supervision
    Ghiasvand, Omid
    Kate, Rohit J.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2015), 2015, : 495 - 495
  • [9] Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature
    Wang, Xuan
    Song, Xiangchen
    Li, Bangzheng
    Zhou, Kang
    Li, Qi
    Han, Jiawei
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 491 - 494
  • [10] BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
    Liang, Chen
    Yu, Yue
    Jiang, Haoming
    Er, Siawpeng
    Wang, Ruijia
    Zhao, Tuo
    Zhang, Chao
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1054 - 1064