Named Entity Recognition for Cancer Immunology Research Using Distant Supervision

被引:0
|
作者
Hai-Long Trieu [1 ,3 ]
Miwa, Makoto [1 ,2 ]
Ananiadou, Sophia [3 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr AIRC, Tsukuba, Ibaraki, Japan
[2] Toyota Technol Inst, Toyota, Japan
[3] Univ Manchester, Natl Ctr Text Min, Manchester, Lancs, England
来源
PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022) | 2022年
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer immunology research involves several important cell and protein factors. Extracting the information of such cells and proteins and the interactions between them from text are crucial in text mining for cancer immunology research. However, there are few available datasets for these entities, and the amount of annotated documents is not sufficient compared with other major named entity types. In this work, we introduce our automatically annotated dataset of key named entities, i.e., T-cells, cytokines, and transcription factors, which engages the recent cancer immunotherapy. The entities are annotated based on the UniProtKB knowledge base using dictionary matching. We build a neural named entity recognition (NER) model to be trained on this dataset and evaluate it on a manually-annotated data. Experimental results show that we can achieve a promising NER performance even though our data is automatically annotated. Our dataset also enhances the NER performance when combined with existing data, especially gaining improvement in yet investigated named entities such as cytokines and transcription factors.
引用
收藏
页码:171 / 177
页数:7
相关论文
共 50 条
  • [31] Research on Chinese named entity recognition using combined boundary-PoS feature
    Qiang, Bao-Hua
    Huang, Jun
    Wang, Yu-Feng
    Wang, Sai
    Wang, Yong
    DESIGN, MANUFACTURING AND MECHATRONICS (ICDMM 2015), 2016, : 839 - 848
  • [32] Using WordNet Predicates for Multilingual Named Entity Recognition
    Negri, Matteo
    Magnini, Bernardo
    GWC 2004: SECOND INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS, 2003, : 169 - 174
  • [33] Named Entity Recognition for Amharic Using Deep Learning
    Gamback, Bjorn
    Sikdar, Utpal Kumar
    2017 IST-AFRICA WEEK CONFERENCE (IST-AFRICA), 2017,
  • [34] Less than One-shot: Named Entity Recognition via ExtremelyWeak Supervision
    Peng, Letian
    Wang, Zihan
    Shang, Jingbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 13603 - 13616
  • [35] Weak Supervision and Clustering-Based Sample Selection for Clinical Named Entity Recognition
    Sun, Wei
    Ji, Shaoxiong
    Denti, Tuulia
    Moen, Hans
    Kerro, Oleg
    Rannikko, Antti
    Marttinen, Pekka
    Koskinen, Miika
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 444 - 459
  • [36] Research on Named Entity Recognition Method of Network Threat Intelligence
    Zhang, Keke
    Chen, Xu
    Jing, Yongjun
    Wang, Shuyang
    Tang, Lijun
    CYBER SECURITY, CNCERT 2022, 2022, 1699 : 213 - 224
  • [37] Named entity recognition for Arabic using syntactic grammars
    Mesfar, Slim
    Natural Language Processing and Information Systems, Proceedings, 2007, 4592 : 305 - 316
  • [38] Domain Adaptation for Named Entity Recognition Using CRFs
    Tian, Tian
    Dinarelli, Marco
    Tellier, Isabelle
    Cardoso, Pedro Dias
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 561 - 565
  • [39] Named Entity Recognition for Vietnamese
    Dat Ba Nguyen
    Son Huu Hoang
    Son Bao Pham
    Thai Phuong Nguyen
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, PROCEEDINGS, 2010, 5991 : 205 - 214
  • [40] Named Entity Recognition using Conditional Random Fields
    Patil, Nita
    Patil, Ajay
    Pawar, B., V
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1181 - 1188