A semi-supervised learning framework for biomedical event extraction based on hidden topics

被引:33
|
作者
Zhou, Deyu [1 ]
Zhong, Dayou [1 ]
机构
[1] Southeast Univ, Minist Educ, Key Lab Comp Network & Informat Integrat, Sch Comp Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Semi-supervised learning; Biomedical event extraction; Latent Dirichlet allocation; K nearest neighbor;
D O I
10.1016/j.artmed.2015.03.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objectives: Scientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, due to the lack of explicit structure, literature in life science, one of the most important sources of this information, prevents computer-based systems from accessing. Therefore, biomedical event extraction, automatically acquiring knowledge of molecular events in research articles, has attracted community-wide efforts recently. Most approaches are based on statistical models, requiring large-scale annotated corpora to precisely estimate models' parameters. However, it is usually difficult to obtain in practice. Therefore, employing un-annotated data based on semi-supervised learning for biomedical event extraction is a feasible solution and attracts more interests. Methods and material: In this paper, a semi-supervised learning framework based on hidden topics for biomedical event extraction is presented. In this framework, sentences in the un-annotated corpus are elaborately and automatically assigned with event annotations based on their distances to these sentences in the annotated corpus. More specifically, not only the structures of the sentences, but also the hidden topics embedded in the sentences are used for describing the distance. The sentences and newly assigned event annotations, together with the annotated corpus, are employed for training. Results: Experiments were conducted on the multi-level event extraction corpus, a golden standard corpus. Experimental results show that more than 2.2% improvement on F-score on biomedical event extraction is achieved by the proposed framework when compared to the state-of-the-art approach. Conclusion: The results suggest that by incorporating un-annotated data, the proposed framework indeed improves the performance of the state-of-the-art event extraction system and the similarity between sentences might be precisely described by hidden topics and structures of the sentences. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:51 / 58
页数:8
相关论文
共 50 条
  • [1] Semi-supervised method for biomedical event extraction
    Jian Wang
    Qian Xu
    Hongfei Lin
    Zhihao Yang
    Yanpeng Li
    [J]. Proteome Science, 11
  • [2] Semi-supervised method for biomedical event extraction
    Wang, Jian
    Xu, Qian
    Lin, Hongfei
    Yang, Zhihao
    Li, Yanpeng
    [J]. PROTEOME SCIENCE, 2013, 11
  • [3] Semantic Relation Extraction Based on Semi-supervised Learning
    Li, Haibo
    Matsuo, Yutaka
    Ishizuka, Mitsuru
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, 2010, 6458 : 270 - 279
  • [4] Learning Semi-Supervised Representation Towards a Unified Optimization Framework for Semi-Supervised Learning
    Li, Chun-Guang
    Lin, Zhouchen
    Zhang, Honggang
    Guo, Jun
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2767 - 2775
  • [5] Semi-supervised reference-based sketch extraction using a contrastive learning framework
    Seo, Chang Wook
    Ashtari, Amirsaman
    Noh, Junyong
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (04):
  • [6] STIOCS: Active learning-based semi-supervised training framework for IOC extraction
    Tang, Binhui
    Li, Xiaohui
    Wang, Junfeng
    Ge, Wenhan
    Yu, Zhongkun
    Lin, Tongcan
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 112
  • [7] Semi-Supervised Event Extraction Incorporated With Topic Event Frame
    Wu, Gongqing
    Miao, Zhuochun
    Hu, Shengjie
    Wang, Yinghuan
    Zhang, Zan
    Bao, Xianyu
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (01)
  • [8] Using Semi-Supervised Learning andWikipedia to Train an Event Argument Extraction System
    Zajec, Patrik
    Mladenic, Dunja
    [J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2022, 46 (01): : 121 - 128
  • [9] Semi-supervised learning using hidden feature augmentation
    Hang, Wenlong
    Choi, Kup-Sze
    Wang, Shitong
    Qian, Pengjiang
    [J]. APPLIED SOFT COMPUTING, 2017, 59 : 448 - 461
  • [10] Semi-supervised Learning Framework for UAV Detection
    Medaiyese, Olusiji O.
    Ezuma, Martins
    Lauf, Adrian P.
    Guvenc, Ismail
    [J]. 2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,