A semi-supervised learning framework for biomedical event extraction based on hidden topics

被引:33
|
作者
Zhou, Deyu [1 ]
Zhong, Dayou [1 ]
机构
[1] Southeast Univ, Minist Educ, Key Lab Comp Network & Informat Integrat, Sch Comp Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Semi-supervised learning; Biomedical event extraction; Latent Dirichlet allocation; K nearest neighbor;
D O I
10.1016/j.artmed.2015.03.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objectives: Scientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, due to the lack of explicit structure, literature in life science, one of the most important sources of this information, prevents computer-based systems from accessing. Therefore, biomedical event extraction, automatically acquiring knowledge of molecular events in research articles, has attracted community-wide efforts recently. Most approaches are based on statistical models, requiring large-scale annotated corpora to precisely estimate models' parameters. However, it is usually difficult to obtain in practice. Therefore, employing un-annotated data based on semi-supervised learning for biomedical event extraction is a feasible solution and attracts more interests. Methods and material: In this paper, a semi-supervised learning framework based on hidden topics for biomedical event extraction is presented. In this framework, sentences in the un-annotated corpus are elaborately and automatically assigned with event annotations based on their distances to these sentences in the annotated corpus. More specifically, not only the structures of the sentences, but also the hidden topics embedded in the sentences are used for describing the distance. The sentences and newly assigned event annotations, together with the annotated corpus, are employed for training. Results: Experiments were conducted on the multi-level event extraction corpus, a golden standard corpus. Experimental results show that more than 2.2% improvement on F-score on biomedical event extraction is achieved by the proposed framework when compared to the state-of-the-art approach. Conclusion: The results suggest that by incorporating un-annotated data, the proposed framework indeed improves the performance of the state-of-the-art event extraction system and the similarity between sentences might be precisely described by hidden topics and structures of the sentences. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:51 / 58
页数:8
相关论文
共 50 条
  • [41] A framework for semi-supervised metric transfer learning on manifolds
    Sanodiya, Rakesh Kumar
    Mathew, Jimson
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 176 : 1 - 14
  • [42] Data-driven Event Detection with Partial Knowledge: A Hidden Structure Semi-Supervised Learning Method
    Zhou, Yuxun
    Arghandeh, Reza
    Konstantakopoulos, Ioannis
    Abdullah, Shayaan
    Spanos, Costas J.
    [J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5962 - 5968
  • [43] ESA*: A generic framework for semi-supervised inductive learning
    Yang, Shuyi
    Ienco, Dino
    Esposito, Roberto
    Pensa, Ruggero G.
    [J]. NEUROCOMPUTING, 2021, 447 (447) : 102 - 117
  • [44] A viable framework for semi-supervised learning on realistic dataset
    Chang, Hao
    Xie, Guochen
    Yu, Jun
    Ling, Qiang
    Gao, Fang
    Yu, Ye
    [J]. MACHINE LEARNING, 2023, 112 (06) : 1847 - 1869
  • [45] A semi-supervised learning framework for micropapillary adenocarcinoma detection
    Yuan Gao
    Yanhui Ding
    Wei Xiao
    Zhigang Yao
    Xiaoming Zhou
    Xiaodan Sui
    Yanna Zhao
    Yuanjie Zheng
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 639 - 648
  • [46] Graph Diffusion & PCA Framework for Semi-supervised Learning
    Avrachenkov, Konstantin
    Boisbunon, Aurelie
    Kamalov, Mikhail
    [J]. LEARNING AND INTELLIGENT OPTIMIZATION, LION 15, 2021, 12931 : 25 - 39
  • [47] A semi-supervised active learning framework for image retrieval
    Hoi, SCH
    Lyu, MR
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 302 - 309
  • [48] SSRCNN: A Semi-Supervised Learning Framework for Signal Recognition
    Dong, Yihong
    Jiang, Xiaohan
    Cheng, Lei
    Shi, Qingjiang
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (03) : 780 - 789
  • [49] A viable framework for semi-supervised learning on realistic dataset
    Hao Chang
    Guochen Xie
    Jun Yu
    Qiang Ling
    Fang Gao
    Ye Yu
    [J]. Machine Learning, 2023, 112 : 1847 - 1869
  • [50] Pseudo-label based semi-supervised learning in the distributed machine learning framework
    王晓曦
    WU Wenjun
    YANG Feng
    SI Pengbo
    ZHANG Xuanyi
    ZHANG Yanhua
    [J]. High Technology Letters, 2022, 28 (02) : 172 - 180