A framework for information extraction from tables in biomedical literature

被引:0
|
作者
Nikola Milosevic
Cassie Gregson
Robert Hernandez
Goran Nenadic
机构
[1] University of Manchester,School of Computer Science
[2] AstraZeneca plc,undefined
关键词
Table mining; Text mining; Information extraction; Natural language processing; Semantic analysis;
D O I
暂无
中图分类号
学科分类号
摘要
The scientific literature is growing exponentially, and professionals are no more able to cope with the current amount of publications. Text mining provided in the past methods to retrieve and extract information from text; however, most of these approaches ignored tables and figures. The research done in mining table data still does not have an integrated approach for mining that would consider all complexities and challenges of a table. Our research is examining the methods for extracting numerical (number of patients, age, gender distribution) and textual (adverse reactions) information from tables in the clinical literature. We present a requirement analysis template and an integral methodology for information extraction from tables in clinical domain that contains 7 steps: (1) table detection, (2) functional processing, (3) structural processing, (4) semantic tagging, (5) pragmatic processing, (6) cell selection and (7) syntactic processing and extraction. Our approach performed with the F-measure ranged between 82 and 92%, depending on the variable, task and its complexity.
引用
收藏
页码:55 / 78
页数:23
相关论文
共 50 条
  • [1] A framework for information extraction from tables in biomedical literature
    Milosevic, Nikola
    Gregson, Cassie
    Hernandez, Robert
    Nenadic, Goran
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (01) : 55 - 78
  • [2] The extraction of useful information from the biomedical literature
    Kostoff, R
    [J]. ACADEMIC MEDICINE, 2001, 76 (12) : 1265 - 1270
  • [3] Automatic pathway information extraction from biomedical literature
    [J]. Yang, Z. (yangzh@dlut.edu.cn), 2013, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [4] αExtractor: a system for automatic extraction of chemical information from biomedical literature
    Jiacheng Xiong
    Xiaohong Liu
    Zhaojun Li
    Hongzhong Xiao
    Guangchao Wang
    Zhenjiang Niu
    Chaoyuan Fei
    Feisheng Zhong
    Gang Wang
    Wei Zhang
    Zunyun Fu
    Zhiguo Liu
    Kaixian Chen
    Hualiang Jiang
    Mingyue Zheng
    [J]. Science China Life Sciences, 2024, 67 (03) : 618 - 621
  • [5] αExtractor: a system for automatic extraction of chemical information from biomedical literature
    Xiong, Jiacheng
    Liu, Xiaohong
    Li, Zhaojun
    Xiao, Hongzhong
    Wang, Guangchao
    Niu, Zhenjiang
    Fei, Chaoyuan
    Zhong, Feisheng
    Wang, Gang
    Zhang, Wei
    Fu, Zunyun
    Liu, Zhiguo
    Chen, Kaixian
    Jiang, Hualiang
    Zheng, Mingyue
    [J]. SCIENCE CHINA-LIFE SCIENCES, 2024, 67 (03) : 618 - 621
  • [6] αExtractor: a system for automatic extraction of chemical information from biomedical literature
    Jiacheng Xiong
    Xiaohong Liu
    Zhaojun Li
    Hongzhong Xiao
    Guangchao Wang
    Zhenjiang Niu
    Chaoyuan Fei
    Feisheng Zhong
    Gang Wang
    Wei Zhang
    Zunyun Fu
    Zhiguo Liu
    Kaixian Chen
    Hualiang Jiang
    Mingyue Zheng
    [J]. Science China Life Sciences, 2024, 67 : 618 - 621
  • [7] AutoIE: An Automated Framework for Information Extraction from Scientific Literature
    Liu, Yangyang
    Li, Shoubin
    Huang, Kai
    Wang, Qing
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, KSEM 2024, 2024, 14885 : 424 - 436
  • [8] Automatic Extraction of HLA-Disease Interaction Information from Biomedical Literature
    Chae, JeongMin
    Chae, JiEun
    Lee, Taemin
    Jung, YoungHee
    Oh, HeungBum
    Jung, SoonYoung
    [J]. ADVANCES IN COMPUTATIONAL SCIENCE AND ENGINEERING, 2009, 28 : 219 - +
  • [9] Information extraction from biomedical text
    Hobbs, JR
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2002, 35 (04) : 260 - 264
  • [10] Simple tricks for improving pattern-based information extraction from the biomedical literature
    Quang Long Nguyen
    Tikk, Domonkos
    Leser, Ulf
    [J]. JOURNAL OF BIOMEDICAL SEMANTICS, 2010, 1