Linking genes to literature: text mining, information extraction, and retrieval applications for biology

Cited by: 90
Authors
Krallinger, Martin [1 ]
Valencia, Alfonso [1 ]
Hirschman, Lynette [2 ]
Affiliations
[1] Spanish Nacl Canc Res Ctr CNIO, Struct Biol & BioComp Programme, E-28029 Madrid, Spain
[2] Mitre Corp, Bedford, MA 01730 USA
Source
GENOME BIOLOGY | 2008 / Vol. 9
Funding
US National Science Foundation
DOI
10.1186/gb-2008-9-S2-S8
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Efficient access to information contained in online scientific literature collections is essential for life science research, playing a crucial role from the initial stage of experiment planning to the final interpretation and communication of the results. The biological literature also constitutes the main information source for manual literature curation used by expert-curated databases. Following the increasing popularity of web-based applications for analyzing biological data, new text-mining and information extraction strategies are being implemented. These systems exploit existing regularities in natural language to extract biologically relevant information from electronic texts automatically. The aim of the BioCreative challenge is to promote the development of such tools and to provide insight into their performance. This review presents a general introduction to the main characteristics and applications of currently available text-mining systems for life sciences in terms of the following: the type of biological information demands being addressed; the level of information granularity of both user queries and results; and the features and methods commonly exploited by these applications. The current trend in biomedical text mining points toward an increasing diversification in terms of application types and techniques, together with integration of domain-specific resources such as ontologies. Additional descriptions of some of the systems discussed here are available online at http://zope.bioinfo.cnio.es/bionlp_tools/.
Pages: 14
Related papers
50 in total
  • [31] INFORMATION EXTRACTION VERSUS TEXT SEGMENTATION FOR WEB CONTENT MINING
    Fragkou, Pavlina
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2013, 23 (08) : 1109 - 1137
  • [32] Combining Information Extraction and Text Mining for Cancer Biomarker Detection
    Dawoud, Khaled
    Gao, Shang
    Qabaja, Ala
    Karampelas, Panagiotis
    Alhajj, Reda
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 948 - 955
  • [33] Financial Statement Text Information Mining and Key Information Extraction Model Construction
    Xu, Yi
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 800 - 805
  • [34] Literature mining for the biologist: from information retrieval to biological discovery
    Lars Juhl Jensen
    Jasmin Saric
    Peer Bork
    Nature Reviews Genetics, 2006, 7 : 119 - 129
  • [35] Literature mining for the biologist: from information retrieval to biological discovery
    Jensen, LJ
    Saric, J
    Bork, P
    NATURE REVIEWS GENETICS, 2006, 7 (02) : 119 - 129
  • [37] Creating Reference Datasets for Systems Biology Applications Using Text Mining
    Krallinger, Martin
    Maria Rojas, Ana
    Valencia, Alfonso
    CHALLENGES OF SYSTEMS BIOLOGY: COMMUNITY EFFORTS TO HARNESS BIOLOGICAL COMPLEXITY, 2009, 1158 : 14 - 28
  • [38] Scientific Literature Information Extraction Using Text Mining Techniques for Human Health Risk Assessment of Electromagnetic Fields
    Lee, Sang-Woo
    Kwon, Jung-Hyok
    Lee, Ben
    Kim, Eui-Jik
    SENSORS AND MATERIALS, 2020, 32 (01) : 149 - 157
  • [39] Preface to sentiment elicitation from natural text for information retrieval and extraction
    Cambria, Erik
    Liu, Bing
    Xia, Yunqing
    Chen, Ping
    Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013, 2013,
  • [40] Clinical information extraction applications: A literature review
    Wang, Yanshan
    Wang, Liwei
    Rastegar-Mojarad, Majid
    Moon, Sungrim
    Shen, Feichen
    Afzal, Naveed
    Liu, Sijia
    Zeng, Yuqun
    Mehrabi, Saeed
    Sohn, Sunghwan
    Liu, Hongfang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 : 34 - 49