Linking genes to literature: text mining, information extraction, and retrieval applications for biology

被引:90
|
作者
Krallinger, Martin [1 ]
Valencia, Alfonso [1 ]
Hirschman, Lynette [2 ]
机构
[1] Spanish Nacl Canc Res Ctr CNIO, Struct Biol & BioComp Programme, E-28029 Madrid, Spain
[2] Mitre Corp, Bedford, MA 01730 USA
来源
GENOME BIOLOGY | 2008年 / 9卷
基金
美国国家科学基金会;
关键词
D O I
10.1186/gb-2008-9-S2-S8
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Efficient access to information contained in online scientific literature collections is essential for life science research, playing a crucial role from the initial stage of experiment planning to the final interpretation and communication of the results. The biological literature also constitutes the main information source for manual literature curation used by expert-curated databases. Following the increasing popularity of web-based applications for analyzing biological data, new text-mining and information extraction strategies are being implemented. These systems exploit existing regularities in natural language to extract biologically relevant information from electronic texts automatically. The aim of the BioCreative challenge is to promote the development of such tools and to provide insight into their performance. This review presents a general introduction to the main characteristics and applications of currently available text-mining systems for life sciences in terms of the following: the type of biological information demands being addressed; the level of information granularity of both user queries and results; and the features and methods commonly exploited by these applications. The current trend in biomedical text mining points toward an increasing diversification in terms of application types and techniques, together with integration of domain-specific resources such as ontologies. Additional descriptions of some of the systems discussed here are available on the internet http://zope.bioinfo.cnio.es/bionlp_tools/.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [1] Linking genes to literature: Text mining, information extraction, and retrieval applications for biology
    Krallinger M.
    Valencia A.
    Hirschman L.
    Genome Biology, 9 (Suppl 2)
  • [2] Event extraction for systems biology by text mining the literature
    Ananiadou, Sophia
    Pyysalo, Sampo
    Tsujii, Jun'ichi
    Kell, Douglas B.
    TRENDS IN BIOTECHNOLOGY, 2010, 28 (07) : 381 - 390
  • [3] Extraction of keyterms by simple text mining for business information retrieval
    Gao, XZ
    Murugesan, S
    Lo, B
    ICEBE 2005: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2005, : 332 - 339
  • [4] Text-mining and information-retrieval services for molecular biology
    Krallinger, M
    Valencia, A
    GENOME BIOLOGY, 2005, 6 (07)
  • [5] Text-mining and information-retrieval services for molecular biology
    Martin Krallinger
    Alfonso Valencia
    Genome Biology, 6
  • [6] Text mining and information retrieval
    Forest, Dominic
    Da Sylva, Lyne
    CANADIAN JOURNAL OF INFORMATION AND LIBRARY SCIENCE-REVUE CANADIENNE DES SCIENCES DE L INFORMATION ET DE BIBLIOTHECONOMIE, 2011, 35 (03): : 217 - 227
  • [7] Information Extraction and Linking in a Retrieval Context
    Moens, Marie-Francine
    Hiemstra, Djoerd
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 810 - +
  • [8] Evaluation of information retrieval and text mining tools on automatic named entity extraction
    Kumar, Nishant
    De Beer, Jan
    Vanthienen, Jan
    Moens, Marie-Francine
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 666 - 667
  • [9] Text mining via information extraction
    Feldman, R
    Aumann, Y
    Fresko, M
    Liphstat, O
    Rosenfeld, B
    Schler, Y
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 165 - 173
  • [10] Information Retrieval and Text Mining Technologies for Chemistry
    Krallinger, Martin
    Rabal, Obdulia
    Lourenco, Analia
    Oyarzabal, Julen
    Valencia, Alfonso
    CHEMICAL REVIEWS, 2017, 117 (12) : 7673 - 7761