Genomics and natural language processing

被引:0
|
作者
Mark D. Yandell
William H. Majoros
机构
[1] Howard Hughes Medical Institute,Department of Molecular and Cell Biology
[2] University of California,undefined
[3] The Institute for Genomic Research,undefined
来源
Nature Reviews Genetics | 2002年 / 3卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Today, the computational exploration and management of large text repositories are usually accomplished with search engines and databases that are based on a suite of text processing, indexing and search tools that are referred to collectively as 'natural language processing' (NLP) technologies. There are three fundamental aspects to NLP: information retrieval, semantics and information extraction. Exploring and managing the biomedical literature with these technologies, however, presents some interesting challenges, primarily because of the relationships between biomedical texts and biological sequences. The associations between biological sequences and texts are a truly unique aspect of the biomedical literature. However, understanding the complex associations that exist between genes, sequences and texts is a daunting task. The flood of sequence information produced by the rapid advances in genomics is creating new ways to explore texts and is blurring the traditional lines that separate bioinformatics and NLP. Biological NLP (bio-NLP) is an emerging field of research that seeks to create tools and methodologies for sequence and textual analysis that combine bioinformatics and NLP technologies in a synergistic fashion. Some bio-NLP researchers are focusing on texts as a means to discover information about protein interactions, and are wrestling with how best to adapt traditional NLP technologies to this task. Others, taking a more sequence-centred approach, are exploring the use of texts as a means to improve sequence-retrieval algorithms and as an aid to sequence annotation. If bio-NLP is to achieve its full potential, it will have to move beyond information management and generate specific predictions pertaining to gene function that can be verified at the bench. The synergistic use of sequence and text to extract latent information from the biomedical literature holds much promise in this regard. Realizing this potential, however, will require more and better ontologies, software that is able to make inferences using sequence and textual information, and access to the full text of articles.
引用
收藏
页码:601 / 610
页数:9
相关论文
共 50 条
  • [31] Natural Language Processing Future
    Surabhi, Chandhana M.
    2013 INTERNATIONAL CONFERENCE ON OPTICAL IMAGING SENSOR AND SECURITY (ICOSS 2013), 2013,
  • [32] NATURAL-LANGUAGE PROCESSING
    OBERMEIER, KK
    BYTE, 1987, 12 (14): : 225 - &
  • [33] NATURAL-LANGUAGE PROCESSING
    HIRSCHBERG, J
    BALLARD, BW
    HINDLE, D
    AT&T TECHNICAL JOURNAL, 1988, 67 (01): : 41 - 57
  • [34] NATURAL-LANGUAGE PROCESSING
    WEISCHEDEL, R
    CARBONELL, J
    GROSZ, B
    LEHNERT, W
    MARCUS, M
    PERRAULT, R
    WILENSKY, R
    ANNUAL REVIEW OF COMPUTER SCIENCE, 1989, 4 : 435 - 452
  • [35] Intelligent natural language processing
    Kacalak, Wojciech
    Stuart, Keith Douglas
    Majewski, Maciej
    ADVANCES IN NATURAL COMPUTATION, PT 1, 2006, 4221 : 584 - 587
  • [36] Explainability for Natural Language Processing
    Danilevsky, Marina
    Dhanorkar, Shipi
    Li, Yunyao
    Popa, Lucian
    Qian, Kun
    Xu, Anbang
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4033 - 4034
  • [37] Natural language processing: an introduction
    Nadkarni, Prakash M.
    Ohno-Machado, Lucile
    Chapman, Wendy W.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) : 544 - 551
  • [38] Natural Language Processing in Psychology
    Bittermann, Andre
    Fischer, Andreas
    ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2024, 232 (03): : 143 - 146
  • [39] NATURAL-LANGUAGE PROCESSING
    MCKEVITT, P
    ARTIFICIAL INTELLIGENCE REVIEW, 1992, 6 (04) : 327 - 331
  • [40] NATURAL-LANGUAGE PROCESSING
    PATTEN, T
    JACOBS, P
    IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1994, 9 (01): : 35 - 35