Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach

被引:10
|
作者
Ratkovic, Zorana [1 ,2 ]
Golik, Wiktoria [1 ]
Warnier, Pierre [1 ,3 ]
机构
[1] MIG INRA UR1077 Domaine Vilvert, F-78352 Jouy En Josas, France
[2] Univ Paris 03, CNRS, UMR 8094, LaTTiCe, F-92120 Montrouge, France
[3] Univ Grenoble 1, LIG, F-38400 St Martin Dheres, France
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
TEXT;
D O I
10.1186/1471-2105-13-S11-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Bacteria biotopes cover a wide range of diverse habitats including animal and plant hosts, natural, medical and industrial environments. The high volume of publications in the microbiology domain provides a rich source of up-to-date information on bacteria biotopes. This information, as found in scientific articles, is expressed in natural language and is rarely available in a structured format, such as a database. This information is of great importance for fundamental research and microbiology applications (e.g., medicine, agronomy, food, bioenergy). The automatic extraction of this information from texts will provide a great benefit to the field. Methods: We present a new method for extracting relationships between bacteria and their locations using the Alvis framework. Recognition of bacteria and their locations was achieved using a pattern-based approach and domain lexical resources. For the detection of environment locations, we propose a new approach that combines lexical information and the syntactic-semantic analysis of corpus terms to overcome the incompleteness of lexical resources. Bacteria location relations extend over sentence borders, and we developed domain-specific rules for dealing with bacteria anaphors. Results: We participated in the BioNLP 2011 Bacteria Biotope (BB) task with the Alvis system. Official evaluation results show that it achieves the best performance of participating systems. New developments since then have increased the F-score by 4.1 points. Conclusions: We have shown that the combination of semantic analysis and domain-adapted resources is both effective and efficient for event information extraction in the bacteria biotope domain. We plan to adapt the method to deal with a larger set of location types and a large-scale scientific article corpus to enable microbiologists to integrate and use the extracted knowledge in combination with experimental data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach
    Zorana Ratkovic
    Wiktoria Golik
    Pierre Warnier
    [J]. BMC Bioinformatics, 13
  • [2] Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks
    Asai, Akari
    Gardner, Matt
    Hajishirzi, Hannaneh
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2226 - 2243
  • [3] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
    Lewis, Patrick
    Perez, Ethan
    Piktus, Aleksandra
    Petroni, Fabio
    Karpukhin, Vladimir
    Goyal, Naman
    Kuttler, Heinrich
    Lewis, Mike
    Yih, Wen-tau
    Rocktaschel, Tim
    Riedel, Sebastian
    Kiela, Douwe
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] Medical prescription classification: a NLP-based approach
    Carchiolo, Vincenza
    Longheu, Alessandro
    Reitano, Giuseppa
    Zagarella, Luca
    [J]. PROCEEDINGS OF THE 2019 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2019, : 605 - 609
  • [5] Hybrid NLP-based extraction method to develop a knowledge graph for rock tunnel support design
    Ling, Jiaxin
    Li, Xiaojun
    Li, Haijiang
    An, Yi
    Rui, Yi
    Shen, Yi
    Zhu, Hehua
    [J]. ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [6] NLP-Based Fusion Approach to Robust Image Captioning
    Ricci, Riccardo
    Melgani, Farid
    Marcato Junior, Jose
    Goncalves, Wesley Nunes
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11809 - 11822
  • [7] NLP-based information extraction for managing the molecular biology literature
    Libbus, B
    Rindflesch, TC
    [J]. AMIA 2002 SYMPOSIUM, PROCEEDINGS: BIOMEDICAL INFORMATICS: ONE DISCIPLINE, 2002, : 445 - 449
  • [8] NLP-Based Recommendation Approach for Diverse Service Generation
    Jeong, Baek
    Lee, Kyoung Jun
    [J]. IEEE ACCESS, 2024, 12 : 14260 - 14274
  • [9] FactRunner: A New System for NLP-Based Information Extraction from Wikipedia
    Sutoyo, Rhio
    Quix, Christoph
    Kastrati, Fisnik
    [J]. WEB INFORMATION SYSTEMS AND TECHNOLOGIES, WEBIST 2013, 2014, 189 : 225 - 240
  • [10] AN NLP-BASED APPROACH FOR IMPROVING HUMAN-ROBOT INTERACTION
    Kilicaslan, Yilmaz
    Tuna, Gurkan
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2013, 3 (03) : 189 - 200