A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations

Cited by: 78
Authors
Eftimov, Tome [1, 2]
Seljak, Barbara Korousic [1]
Korosec, Peter [1, 3]
Affiliations
[1] Josef Stefan Inst, Comp Syst Dept, Ljubljana, Slovenia
[2] Josef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[3] Nat Sci & Informat Technol, Fac Math, Koper, Slovenia
Source
PLOS ONE | 2017 / Vol. 12 / Issue 06
Keywords
ALGORITHMS;
DOI
10.1371/journal.pone.0179488
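Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code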
07 ; 0710 ; 09 ;
Abstract
Evidence-based dietary information represented as unstructured text is crucial information that needs to be accessed in order to help dietitians keep up with the new knowledge that arrives daily in newly published scientific reports. Different named-entity recognition (NER) methods have been introduced previously to extract useful information from the biomedical literature. They focus, for example, on extracting gene mentions, protein mentions, relationships between genes and proteins, chemical concepts, and relationships between drugs and diseases. In this paper, we present a novel NER method, called drNER, for knowledge extraction of evidence-based dietary information. To the best of our knowledge, this is the first attempt at extracting dietary concepts. drNER is a rule-based NER method that consists of two phases. The first phase involves the detection and determination of the entity mentions, and the second involves the selection and extraction of the entities. We evaluate the method using text corpora from heterogeneous sources, including text from several scientifically validated web sites and text from scientific publications. The evaluation showed that drNER gives good results and can be used for knowledge extraction of evidence-based dietary recommendations.
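The two-phase structure described in the abstract (first detect candidate entity mentions, then select and extract the entities) can be illustrated with a minimal Python sketch. This is not the drNER implementation: the rule set, the labels (QUANTITY, NUTRIENT, GROUP), the longest-match selection policy, and all function names are hypothetical choices made only to show the general shape of a rule-based pipeline.

```python
import re
from typing import List, NamedTuple, Tuple


class Mention(NamedTuple):
    start: int
    end: int
    label: str
    text: str


# Hypothetical toy rules for illustration only; these are NOT the drNER rules.
RULES = [
    ("QUANTITY", re.compile(r"\b\d+(?:\.\d+)?\s?(?:mg|g|kg|mcg|ml|l|kcal)\b", re.IGNORECASE)),
    ("NUTRIENT", re.compile(r"\b(?:vitamin [A-K]\d*|protein|calcium|iron|fiber|sodium)\b", re.IGNORECASE)),
    ("GROUP", re.compile(r"\b(?:adults|children|pregnant women|infants)\b", re.IGNORECASE)),
]


def detect_mentions(text: str) -> List[Mention]:
    """Phase 1: apply every rule and collect all candidate mentions."""
    mentions = []
    for label, pattern in RULES:
        for m in pattern.finditer(text):
            mentions.append(Mention(m.start(), m.end(), label, m.group(0)))
    return mentions


def select_entities(mentions: List[Mention]) -> List[Mention]:
    """Phase 2 (assumed policy): keep the longest non-overlapping mentions."""
    chosen: List[Mention] = []
    # Consider longer candidates first so they win over nested shorter ones.
    for cand in sorted(mentions, key=lambda m: (-(m.end - m.start), m.start)):
        if all(cand.end <= c.start or cand.start >= c.end for c in chosen):
            chosen.append(cand)
    return sorted(chosen, key=lambda m: m.start)


def extract(text: str) -> List[Tuple[str, str]]:
    """Run both phases and return (label, surface text) pairs in text order."""
    return [(m.label, m.text) for m in select_entities(detect_mentions(text))]


if __name__ == "__main__":
    sentence = "Adults should consume 1000 mg of calcium and 15 mcg of vitamin D daily."
    for label, surface in extract(sentence):
        print(label, surface)
```

Keeping detection and selection as separate functions mirrors the two phases named in the abstract and lets the overlap-resolution policy be swapped without touching the rules.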
Pages: 32