Mining Skeletal Phenotype Descriptions from Scientific Literature

被引:7
|
作者
Groza, Tudor [1 ]
Hunter, Jane [1 ]
Zankl, Andreas [2 ,3 ]
机构
[1] Univ Queensland, Sch ITEE, Brisbane, Qld 4072, Australia
[2] Univ Queensland, UQCCR, Bone Dysplasia Res Grp, Brisbane, Qld 4072, Australia
[3] Royal Brisbane & Womens Hosp, Genet Hlth Queensland, Herston, Qld, Australia
来源
PLOS ONE | 2013年 / 8卷 / 02期
基金
澳大利亚研究理事会;
关键词
PERFORMANCE; ONTOLOGY;
D O I
10.1371/journal.pone.0055656
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Phenotype descriptions are important for our understanding of genetics, as they enable the computation and analysis of a varied range of issues related to the genetic and developmental bases of correlated characters. The literature contains a wealth of such phenotype descriptions, usually reported as free-text entries, similar to typical clinical summaries. In this paper, we focus on creating and making available an annotated corpus of skeletal phenotype descriptions. In addition, we present and evaluate a hybrid Machine Learning approach for mining phenotype descriptions from free text. Our hybrid approach uses an ensemble of four classifiers and experiments with several aggregation techniques. The best scoring technique achieves an F-1 score of 71.52%, which is close to the state-of-the-art in other domains, where training data exists in abundance. Finally, we discuss the influence of the features chosen for the model on the overall performance of the method.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] The Welfare of Beef Cattle in the Scientific Literature From 1990 to 2019: A Text Mining Approach
    Nalon, Elena
    Contiero, Barbara
    Gottardo, Flaviana
    Cozzi, Giulio
    FRONTIERS IN VETERINARY SCIENCE, 2021, 7
  • [22] Mining Insights on Metal-Organic Framework Synthesis from Scientific Literature Texts
    Park, Hyunsoo
    Kang, Yeonghun
    Choe, Wonyoung
    Kim, Jihan
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (05) : 1190 - 1198
  • [23] SKELETAL STRUCTURAL DESCRIPTIONS
    LEVY, LS
    JOSHI, AK
    INFORMATION AND CONTROL, 1978, 39 (02): : 192 - 211
  • [24] Text mining in fisheries scientific literature: A term coding approach
    Fytilakos, Ioannis
    ECOLOGICAL INFORMATICS, 2021, 61
  • [25] CLUSTERS EVOLUTION MODELS IN THE SCIENTIFIC LITERATURE: A TEXT MINING APPROACH
    Chifor, Diana Cosmina
    Maier, Lucian C.
    Arion, Felix H.
    SCIENTIFIC PAPERS-SERIES MANAGEMENT ECONOMIC ENGINEERING IN AGRICULTURE AND RURAL DEVELOPMENT, 2023, 23 (02) : 109 - 122
  • [26] Scientific Literature Mining for Drug Discovery: A Case Study on Obesity
    Rajpal, Deepak K.
    Kumar, Vinod
    Agarwal, Pankaj
    DRUG DEVELOPMENT RESEARCH, 2011, 72 (02) : 201 - 208
  • [27] Pain in Pig Production: Text Mining Analysis of the Scientific Literature
    Contiero, Barbara
    Cozzi, Giulio
    Karpf, Lee
    Gottardo, Flaviana
    JOURNAL OF AGRICULTURAL & ENVIRONMENTAL ETHICS, 2019, 32 (03): : 401 - 412
  • [28] Terminological resources for text mining over biomedical scientific literature
    Rinaldi, Fabio
    Kaljurand, Kaarel
    Saetre, Rune
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2011, 52 (02) : 107 - 114
  • [29] Applying a text mining framework to the extraction of numerical parameters from scientific literature in the biotechnology domain
    Santos, Andre
    Nogueira, Regina
    Lourenco, Analia
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2012, 1 (01): : 2 - 9
  • [30] Pain in Pig Production: Text Mining Analysis of the Scientific Literature
    Barbara Contiero
    Giulio Cozzi
    Lee Karpf
    Flaviana Gottardo
    Journal of Agricultural and Environmental Ethics, 2019, 32 : 401 - 412