Can we use Linked Data Semantic Annotators for the Extraction of Domain-Relevant Expressions?

Cited by: 0
Authors
Gagnon, Michel [1 ]
Zouaq, Amal [2 ]
Jean-Louis, Ludovic [1 ]
Affiliations
[1] Ecole Polytech Montreal, Montreal, PQ, Canada
[2] Royal Mil Coll Canada, Kingston, ON, Canada
Keywords
Semantic annotation; topic extraction; evaluation
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Semantic annotation is the process of identifying expressions in texts and linking them to some semantic structure. In particular, Linked data-based Semantic Annotators are now becoming the new Holy Grail for meaning extraction from unstructured documents. This paper presents an evaluation of the main linked data-based annotators available with a focus on domain topics and named entities. In particular, we compare the ability of each tool to annotate relevant domain expressions in text. The paper also proposes a combination of annotators through voting methods and machine learning. Our results show that some linked-data annotators, especially Alchemy, can be considered as a useful resource for topic extraction. They also show that a substantial increase in recall can be achieved by combining the annotators with a weighted voting scheme. Finally, an interesting result is that by removing Alchemy from the combination, or by combining only the more precise annotators, we get a significant increase in precision, at the cost of a lower recall.
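The abstract does not spell out the weighted voting scheme, so the following is only a minimal sketch of the general idea, under stated assumptions: each annotator returns a set of candidate expressions, each annotator carries a weight, and an expression is kept when the summed weight of the annotators proposing it reaches a threshold. The function name `weighted_vote`, the annotator names other than Alchemy, the weights, and the thresholds are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def weighted_vote(annotations, weights, threshold):
    """Keep expressions whose summed annotator weight reaches the threshold.

    annotations: dict mapping annotator name -> set of proposed expressions
    weights:     dict mapping annotator name -> integer weight (assumed values)
    threshold:   minimum summed weight for an expression to be kept
    """
    scores = defaultdict(int)
    for annotator, expressions in annotations.items():
        for expr in expressions:
            # Annotators without a configured weight contribute nothing.
            scores[expr] += weights.get(annotator, 0)
    return {expr for expr, score in scores.items() if score >= threshold}


# Hypothetical outputs; "annotator_b" and "annotator_c" are placeholder names.
annotations = {
    "alchemy": {"semantic annotation", "linked data", "topic extraction"},
    "annotator_b": {"linked data", "topic extraction"},
    "annotator_c": {"linked data", "montreal"},
}
# Hypothetical integer weights, chosen only for illustration.
weights = {"alchemy": 5, "annotator_b": 3, "annotator_c": 2}

# A low threshold keeps anything a single annotator proposed (favours recall).
permissive = weighted_vote(annotations, weights, threshold=2)
# A high threshold requires agreement between annotators (favours precision).
strict = weighted_vote(annotations, weights, threshold=8)
```

In this toy setting, raising the threshold shrinks the retained set to expressions backed by several annotators, which mirrors the precision versus recall trade-off the abstract reports, though the actual weights and combination rule used by the authors may differ.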
Pages: 1239-1246
Page count: 8