Can we use Linked Data Semantic Annotators for the Extraction of Domain-Relevant Expressions?

Cited by: 0
Authors
Gagnon, Michel [1 ]
Zouaq, Amal [2 ]
Jean-Louis, Ludovic [1 ]
Affiliations
[1] Ecole Polytech Montreal, Montreal, PQ, Canada
[2] Royal Mil Coll Canada, Kingston, ON, Canada
Keywords
Semantic annotation; topic extraction; evaluation
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Semantic annotation is the process of identifying expressions in texts and linking them to some semantic structure. In particular, linked data-based semantic annotators are now becoming the new Holy Grail for meaning extraction from unstructured documents. This paper presents an evaluation of the main linked data-based annotators available, with a focus on domain topics and named entities. Specifically, we compare the ability of each tool to annotate relevant domain expressions in text. The paper also proposes combining annotators through voting methods and machine learning. Our results show that some linked data-based annotators, especially Alchemy, can be considered a useful resource for topic extraction. They also show that a substantial increase in recall can be achieved by combining the annotators with a weighted voting scheme. Finally, removing Alchemy from the combination, or combining only the more precise annotators, yields a significant increase in precision at the cost of lower recall.
Pages: 1239-1246
Page count: 8