Semantic annotation of natural history collections

被引:10
|
作者
Stork, Lise [1 ]
Weber, Andreas [2 ]
Miracle, Eulalia Gasso [3 ]
Verbeek, Fons [1 ]
Plaat, Aske [1 ]
van den Herik, Jaap [1 ,4 ]
Wolstencroft, Katherine [1 ]
机构
[1] Leiden Inst Adv Comp Sci, Niels Bohrweg 1, NL-2333 CA Leiden, Netherlands
[2] Univ Twente, Enschede, Netherlands
[3] Naturalis Biodivers Ctr, Leiden, Netherlands
[4] Leiden Ctr Data Sci, Leiden, Netherlands
来源
JOURNAL OF WEB SEMANTICS | 2019年 / 59卷
关键词
Linked data; Biodiversity; Natural history collections; Ontologies; Semantic annotation; History of science; SEARCH; INTERFACES; TAXA; WEB;
D O I
10.1016/j.websem.2018.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large collections of historical biodiversity expeditions are housed in natural history museums throughout the world. Potentially they can serve as rich sources of data for cultural historical and biodiversity research. However, they exist as only partially catalogued specimen repositories and images of unstructured, non-standardised, hand-written text and drawings. Although many archival collections have been digitised, disclosing their content is challenging. They refer to historical place names and outdated taxonomic classifications and are written in multiple languages. Efforts to transcribe the hand-written text can make the content accessible, but semantically describing and interlinking the content would further facilitate research. We propose a semantic model that serves to structure the named entities in natural history archival collections. In addition, we present an approach for the semantic annotation of these collections whilst documenting their provenance. This approach serves as an initial step for an adaptive learning approach for semi-automated extraction of named entities from natural history archival collections. The applicability of the semantic model and the annotation approach is demonstrated using image scans from a collection of 8, 000 field book pages gathered by the Committee for Natural History of the Netherlands Indies between 1820 and 1850, and evaluated together with domain experts from the field of natural and cultural history. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Semantic annotation and retrieval of image collections
    Osman, Taha
    Thakker, Dhavalkumar
    Schaefer, Gerald
    Leroy, Maxime
    Fournier, Alain
    [J]. 21ST EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2007: SIMULATIONS IN UNITED EUROPE, 2007, : 324 - +
  • [2] Controlled Natural Language for Semantic Annotation
    Davis, Brian
    Varma, Pradeep
    Handschuh, Siegfried
    Dragan, Laura
    Cunningham, Hamish
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 816 - +
  • [3] On Designing Controlled Natural Languages for Semantic Annotation
    Davis, Brian
    Dantuluri, Pradeep
    Dragan, Laura
    Handschuh, Siegfried
    Cunningham, Hamish
    [J]. CONTROLLED NATURAL LANGUAGE, 2010, 5972 : 187 - +
  • [4] Towards Controlled Natural Language for Semantic Annotation
    Davis, Brian
    Dantuluri, Pradeep
    Handschuh, Siegfried
    Cunningham, Hamish
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2010, 6 (04) : 64 - 91
  • [5] Computerizing natural history collections
    Sunderland, Mary E.
    [J]. ENDEAVOUR, 2013, 37 (03) : 150 - 161
  • [6] Natural history collections: Overview
    Dennis, JG
    [J]. Protecting Our Diverse Heritage: The Role of Parks, Protected Areas, and Cultural Sites, 2004, : 398 - 399
  • [7] Linking Natural History Collections
    Stork, Lise
    Weber, Andreas
    Miracle, Eulalia Gasso
    Wolstencroft, Katherine
    [J]. 2018 IEEE 14TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE 2018), 2018, : 388 - 389
  • [8] Roles of natural history collections
    Lane, MA
    [J]. ANNALS OF THE MISSOURI BOTANICAL GARDEN, 1996, 83 (04) : 536 - 545
  • [9] Semano: Semantic Annotation Framework for Natural Language Resources
    Berry, David
    Nikitina, Nadeschda
    [J]. SEMANTIC WEB - ISWC 2014, PT I, 2014, 8796 : 503 - 518
  • [10] Semantic annotation of a natural language corpus for knowledge extraction
    Navarro, B
    Martínez-Barco, P
    Palomar, M
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3513 : 365 - 368