The Archaeotools project: faceted classification and natural language processing in an archaeological context

Cited by: 22
Authors
Jeffrey, S. [1]
Richards, J. [1]
Ciravegna, F. [2]
Waller, S. [1]
Chapman, S. [2]
Zhang, Z. [2]
Affiliations
[1] Univ York, Dept Archaeol, Archaeol Data Serv, York YO1 7EP, N Yorkshire, England
[2] Univ Sheffield, Dept Comp Sci, Web Intelligence Technol Lab, Nat Language Proc Grp, Sheffield S1 4DP, S Yorkshire, England
Keywords
archaeology; grey literature; faceted classification; information extraction; natural language processing
DOI
10.1098/rsta.2009.0038
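Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract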
This paper describes 'Archaeotools', a major e-Science project in archaeology. The aim of the project is to use faceted classification and natural language processing to create an advanced infrastructure for archaeological research. The project aims to integrate over one million structured database records referring to archaeological sites and monuments in the UK, with information extracted from semi-structured grey literature reports, and unstructured antiquarian journal accounts, in a single faceted browser interface. The project has illuminated the variable level of vocabulary control and standardization that currently exists within national and local monument inventories. Nonetheless, it has demonstrated that the relatively well-defined ontologies and thesauri that exist in archaeology mean that a high level of success can be achieved using information extraction techniques. This has great potential for unlocking and making accessible the information held in grey literature and antiquarian accounts, and has lessons for allied disciplines.
Pages: 2507-2519
Number of pages: 13