Semantics-based content extraction in typewritten historical documents

被引:16
|
作者
Antonacopoulos, A [1 ]
Karatzas, D [1 ]
机构
[1] Univ Salford, PRImA Lab, Sch Comp Sci & Engn, Salford M5 4WT, Lancs, England
关键词
D O I
10.1109/ICDAR.2005.215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a flexible approach to extracting content from scanned historical documents using semantic information. The final electronic document is the result of a "digital historical document lifecycle " process, where the expert knowledge of the historian/archivist user is incorporated at different stages. Results show that such a conversion strategy aided by (expert) user-specified semantic information and which enables the processing of individual parts of the document in a specialised way, produces superior (in a variety of significant ways) results than document analysis and understanding techniques devised for contemporary documents.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [1] Semantics-based retrieval by content
    Del Bimbo, A
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 516 - 519
  • [2] Personal name resolution crossover documents by a semantics-based approach
    Phan, XH
    Nguyen, LM
    Horiguchi, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (02): : 825 - 836
  • [3] Semantics-Based Change Impact Analysis for Heterogeneous Collections of Documents
    Autexier, Serge
    Mueller, Normen
    [J]. DOCENG2010: PROCEEDINGS OF THE 2010 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2010, : 97 - 106
  • [4] Semantics-based information extraction for detecting economic events
    Hogenboom, Alexander
    Hogenboom, Frederik
    Frasincar, Flavius
    Schouten, Kim
    van der Meer, Otto
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 64 (01) : 27 - 52
  • [5] Semantics-Based Analysis of Content Security Policy Deployment
    Calzavara, Stefano
    Rabitti, Alvise
    Bugliesi, Michele
    [J]. ACM TRANSACTIONS ON THE WEB, 2018, 12 (02)
  • [6] Semantics-based information extraction for detecting economic events
    Alexander Hogenboom
    Frederik Hogenboom
    Flavius Frasincar
    Kim Schouten
    Otto van der Meer
    [J]. Multimedia Tools and Applications, 2013, 64 : 27 - 52
  • [7] Semantics-based topic inter-relationship extraction
    Menon, Remya R. K.
    Joseph, Deepthy
    Kaimal, M. R.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (04) : 2941 - 2951
  • [8] Schema-less, semantics-based change detection for XML documents
    Zhang, SH
    Dyreson, C
    Snodgrass, RT
    [J]. WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 279 - 290
  • [9] A holistic methodology for keyword search in historical typewritten documents
    Gatos, Basilis
    Konidaris, Thomas
    Pratikakis, Ioannis
    Perantonis, Stavros J.
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 490 - 493
  • [10] Semantic extraction and semantics-based annotation and retrieval for video databases
    Liu, Y
    Li, F
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2002, 17 (01) : 5 - 20