A survey of document image word spotting techniques

被引:83
|
作者
Giotis, Angelos P. [1 ,2 ]
Sfikas, Giorgos [2 ]
Gatos, Basilis [2 ]
Nikou, Christophoros [1 ]
机构
[1] Univ Ioannina, Dept Comp Sci & Engn, Ioannina, Greece
[2] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, GR-15310 Athens, Greece
关键词
Word spotting; Retrieval; Document indexing; Features; Representation; Relevance feedback; HIDDEN MARKOV-MODELS; HANDWRITTEN DOCUMENTS; TEXT LINE; SEGMENTATION; RETRIEVAL; RECOGNITION; CHARACTER; ONLINE; EXTRACTION; SIMILARITY;
D O I
10.1016/j.patcog.2017.02.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vast collections of documents available in image format need to be indexed for information retrieval purposes. In this framework, word spotting is an alternative solution to optical character recognition (OCR), which is rather inefficient for recognizing text of degraded quality and unknown fonts usually appearing in printed text, or writing style variations in handwritten documents. Over the past decade there has been a growing interest in addressing document indexing using word spotting which is reflected by the continuously increasing number of approaches. However, there exist very few comprehensive studies which analyze the various aspects of a word spotting system. This work aims to review the recent approaches as well as fill the gaps in several topics with respect to the related works. The nature of texts and inherent challenges addressed by word spotting methods are thoroughly examined. After presenting the core steps which compose a word spotting system, we investigate the use of retrieval enhancement techniques based on relevance feedback which improve the retrieved results. Finally, we present the datasets which are widely used for word spotting, we describe the evaluation standards and measures applied for performance assessment and discuss the results achieved by the state of the art. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:310 / 332
页数:23
相关论文
共 50 条
  • [21] Representative Image Selection for Data Efficient Word Spotting
    Westphal, Florian
    Grahn, Hakan
    Lavesson, Niklas
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 383 - 397
  • [22] A voting-based technique for word spotting in handwritten document images
    Shamik Majumder
    Subhrangshu Ghosh
    Samir Malakar
    Ram Sarkar
    Mita Nasipuri
    Multimedia Tools and Applications, 2021, 80 : 12411 - 12434
  • [23] A word spotting method for Farsi machine-printed document images
    Pourasad, Yaghoub
    Hassibi, Houshang
    Ghorbani, Azam
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2013, 21 (03) : 734 - 746
  • [24] A voting-based technique for word spotting in handwritten document images
    Majumder, Shamik
    Ghosh, Subhrangshu
    Malakar, Samir
    Sarkar, Ram
    Nasipuri, Mita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12411 - 12434
  • [25] Ridgelet-DTW-Based Word Spotting for Arabic Historical Document
    Brik, Youcef
    Chibani, Youcef
    Zemouri, Et-Tahir
    Sehad, Abdenour
    2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 194 - +
  • [26] HMM word graph based keyword spotting in handwritten document images
    Toselli, Alejandro Hector
    Vidal, Enrique
    Romero, Veronica
    Frinken, Volkmar
    INFORMATION SCIENCES, 2016, 370 : 497 - 518
  • [27] Word Spotting in Historical Document Collections with Online-Handwritten Queries
    Wieprecht, Christian
    Rothacker, Leonard
    Fink, Gernot A.
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 162 - 167
  • [28] Word spotting
    McQueen, J
    LANGUAGE AND COGNITIVE PROCESSES, 1996, 11 (06): : 695 - 699
  • [29] Word spotting
    Lang Cognit Processes, 6 (695):
  • [30] Keyword spotting on Korean document images by matching the keyword image
    Kim, SH
    Park, SC
    Jeong, CB
    Kim, JS
    Park, HR
    Lee, GS
    DIGITAL LIBRARIES: IMPLEMENTING STRATEGIES AND SHARING EXPERIENCES, PROCEEDINGS, 2005, 3815 : 158 - 166