HMM word graph based keyword spotting in handwritten document images

被引:39
|
作者
Toselli, Alejandro Hector [1 ]
Vidal, Enrique [1 ]
Romero, Veronica [1 ]
Frinken, Volkmar [2 ,3 ,4 ]
机构
[1] Univ Politecn Valencia, Camino Vera S-N, E-46022 Valencia, Spain
[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka 812, Japan
[3] Univ Calif Davis, Elect & Comp Engn, Davis, CA 95616 USA
[4] ONU Technol Inc, San Jose, CA USA
基金
欧盟地平线“2020”;
关键词
Keyword spotting; Handwritten text recognition; Word graph; Posterior probability; Confidence score; INTERACTIVE TRANSCRIPTION; HISTORICAL DOCUMENTS; CONFIDENCE MEASURES; SEGMENTATION; RECOGNITION; ALGORITHM; FILLER; MODEL;
D O I
10.1016/j.ins.2016.07.063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Line-level keyword spotting (KWS) is presented on the basis of frame-level word posterior probabilities. These posteriors are obtained using word graphs derived from the recognition process of a full-fledged handwritten text recognizer based on hidden Markov models and N-gram language models. This approach has several advantages. First, since it uses a holistic, segmentation-free technology, it does not require any kind of word or character segmentation. Second, the use of language models allows the context of each spotted word to be taken into account, thereby considerably increasing KWS accuracy. And third, the proposed KWS scores are based on true posterior probabilities, taking into account all (or most) possible word segmentations of the input image. These scores are properly bounded and normalized. This mathematically clean formulation lends itself to smooth, threshold-based keyword queries which, in turn, permit comfortable trade-offs between search precision and recall. Experiments are carried out on several historic collections of handwritten text images, as well as a well-known data set of modern English handwritten text. According to the empirical results, the proposed approach achieves KWS results comparable to those obtained with the recently-introduced "BLSTM neural networks KWS" approach and clearly outperform the popular, state-of-the-art "Filler HMM" KWS method. Overall, the results clearly support all the above-claimed advantages of the proposed approach. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:497 / 518
页数:22
相关论文
共 50 条
  • [31] Fast HMM-Filler approach for Key Word Spotting in Handwritten Documents
    Hector Toselli, Alejandro
    Vidal, Enrique
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 501 - 505
  • [32] Mental model for handwritten keyword spotting
    Brik, Youcef
    Ziou, Djemel
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [33] Keyword spotting and retrieval of document images captured by a digital camera
    Lu, Shijian
    Tan, Chew Lim
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 994 - 998
  • [34] Word-Graph-based Handwriting Keyword Spotting of Out-of-Vocabulary Queries
    Puigcerver, Joan
    Hector Toselli, Alejandro
    Vidal, Enrique
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2035 - 2040
  • [35] Probabilistic multi-word spotting in handwritten text images
    Alejandro H. Toselli
    Enrique Vidal
    Joan Puigcerver
    Ernesto Noya-García
    Pattern Analysis and Applications, 2019, 22 : 23 - 32
  • [36] HMM based fast keyword spotting algorithm with no garbage models
    Sunil, S
    Palit, S
    Sreenivas, TV
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 1020 - 1023
  • [37] Probabilistic multi-word spotting in handwritten text images
    Toselli, Alejandro H.
    Vidal, Enrique
    Puigcerver, Joan
    Noya-Garcia, Ernesto
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 23 - 32
  • [38] Word Spotting in Historical Document Collections with Online-Handwritten Queries
    Wieprecht, Christian
    Rothacker, Leonard
    Fink, Gernot A.
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 162 - 167
  • [39] A Novel Graph Database for Handwritten Word Images
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 553 - 563
  • [40] Word Spotting Application in Historical Mongolian Document Images
    Wei, Hongxi
    Gao, Guanglai
    INTELLIGENT COMPUTING THEORIES, 2013, 7995 : 265 - 274