Unsupervised writer adaptation of whole-word HMMs with application to word-spotting

被引:8
|
作者
Rodriguez-Serrano, Jose A. [1 ,2 ]
Perronnin, Florent [1 ]
Sanchez, Gemma [2 ]
Llados, Josep [2 ]
机构
[1] XRCE, F-38240 Meylan, France
[2] Univ Autonoma Barcelona, CVC, Bellaterra 08193, Spain
关键词
Word-spotting; Handwriting recognition; Writer adaptation; Hidden Markov model; Document analysis;
D O I
10.1016/j.patrec.2010.01.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a novel approach for writer adaptation in a handwritten word-spotting task. The method exploits the fact that the semi-continuous hidden Markov model separates the word model parameters into (i) a codebook of shapes and (ii) a set of word-specific parameters. Our main contribution is to employ this property to derive writer-specific word models by statistically adapting an initial universal codebook to each document. This process is unsupervised and does not even require the appearance of the keyword(s) in the searched document. Experimental results show an increase in performance when this adaptation technique is applied. To the best of our knowledge, this is the first work dealing with adaptation for word-spotting. The preliminary version of this paper obtained an IBM Best Student Paper Award at the 19th International Conference on Pattern Recognition. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:742 / 749
页数:8
相关论文
共 50 条
  • [1] Unsupervised writer style adaptation for handwritten word spotting
    Rodriguez, Jose A.
    Perronnin, Florent
    Sanchez, Gemma
    Llados, Josep
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3039 - 3042
  • [2] A RECORD WORD-SPOTTING MECHANISM
    Heacock, R. H.
    JOURNAL OF THE SOCIETY OF MOTION PICTURE ENGINEERS, 1937, 28 (01): : 63 - 72
  • [3] Improving OCR for an Under-Resourced Script Using Unsupervised Word-Spotting
    Silberpfennig, Adi
    Wolf, Lior
    Dershowitz, Nachum
    Bhagesh, Seraogi
    Chaudhuri, Bidyut B.
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 706 - 710
  • [4] An Automatic Word-spotting Framework for Medieval Manuscripts
    Pintus, Ruggero
    Yang, Ying
    Gobbetti, Enrico
    Rushmeier, Holly
    2015 DIGITAL HERITAGE INTERNATIONAL CONGRESS, VOL 2: ANALYSIS & INTERPRETATION THEORY, METHODOLOGIES, PRESERVATION & STANDARDS DIGITAL HERITAGE PROJECTS & APPLICATIONS, 2015, : 5 - 12
  • [5] Word-spotting based on inter-word and intra-word diphone models
    Nitta, T
    Tanaka, S
    Masai, Y
    Matsuura, H
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1093 - 1096
  • [6] A Classification-free Word-Spotting System
    Vassilopoulos, Nikos
    Kavallieratou, Ergina
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [7] The measurement of whole-word productions
    Ingram, D
    JOURNAL OF CHILD LANGUAGE, 2002, 29 (04) : 713 - 733
  • [8] SUBSTITUTE FOR WHOLE-WORD METHOD
    SHAWAKER, A
    READING TEACHER, 1967, 20 (05): : 426 - 432
  • [9] Automatic Synthesis of Historical Arabic Text for Word-Spotting
    Kassis, Majeed
    El-Sana, Jihad
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 239 - 244
  • [10] HMMs for Unsupervised Vietnamese Word Segmentation
    Ba-Long Bui
    Thi-Trang Nguyen
    Huu-Hoang Nguyen
    Kiem-Hieu Nguyen
    2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 284 - 289