Bayesian background models for keyword spotting in handwritten documents

Cited by: 6
Authors
Kumar, Gaurav [1]
Govindaraju, Venu [1]
Affiliations
[1] Univ Buffalo, Dept Comp Sci & Engn, 113 Davis Hall, Amherst, NY 14260 USA
Keywords
Handwriting recognition; Keyword spotting; Bayesian generalized linear models; Bayesian generalized kernel models; CLASSIFICATION; REGRESSION
DOI
10.1016/j.patcog.2016.06.030
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The background of a handwritten document is anything other than the words we are interested in. Keyword spotting systems typically capture the characteristics of this background with a background model. We propose two Bayesian background models for keyword spotting in handwritten documents. The first is based on a Bayesian generalized linear model and is called the variational dynamic background model (VDBM); the second is a Bayesian generalized kernel background model (BGKBM). Given a set of handwritten documents and a set of keyword and non-keyword scores, the models learn an efficient Bayesian rejection criterion and output the most confident keyword regions in each document. For VDBM, the parameters are inferred using variational methods; for BGKBM, they are inferred using a proposed Markov chain Monte Carlo (MCMC) approach. Both models are built on top of the scores that a handwriting recognizer returns for keywords and non-keywords. The approach is recognition-based and works at the line level. The methods have been validated on the publicly available IAM dataset and compared with other state-of-the-art line-level keyword spotting approaches.
Pages: 84-91
Number of pages: 8
Related papers
50 in total
  • [1] Bayesian Active Learning for Keyword Spotting in Handwritten Documents
    Kumar, Gaurav
    Govindaraju, Venu
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2041 - 2046
  • [2] Variational Dynamic Background Model for Keyword Spotting in Handwritten Documents
    Kumar, Gaurav
    Wshah, Safwan
    Govindaraju, Venu
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [3] New Gradient Descriptor for Keyword Spotting in Handwritten Documents
    Bouined, Mohamed Lamine
    Nemmour, Hassiba
    Chibani, Youcef
    2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 160 - 164
  • [4] Graph Based Keyword Spotting in Handwritten Historical Slavic Documents
    Riesen, Kaspar
    Brodic, Darko
    ERCIM NEWS, 2013, (95): : 37 - 38
  • [5] Keyword spotting in historical handwritten documents based on graph matching
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    PATTERN RECOGNITION, 2018, 81 : 240 - 253
  • [6] Visual keyword based word-spotting in handwritten documents
    Kolcz, A
    Alspector, J
    Augusteijn, M
    Carlson, R
    Popescu, GV
    DOCUMENT RECOGNITION V, 1998, 3305 : 185 - 193
  • [7] ICDAR2015 Competition on Keyword Spotting for Handwritten Documents
    Puigcerver, Joan
    Toselli, Alejandro H.
    Vidal, Enrique
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1176 - 1180
  • [8] Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 564 - 573
  • [9] Two-Stage Approach to Keyword Spotting in Handwritten Documents
    Haji, Mehdi
    Ameri, Mohammad R.
    Bui, Tien D.
    Suen, Ching Y.
    Ponson, Dominique
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [10] Keyword Spotting in Handwritten Documents using Projections of Oriented Gradients
    Retsinas, George
    Louloudis, Georgios
    Stamatopoulos, Nikolaos
    Gatos, Basilis
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 411 - 416