Bayesian background models for keyword spotting in handwritten documents

被引:6
|
作者
Kumar, Gaurav [1 ]
Govindaraju, Venu [1 ]
机构
[1] Univ Buffalo, Dept Comp Sci & Engn, 113 Davis Hall, Amherst, NY 14260 USA
关键词
Handwriting recognition; Keyword spotting; Bayesian generalized linear models; Bayesian generalized kernel models; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.patcog.2016.06.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background in a handwritten document can be anything other than the words we are interested in. The characteristics of the background are typically captured by a background model to achieve spotting in handwritten documents. We propose two such Bayesian background models for keyword spotting in handwritten documents. Firstly, we present a background model using the Bayesian generalized linear model called (VDBM) and secondly propose a Bayesian generalized kernel background model called BGKBM. Given a set of handwritten documents and a bunch of keyword and non-keyword scores, the models learn an efficient Bayesian rejection criteria to output the most confident keyword regions in the handwritten document. For the variational dynamic background model (VDBM) the inference of parameters is done using variational methods and for the Bayesian generalized kernel background model (BGKBM), the inference is done using a proposed Markov chain Monte Carlo (MCMC) approach. The models are built on top of the scores returned by a handwritten recognizer for keywords and non-keywords. The approach is recognition based and works at line level. The methods have been validated on publicly available IAM dataset and compared with other state of the art line level keyword spotting approaches.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [21] Cross-Evaluation of Graph-Based Keyword Spotting in Handwritten Historical Documents
    Stauffer, Michael
    Maergner, Paul
    Fischer, Andreas
    Riesen, Kaspar
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019, 2019, 11510 : 45 - 55
  • [22] Keyword spotting in handwritten documents based on a generic text line HMM and a SVM verification
    Kessentini, Yousri
    Paquet, Thierry
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 41 - 45
  • [23] Keyword search in handwritten documents
    Kolcz, A
    Alspector, J
    Augusteijn, M
    Carlson, R
    Popescu, GV
    PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON APPLICATIONS OF NEURAL NETWORKS TO TELECOMMUNICATIONS 3, 1997, 3 : 171 - 180
  • [24] HMM Based Keyword Spotting System in Printed/Handwritten Arabic/Latin Documents with Identification Stage
    Rouhou, Ahmed Cheikh
    Kessentini, Yousri
    Kanoun, Slim
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 309 - 320
  • [25] Deep Learning Features for Handwritten Keyword Spotting
    Wicht, Baptiste
    Fischer, Andreas
    Hennebert, Jean
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3434 - 3439
  • [26] Keyword spotting in handwritten chinese documents using semi-markov conditional random fields
    Zhang, Heng
    Zhou, Xiang-Dong
    Liu, Cheng-Lin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 49 - 61
  • [27] Keyword Spotting Techniques for Sanskrit Documents
    Bhardwaj, Anurag
    Setlur, Srirangaraj
    Govindaraju, Venu
    SANSKRIT COMPUTATIONAL LINGUISTICS, INVITED PAPERS, 2009, 5402 : 403 - 416
  • [28] A Survey on handwritten documents word spotting
    Ahmed R.
    Al-Khatib W.G.
    Mahmoud S.
    International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 31 - 47
  • [29] Word Spotting as a Service for Handwritten Documents
    Amanatiadis, Angelos
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [30] Spotting words in handwritten Arabic documents
    Srihari, Sargur
    Srinivasan, Harish
    Babu, Pavithra
    Bhole, Chetan
    DOCUMENT RECOGNITION AND RETRIEVAL XIII, 2006, 6067