Bayesian background models for keyword spotting in handwritten documents

被引:6
|
作者
Kumar, Gaurav [1 ]
Govindaraju, Venu [1 ]
机构
[1] Univ Buffalo, Dept Comp Sci & Engn, 113 Davis Hall, Amherst, NY 14260 USA
关键词
Handwriting recognition; Keyword spotting; Bayesian generalized linear models; Bayesian generalized kernel models; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.patcog.2016.06.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background in a handwritten document can be anything other than the words we are interested in. The characteristics of the background are typically captured by a background model to achieve spotting in handwritten documents. We propose two such Bayesian background models for keyword spotting in handwritten documents. Firstly, we present a background model using the Bayesian generalized linear model called (VDBM) and secondly propose a Bayesian generalized kernel background model called BGKBM. Given a set of handwritten documents and a bunch of keyword and non-keyword scores, the models learn an efficient Bayesian rejection criteria to output the most confident keyword regions in the handwritten document. For the variational dynamic background model (VDBM) the inference of parameters is done using variational methods and for the Bayesian generalized kernel background model (BGKBM), the inference is done using a proposed Markov chain Monte Carlo (MCMC) approach. The models are built on top of the scores returned by a handwritten recognizer for keywords and non-keywords. The approach is recognition based and works at line level. The methods have been validated on publicly available IAM dataset and compared with other state of the art line level keyword spotting approaches.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [41] Keyword Spotting in Online Handwritten Documents Containing Text and Non-Text using BLSTM Neural Networks
    Indermuehle, Emanuel
    Frinken, Volkmar
    Fischer, Andreas
    Bunke, Horst
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 73 - 77
  • [42] Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage
    Ahmed Cheikhrouhou
    Yousri Kessentini
    Slim Kanoun
    Neural Computing and Applications, 2020, 32 : 9201 - 9215
  • [43] Dynamic handwritten keyword spotting based on the NSHP-HMM
    Choisy, Christophe
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 242 - 246
  • [44] Combining Neural Networks to Improve Performance of Handwritten Keyword Spotting
    Frinken, Volkmar
    Fischer, Andreas
    Bunke, Horst
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2010, 5997 : 215 - 224
  • [45] Script Independent Word Spotting in Offline Handwritten Documents Based on Hidden Markov Models
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 14 - 19
  • [46] Improved Keyword Spotting based on Keyword/Garbage Models
    Chen, Qiyu
    Zhang, Weibin
    Xu, Xiangmin
    Xing, Xiaofen
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [47] Word Spotting based Retrieval of Urdu Handwritten Documents
    Abidi, Ali
    Jamil, Akhtar
    Siddiqi, Imran
    Khurshid, Khurram
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 331 - 336
  • [48] KEYWORD SPOTTING FROM ONLINE CHINESE HANDWRITTEN DOCUMENTS USING ONE-VERSUS-ALL CHARACTER CLASSIFICATION MODEL
    Zhang, Heng
    Wang, Da-Han
    Liu, Cheng-Lin
    Bunke, Horst
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (03)
  • [49] Benchmarking discriminative approaches for word spotting in handwritten documents
    Bideault, Gautier
    Mioulet, Luc
    Chatelain, Clement
    Paquet, Thierry
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 201 - 205
  • [50] Word Spotting and Regular Expression Detection in Handwritten Documents
    Kessentini, Yousri
    Chatelain, Clement
    Paquet, Thierry
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 516 - 520