Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents

被引:6
|
作者
Zhang, Heng [1 ]
Wang, Da-Han [1 ]
Liu, Cheng-Lin [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Online Chinese handwritten documents; Keyword spotting; Posterior probability; N-best list; Confidence measure; Confusion network; SPEECH RECOGNITION; SEGMENTATION; CONSENSUS;
D O I
10.1016/j.patcog.2013.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In keyword spotting from handwritten documents by text query, the word similarity is usually computed by combining character similarities, which are desired to approximate the logarithm of the character probabilities. In this paper, we propose to directly estimate the posterior probability (also called confidence) of candidate characters based on the N-best paths from the candidate segmentation-recognition lattice. On evaluating the candidate segmentation-recognition paths by combining multiple contexts, the scores of the N-best paths are transformed to posterior probabilities using soft-max. The parameter of soft-max (confidence parameter) is estimated from the character confusion network, which is constructed by aligning different paths using a string matching algorithm. The posterior probability of a candidate character is the summation of the probabilities of the paths that pass through the candidate character. We compare the proposed posterior probability estimation method with some reference methods including the word confidence measure and the text line recognition method. Experimental results of keyword spotting on a large database CASIA-OLHWDB of unconstrained online Chinese handwriting demonstrate the effectiveness of the proposed method. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1880 / 1890
页数:11
相关论文
共 40 条
  • [31] An Irrelevant Variability Normalization Based Discriminative Training Approach for Online Handwritten Chinese Character Recognition
    Du, Jun
    Huo, Qiang
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 69 - 73
  • [32] A Prefix Tree Based n-best List Re-scoring Strategy for Recurrent Neural Network Language Model
    SI Yujing
    LI Ta
    PAN Jielin
    YAN Yonghong
    [J]. Chinese Journal of Electronics, 2014, 23 (01) : 70 - 74
  • [34] Writer Adaptive Feature Extraction Based on Convolutional Neural Networks For Online Handwritten Chinese Character Recognition
    Du, Jun
    Zhai, Jian-Fang
    Hu, Jin-Shui
    Zhu, Bo
    Wei, Si
    Dai, Li-Rong
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 841 - 845
  • [35] A Prefix Tree Based n-best List Re-scoring Strategy for Recurrent Neural Network Language Model
    Si Yujing
    Li Ta
    Pan Jielin
    Yan Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2014, 23 (01) : 70 - 74
  • [36] Online Handwritten Chinese Character Recognition Based on 1-D Convolution and Two-Streams Transformers
    Chen, Yihong
    Zheng, Hao
    Li, Yanchun
    Ouyang, Wanli
    Zhu, Jiang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5769 - 5781
  • [37] RESCORING N-BEST SPEECH RECOGNITION LIST BASED ON ONE-ON-ONE HYPOTHESIS COMPARISON USING ENCODER-CLASSIFIER MODEL
    Ogawa, Atsunori
    Delcroix, Marc
    Karita, Shigeki
    Nakatani, Tomohiro
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6099 - 6103
  • [38] A minimax classification approach to HMM-based online handwritten chinese character recognition robust against affine distortions
    Huo, Qiang
    He, Tingting
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 43 - 47
  • [39] Prefix Tree based N-best list Re-scoring for Recurrent Neural Network Language Model used in Speech Recognition System
    Si, Yujing
    Zhang, Qingqing
    Li, Ta
    Pan, Jielin
    Yan, Yonghong
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3386 - 3390
  • [40] An irrelevant variability normalization approach to discriminative training of multi-prototype based classifiers and its applications for online handwritten Chinese character recognition
    Du, Jun
    Huo, Qiang
    [J]. PATTERN RECOGNITION, 2014, 47 (12) : 3959 - 3966