Video text recognition using sequential Monte Carlo and error voting methods

被引:32
|
作者
Chen, DT [1 ]
Odobez, JM [1 ]
机构
[1] IDIAP Res Inst, CH-1920 Martigny, Valais, Switzerland
关键词
video text recognition; text segmentation; sequential Monte-Carlo filter; language model; recognition output voting error reduction;
D O I
10.1016/j.patrec.2004.11.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the issue of segmentation and recognition of text embedded in video sequences from their associated text image sequence extracted by a text detection module. To this end, we propose a probabilistic algorithm based on Bayesian adaptive thresholding and Monte-Carlo sampling. The algorithm approximates the posterior distribution of segmentation thresholds of text pixels in an image by a set of weighted samples. The set of samples is initialized by applying a classical segmentation algorithm on the first video frame and further refined by random sampling under a temporal Bayesian framework. One important contribution of the paper is to show that, thanks to the proposed methodology, the likelihood of a segmentation parameter sample can be estimated not using a classification criterion or a visual quality criterion based on the produced segmentation map, but directly from the induced text recognition result, which is directly relevant to our task. Furthermore, as a second contribution of the paper, we propose to align text recognition results from high confidence samples gathered over time, to composite a final result using error voting technique (ROVER) at the character level. Experiments are conducted on a two hour video database. Character recognition rates higher than 93%, and word error rates higher than 90% are achieved, which are 4% and 3% more than state-of-the-art methods applied to the same database. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:1386 / 1403
页数:18
相关论文
共 50 条
  • [1] Sequential Monte Carlo video text segmentation
    Chen, DT
    Odobez, JM
    [J]. 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 21 - 24
  • [2] Error analysis of sequential Monte Carlo methods for transport problems
    Kong, R
    Spanier, J
    [J]. MONTE CARLO AND QUASI-MONTE CARLO METHODS 1998, 2000, : 252 - 272
  • [3] Monte Carlo video text segmentation
    Chen, DT
    Odobez, JM
    Thiran, JP
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (05) : 647 - 661
  • [4] Inverse kinematics using sequential Monte Carlo methods
    Courty, Nicolas
    Arnaud, Elise
    [J]. ARTICULATED MOTION AND DEFORMABLE OBJECTS, PROCEEDINGS, 2008, 5098 : 1 - +
  • [5] Bayesian Geosteering Using Sequential Monte Carlo Methods
    Veettil, Dilshad R. Akkam
    Clark, Kit
    [J]. PETROPHYSICS, 2020, 61 (01): : 99 - 111
  • [6] Nested Sequential Monte Carlo Methods
    Naesseth, Christian A.
    Lindsten, Fredrik
    Schon, Thomas B.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1292 - 1301
  • [7] Satellite Image Denoising using Sequential Monte Carlo Methods
    Solarte, Kelvin
    Sanchez, Luis
    Ordonez, Joan
    [J]. INGENIERIA UC, 2014, 21 (02): : 35 - 42
  • [8] Structure from motion using sequential Monte Carlo methods
    Qian, G
    Chellappa, R
    [J]. EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, 2001, : 614 - 621
  • [9] Structure from motion using sequential Monte Carlo methods
    Qian, G
    Chellappa, R
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (01) : 5 - 31
  • [10] Structure from Motion Using Sequential Monte Carlo Methods
    Gang Qian
    Rama Chellappa
    [J]. International Journal of Computer Vision, 2004, 59 : 5 - 31