An improved scene text extraction method using Conditional Random Field and Optical Character Recognition

Cited by: 17
Authors
Zhang, Hongwei [1]
Liu, Changsong [1]
Yang, Cheng [1]
Ding, Xiaoqing [1]
Wang, KongQiao [2]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Nokia Res Ctr Beijing, BDA, Beijing 100176, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
CRF; OCR; BP; Scene text extraction;
DOI
10.1109/ICDAR.2011.148
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Over the past few years, research on scene text extraction has developed rapidly. Recently, conditional random fields (CRFs) have been used to assign 'text' or 'non-text' labels to connected components (CCs). However, extracting multiple text lines remains a pressing issue for CRF models. In this paper, we propose a two-step iterative CRF algorithm with Belief Propagation (BP) inference and an OCR filtering stage. Two kinds of neighborhood relationship graphs are used in the respective iterations to extract multiple text lines. Furthermore, OCR confidence is used as an indicator for identifying text regions, whereas a traditional OCR filter module considers only the recognition results. The first CRF iteration aims to find certain text CCs, especially in multiple text lines, and sends uncertain CCs to the second iteration. The second iteration gives the uncertain CCs a second chance and filters out false-alarm CCs with the help of OCR. Experiments on the public ICDAR 2005 dataset show that the proposed method is comparable to existing algorithms.
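
The two-pass flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the potentials, the thresholds (`hi`, `lo`, `ocr_min`), the agreement weight `coupling`, the two edge lists, and the per-CC OCR confidence scores are all hypothetical placeholders. The sketch runs sum-product belief propagation on a binary pairwise model, keeps confident text CCs after the first pass, re-scores the uncertain ones on a second graph, and then applies a simple OCR-confidence filter to all remaining candidates (an assumption about where the filter sits).

```python
from math import prod

def belief_propagation(unary, edges, coupling=2.0, iters=20):
    """Sum-product BP on a binary pairwise model; returns P(text) per node.

    unary: node -> (non_text_score, text_score), positive numbers.
    edges: undirected (i, j) pairs. `coupling` (assumed) rewards equal labels.
    """
    neighbors = {n: [] for n in unary}
    for i, j in edges:
        neighbors[i].append(j)
        neighbors[j].append(i)
    # msg[(i, j)][x] is the message from node i to node j about label x of j.
    msg = {(i, j): [1.0, 1.0] for i in unary for j in neighbors[i]}
    for _ in range(iters):
        new = {}
        for (i, j) in msg:
            out = []
            for xj in (0, 1):
                total = 0.0
                for xi in (0, 1):
                    incoming = prod(msg[(k, i)][xi] for k in neighbors[i] if k != j)
                    pair = coupling if xi == xj else 1.0
                    total += unary[i][xi] * pair * incoming
                out.append(total)
            norm = out[0] + out[1]
            new[(i, j)] = [out[0] / norm, out[1] / norm]
        msg = new
    beliefs = {}
    for i in unary:
        b = [unary[i][x] * prod(msg[(k, i)][x] for k in neighbors[i]) for x in (0, 1)]
        beliefs[i] = b[1] / (b[0] + b[1])
    return beliefs

def two_step_extract(unary, edges_first, edges_second, ocr_conf,
                     hi=0.8, lo=0.2, ocr_min=0.5):
    """Hypothetical two-pass labeling followed by an OCR-confidence filter."""
    first = belief_propagation(unary, edges_first)
    certain = {n for n, p in first.items() if p >= hi}
    uncertain = {n for n, p in first.items() if lo < p < hi}
    # Second pass: pin decided CCs with strong scores, re-score uncertain ones.
    unary2 = {}
    for n in unary:
        if n in certain:
            unary2[n] = (0.01, 0.99)
        elif n in uncertain:
            unary2[n] = unary[n]
        else:
            unary2[n] = (0.99, 0.01)
    second = belief_propagation(unary2, edges_second)
    candidates = certain | {n for n in uncertain if second[n] >= 0.5}
    # Assumed filter: drop candidates whose OCR confidence is too low.
    return {n for n in candidates if ocr_conf.get(n, 0.0) >= ocr_min}

# Toy data (invented): four CCs, a tight graph first, a looser graph second.
unary = {"a": (0.1, 0.9), "b": (0.2, 0.8), "c": (0.5, 0.5), "d": (0.9, 0.1)}
ocr_conf = {"a": 0.9, "b": 0.9, "c": 0.7, "d": 0.1}
result = two_step_extract(unary, [("a", "b")], [("a", "b"), ("b", "c")], ocr_conf)
print(sorted(result))  # ['a', 'b', 'c']
```

In this toy run, `c` is undecided after the first pass and is accepted in the second only because the looser graph connects it to the confident CC `b`; lowering its OCR confidence below `ocr_min` would remove it again.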
Pages: 708-712
Number of pages: 5