An improved scene text extraction method using Conditional Random Field and Optical Character Recognition

被引:17
|
作者
Zhang, Hongwei [1 ]
Liu, Changsong [1 ]
Yang, Cheng [1 ]
Ding, Xiaoqing [1 ]
Wang, KongQiao [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Nokia Res Ctr Beijing, BDA, Beijing 100176, Peoples R China
基金
中国国家自然科学基金;
关键词
CRF; OCR; BP; Scene text extraction;
D O I
10.1109/ICDAR.2011.148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past few years, research on scene text extraction has developed rapidly. Recently, condition random field (CRF) has been used to give connected components (CCs) 'text' or 'non-text' labels. However, a burning issue in CRF model comes from multiple text lines extraction. In this paper, we propose a two-step iterative CRF algorithm with a Belief Propagation inference and an OCR filtering stage. Two kinds of neighborhood relationship graph are used in the respective iterations for extracting multiple text lines. Furthermore, OCR confidence is used as an indicator for identifying the text regions, while a traditional OCR filter module only considered the recognition results. The first CRF iteration aims at finding certain text CCs, especially in multiple text lines, and sending uncertain CCs to the second iteration. The second iteration gives second chance for the uncertain CCs and filter false alarm CCs with the help of OCR. Experiments based on the public dataset of ICDAR 2005 prove that the proposed method is comparative with the existing algorithms.
引用
收藏
页码:708 / 712
页数:5
相关论文
共 50 条
  • [1] Natural Scene Character Recognition using Markov Random Field
    Liu, Xiaolong
    Lu, Tong
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 396 - 400
  • [2] Optical Character Recognition for Scene Text Detection, Mining and Recognition
    Nathiya, N.
    Pradeepa, K.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 662 - 665
  • [3] Character Recognition using Conditional Random Field based Matching Engine
    Ray, Anupama
    Chandawala, Ankit
    Chaudhary, Santanu
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 18 - 22
  • [4] Scene Text Detection with Robust Character Candidate Extraction Method
    Sung, Myung-Chul
    Jun, Bongjin
    Cho, Hojin
    Kim, Daijin
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 426 - 430
  • [5] Scene text detection with robust character candidate extraction method
    Department of Computer Science and Engineering, POSTECH, Pohang, Korea, Republic of
    不详
    Proc. Int. Conf. Doc. Anal. Recognit., (426-430):
  • [6] Named Entity Recognition in Text Documents Using a Modified Conditional Random Field
    Veena, G.
    Gupta, Deepa
    Lakshmi, S.
    Jacob, Jeenu T.
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 3, 2018, 709 : 31 - 41
  • [7] OPTICAL CHARACTER RECOGNITION USING A NEW METHOD OF CHARACTERISTIC EXTRACTION
    SHONO, Y
    INUZUKA, T
    APPLIED OPTICS, 1972, 11 (05): : 1271 - &
  • [8] A Component-based On-line Handwritten Tibetan Character Recognition Method Using Conditional Random Field
    Ma, Long-Long
    Wu, Jian
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 704 - 709
  • [9] Scene Text Character Recognition Using Spatiality Embedded Dictionary
    Gao, Song
    Wang, Chunheng
    Xiao, Baihua
    Shi, Cunzhao
    Zhou, Wen
    Zhang, Zhong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (07): : 1942 - 1946
  • [10] Improved local binary pattern for real scene optical character recognition
    Yang, Chu-Sing
    Yang, Yung-Hsuan
    PATTERN RECOGNITION LETTERS, 2017, 100 : 14 - 21