Text detection and recognition in natural scene with edge analysis

被引:15
|
作者
Yu, Chong [1 ]
Song, Yonghong [1 ]
Meng, Quan [1 ]
Zhang, Yuanlin [1 ]
Liu, Yang [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
IMAGES; VIDEO;
D O I
10.1049/iet-cvi.2013.0307
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text plays an important role in daily life because of its rich information, thus automatic text detection in natural scenes has many attractive applications. However, detecting and recognising such text is always a challenging problem. In this study, the authors propose a method which extends the widely-used stroke width transform by two steps of edge analysis, namely candidate edge recombination and edge classification. A new method that recognises text through candidate edge recombination and candidate edge recognition is also proposed. In the step of candidate edge recombination, they use the idea of over-segmentation and region merging. To separate text edge from background, the edge of the input image is first divided into small segments. Then, neighbour edge segments are merged, if they have similar stroke width and colour. Through this step, each character is described by one candidate boundary. In the step of boundary classification, candidate boundaries are aggregated into text chains, followed by chain classification using character-based and chain-based features. To recognise text, the grey image is extracted based on the location of each candidate edge after the step of candidate edge recombination. Then, histogram of gradient features and a classifier are used to recognise each character. To evaluate the effectiveness of their method, the algorithm is run on the ICDAR competition dataset and Street View Text database. The experimental results show that the proposed method provides promising performance in comparison with the existing methods.
引用
收藏
页码:603 / 613
页数:11
相关论文
共 50 条
  • [41] Arabic Cursive Text Recognition from Natural Scene Images
    Bin Ahmed, Saad
    Naz, Saeeda
    Razzak, Muhammad Imran
    Yusof, Rubiyah
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (02):
  • [42] English Text Localization and Recognition from Natural Scene Image
    Satwashil, Kakade Snehal
    Pawar, V. R.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 555 - 559
  • [43] LED Dot matrix text recognition method in natural scene
    Wahyono
    Jo, Kanghyun
    [J]. NEUROCOMPUTING, 2015, 151 : 1033 - 1041
  • [44] A cascaded method for text detection in natural scene images
    Zheng, Yang
    Li, Qing
    Liu, Jie
    Liu, Heping
    Li, Gen
    Zhang, Shuwu
    [J]. NEUROCOMPUTING, 2017, 238 : 307 - 315
  • [45] Cascade Detector for Text Detection in Natural Scene Images
    Hanif, Shehzad Muhammad
    Prevost, Lionel
    Negri, Pablo Augusto
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1917 - +
  • [46] Improved FCENet Algorithm for Natural Scene Text Detection
    Zhou, Yan
    Liao, Junwei
    Liu, Xiangyu
    Zhou, Yuexia
    Zeng, Fanzhi
    [J]. Computer Engineering and Applications, 2024, 60 (03) : 228 - 235
  • [47] Fast and Accurate Text Detection in Natural Scene Images
    Xiao, Chengqiu
    Ji, Lixin
    Gao, Chao
    Li, Shaomei
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 1 - 10
  • [48] Method for unconstrained text detection in natural scene image
    Liu, Zhandong
    Li, Yong
    Qi, Xiangwei
    Yang, Yong
    Nian, Mei
    Zhang, Haijun
    Xiamixiding, Reziwanguli
    [J]. IET COMPUTER VISION, 2017, 11 (07) : 596 - 604
  • [49] A SWT Verified Method of Natural Scene Text Detection
    Huang Jian
    Liu Xiaopei
    Zhao Qian
    [J]. 2016 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C), 2016, : 980 - 984
  • [50] Integrated Method for Text Detection in Natural Scene Images
    Zheng, Yang
    Liu, Jie
    Liu, Heping
    Li, Qing
    Li, Gen
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (11): : 5583 - 5604