Scene Text Detection by Leveraging Multi-channel Information and Local Context

被引:0
|
作者
Wang, Runmin [1 ]
Qian, Shengyou [1 ]
Yang, Jianfeng [1 ]
Gao, Changxin [2 ]
机构
[1] Hunan Normal Univ, Sch Phys & Informat Sci, Lushan Load 36, Changsha, Hunan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sci & Technol Multispectral Informat Proc Lab, Lushan Load 1037, Wuhan, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Text detection; multi-channel information; cascaded strategy; visual context; IMAGES;
D O I
10.1117/12.2284295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an important information carrier, texts play significant roles in many applications. However, text detection in unconstrained scenes is a challenging problem due to cluttered backgrounds, various appearances, uneven illumination, etc.. In this paper, an approach based on multi-channel information and local context is proposed to detect texts in natural scenes. According to character candidate detection plays a vital role in text detection system, Maximally Stable Extremal Regions(MSERs) and Graph-cut based method are integrated to obtain the character candidates by leveraging the multi-channel image information. A cascaded false positive elimination mechanism are constructed from the perspective of the character and the text line respectively. Since the local context information is very valuable for us, these information is utilized to retrieve the missing characters for boosting the text detection performance. Experimental results on two benchmark datasets, i.e., the ICDAR 2011 dataset and the ICDAR 2013 dataset, demonstrate that the proposed method have achieved the state-of-the-art performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Text Detection in Natural Scene Images Leveraging Context Information
    Wang, Runmin
    Sang, Nong
    Gao, Changxin
    Kuang, Xiaoqin
    Xiang, Jun
    [J]. PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 444 - 454
  • [2] Natural Scene Text Detection Based on Multi-Channel FASText
    Guo Chenfeng
    Liu Juhua
    [J]. PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INFORMATION ENGINEERING (ICACIE 2017), 2017, 119 : 16 - 20
  • [3] LEVERAGING SURROUNDING CONTEXT FOR SCENE TEXT DETECTION
    Li, Yao
    Shen, Chunhua
    Jia, Wenjing
    van den Hengel, Anton
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2264 - 2268
  • [4] Natural Scene Text Detection with Multi-channel Connected Component Segmentation
    Wang, Xiaobing
    Song, Yonghong
    Zhang, Yuanlin
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1375 - 1379
  • [5] Aggregating Local Context for Accurate Scene Text Detection
    He, Dafang
    Yang, Xiao
    Huang, Wenyi
    Zhou, Zihan
    Kifer, Daniel
    Giles, C. Lee
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 280 - 296
  • [6] Text detection in color scene images based on unsupervised clustering of multi-channel wavelet features
    Saoi, T
    Goto, H
    Kobayashi, H
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 690 - 694
  • [7] Natural scene text detection and recognition based on saturation-incorporated multi-channel MSER
    Tong, Guoxiang
    Dong, Ming
    Sun, Xiaoxia
    Song, Yan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [8] Scene Text Identification by Leveraging Mid-level Patches and Context Information
    Wang, Runmin
    Sang, Nong
    Gao, Changxin
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (07) : 963 - 967
  • [9] MIX: Multi-Channel Information Crossing for Text Matching
    Chen, Haolan
    Han, Fred X.
    Niu, Di
    Liu, Dong
    Lai, Kunfeng
    Wu, Chenglin
    Xu, Yu
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 110 - 119
  • [10] Multi-orientation Scene Text Detection Leveraging Background Suppression
    Wang, Xihan
    Feng, Xiaoyi
    Xia, Zhaoqiang
    Peng, Jinye
    Granger, Eric
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 555 - 566