Text String Detection From Natural Scenes by Structure-Based Partition and Grouping

被引:201
|
作者
Yi, Chucai [1 ]
Tian, YingLi [2 ]
机构
[1] CUNY, Grad Ctr, New York, NY 10016 USA
[2] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Adjacent character grouping; character property; image partition; text line grouping; text string detection; text string structure; EXTRACTION; SEGMENTATION;
D O I
10.1109/TIP.2011.2126586
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text information in natural scene images serves as important clues for many image-based applications such as scene understanding, content-based image retrieval, assistive navigation, and automatic geocoding. However, locating text from a complex background with multiple colors is a challenging task. In this paper, we explore a new framework to detect text strings with arbitrary orientations in complex natural scene images. Our proposed framework of text string detection consists of two steps: 1) image partition to find text character candidates based on local gradient features and color uniformity of character components and 2) character candidate grouping to detect text strings based on joint structural features of text characters in each text string such as character size differences, distances between neighboring characters, and character alignment. By assuming that a text string has at least three characters, we propose two algorithms of text string detection: 1) adjacent character grouping method and 2) text line grouping method. The adjacent character grouping method calculates the sibling groups of each character candidate as string segments and then merges the intersecting sibling groups into text string. The text line grouping method performs Hough transform to fit text line among the centroids of text candidates. Each fitted text line describes the orientation of a potential text string. The detected text string is presented by a rectangle region covering all characters whose centroids are cascaded in its text line. To improve efficiency and accuracy, our algorithms are carried out in multi-scales. The proposed methods outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation. Furthermore, the effectiveness of our methods to detect text strings with arbitrary orientations is evaluated on the Oriented Scene Text Dataset collected by ourselves containing text strings in nonhorizontal orientations.
引用
收藏
页码:2594 / 2605
页数:12
相关论文
共 50 条
  • [21] Skew Distribution NMS Algorithm for Text Detection in Natural Scenes
    Zhou, Gang
    Yang, Youwei
    Mo, Jiaqing
    Liu, Qiuling
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 212 - 217
  • [22] A Traffic Sign Text Detection System for Pratical Natural Scenes
    Zuo, Zhongrong
    Yang, Pengtao
    2018 IEEE 24TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2018), 2018, : 1069 - 1074
  • [23] Intelligent Detection Method of English Text in Natural Scenes in Video
    Dai, Liqin
    Chen, ChunHua
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [24] Text recognition in natural scenes based on deep learning
    Jiang, Yi
    Jiang, Zhongyu
    He, Liang
    Chen, Shuai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (08) : 10545 - 10559
  • [25] Text recognition in natural scenes based on deep learning
    Yi Jiang
    Zhongyu Jiang
    Liang He
    Shuai Chen
    Multimedia Tools and Applications, 2022, 81 : 10545 - 10559
  • [26] Segmentation of Natural Scenes Based on Visual Attention and Gestalt Grouping Laws
    Mesquita, R. G.
    Mello, C. A. B.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 4237 - 4242
  • [27] A New Method to Extract Text from Natural Scenes
    郝峻晟
    戚飞虎
    朱凯华
    蒋人杰
    Journal of DongHua University, 2005, (04) : 52 - 57
  • [28] Class Hierarchical Structure-based Text Classification
    Chen, Xiaoyun
    Chen, Jinhua
    ADVANCES IN CIVIL ENGINEERING, PTS 1-6, 2011, 255-260 : 2233 - 2237
  • [29] Text Detection Algorithm for Natural Scenes under Attention Supervision Strategy
    Haorang L.
    Lingchen Y.
    Ronghua L.
    Long C.
    Hao W.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (07): : 1011 - 1019
  • [30] Structure-based synthesis: From natural products to drug prototypes
    Hanessian, Stephen
    PURE AND APPLIED CHEMISTRY, 2009, 81 (06) : 1085 - 1091