Cascaded Segmentation-Detection Networks for Word-Level Text Spotting

被引:13
|
作者
Qin, Siyang [1 ]
Manduchi, Roberto [1 ]
机构
[1] Univ Calif Santa Cruz, Comp Engn Dept, Santa Cruz, CA 95064 USA
关键词
scene text detection; convolutional neural network; COMPETITION;
D O I
10.1109/ICDAR.2017.210
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce an algorithm for word-level text spotting that is able to accurately and reliably determine the bounding regions of individual words of text "in the wild". Our system is formed by the cascade of two convolutional neural networks. The first network is fully convolutional and is in charge of detecting areas containing text. This results in a very reliable but possibly inaccurate segmentation of the input image. The second network (inspired by the popular YOLO architecture) analyzes each segment produced in the first stage, and predicts oriented rectangular regions containing individual words. No post-processing (e.g. text line grouping) is necessary. With execution time of 450 ms for a 1000x560 image on a Titan X GPU, our system achieves good performance on the ICDAR 2013, 2015 benchmarks [2], [1].
引用
收藏
页码:1275 / 1282
页数:8
相关论文
共 50 条
  • [1] Cascaded Segmentation-Detection Networks for Text-Based Traffic Sign Detection
    Zhu, Yingying
    Liao, Minghui
    Yang, Mingkun
    Liu, Wenyu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (01) : 209 - 219
  • [2] MODEL FOR WORD-LEVEL CONVERSION OF ARBITRARY TEXT TO SPEECH
    ALLEN, J
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 356 - &
  • [3] Infants' use of rhythmic cues in word-level segmentation
    Childers, JB
    Echols, CH
    [J]. PROCEEDINGS OF THE 20TH ANNUAL BOSTON UNIVERSITY CONFERENCE ON LANGUAGE DEVELOPMENT, VOLS 1 AND 2, 1996, : 167 - 176
  • [4] Word-level and phrase-level strategies for figurative text identification
    Qimeng Yang
    Long Yu
    Shengwei Tian
    Jinmiao Song
    [J]. Multimedia Tools and Applications, 2022, 81 : 14339 - 14353
  • [5] Word-level and phrase-level strategies for figurative text identification
    Yang, Qimeng
    Yu, Long
    Tian, Shengwei
    Song, Jinmiao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 14339 - 14353
  • [7] Combating Word-level Adversarial Text with Robust Adversarial Training
    Du, Xiaohu
    Yu, Jie
    Li, Shasha
    Yi, Zibo
    Liu, Hai
    Ma, Jun
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Word-level text highlighting of medical texts for telehealth services
    Ozyegen, Ozan
    Kabe, Devika
    Cevik, Mucahit
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 127
  • [9] Transition-Based Neural Word Segmentation Using Word-Level Features
    Zhang, Meishan
    Zhang, Yue
    Fu, Guohong
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 63 : 923 - 953
  • [10] Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines
    Bimal Bhattarai
    Ole-Christoffer Granmo
    Lei Jiao
    [J]. Applied Intelligence, 2022, 52 : 17465 - 17489