Detection of artificial and scene text in images and video frames

被引:20
|
作者
Anthimopoulos, Marios [1 ]
Gatos, Basilis [1 ]
Pratikakis, Ioannis [2 ]
机构
[1] Demokritos Natl Ctr Sci Res, Computat Intelligence Lab, Inst Informat & Telecommun, GR-15310 Athens, Greece
[2] Democritus Univ Thrace, Dept Elect & Comp Engn, GR-67100 Xanthi, Greece
关键词
Text detection; Artificial text; Scene text; Natural scene images; Video OCR; Multimedia information retrieval; SEGMENTATION; RECOGNITION; EXTRACTION; LOCATION;
D O I
10.1007/s10044-011-0237-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Textual information in images and video frames constitutes a valuable source of high-level semantics for multimedia indexing and retrieval systems. Text detection is the most crucial step in a multimedia text extraction system and although it has been extensively studied the past decade still, it does not exist a generic architecture that would work for artificial and scene text in multimedia content. In this paper we propose a system for text detection of both artificial and scene text in images and video frames. The system is based on a machine learning stage which uses an Random Forest classifier and a highly discriminative feature set produced by using a new texture operator called Multilevel Adaptive Color edge Local Binary Pattern (MACeLBP). MACeLBP describes the spatial distribution of color edges in multiple adaptive levels of contrast. Then, a gradient-based algorithm is applied to achieve distinction among text lines as well as refinement in the localization of the text lines. The whole algorithm is situated in a multiresolution framework to achieve invariance to scale for the detection of text lines. Finally, an optional connected-component step segments text lines into words based on the distances between the resulting components. The experimental results are produced by applying a concise evaluation methodology and prove the superior performance achieved by the proposed text detection system for artificial and scene text in images and video frames.
引用
收藏
页码:431 / 446
页数:16
相关论文
共 50 条
  • [1] Detection of artificial and scene text in images and video frames
    Marios Anthimopoulos
    Basilis Gatos
    Ioannis Pratikakis
    [J]. Pattern Analysis and Applications, 2013, 16 : 431 - 446
  • [2] Multi-oriented text detection and verification in video frames and scene images
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Roy, Partha Pratim
    Pal, Umapada
    [J]. NEUROCOMPUTING, 2018, 275 : 1531 - 1549
  • [3] Video Scene Text Frames Categorization for Text Detection and Recognition
    Qin, Longfei
    Shivakumara, Palaiahnakote
    Lu, Tong
    Pal, Umapada
    Tan, Chew Lim
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
  • [4] Text detection and recognition in images and video frames
    Chen, DT
    Odobez, JM
    Bourlard, H
    [J]. PATTERN RECOGNITION, 2004, 37 (03) : 595 - 608
  • [5] Fast and robust text detection in images and video frames
    Ye, QX
    Huang, QM
    Gao, W
    Zhao, DB
    [J]. IMAGE AND VISION COMPUTING, 2005, 23 (06) : 565 - 576
  • [6] An Adaptive Text Detection Approach in Images and Video Frames
    Li, Minhua
    Wang, Chunheng
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 72 - 77
  • [7] A robust text detection algorithm in images and video frames
    Ye, QX
    Gao, W
    Wang, WQ
    Zeng, W
    [J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 802 - 806
  • [8] A new text detection algorithm in images/video frames
    Ye, QX
    Huang, QM
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004, 3332 : 858 - 865
  • [9] Forged text detection in video, scene, and document images
    Nandanwar, Lokesh
    Shivakumara, Palaiahnakote
    Mondal, Prabir
    Raghunandan, Karpuravalli Srinivas
    Pal, Umapada
    Lu, Tong
    Lopresti, Daniel
    [J]. IET IMAGE PROCESSING, 2020, 14 (17) : 4744 - 4755
  • [10] Hybrid Chinese/English text detection in images and video frames
    Mao, WG
    Chung, FL
    Lam, KKM
    Siu, WC
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 1015 - 1018