Detection and recognition of cursive text from video frames

被引:7
|
作者
Mirza, Ali [1 ]
Zeshan, Ossama [1 ]
Atif, Muhammad [1 ]
Siddiqi, Imran [1 ]
机构
[1] Bahria Univ, Islamabad, Pakistan
关键词
Text detection; Text recognition; Script identification; Deep neural networks (DNNs); Convolutional neural networks (CNNs); Long short-term memory (LSTM) networks; Caption text; Cursive text; ARTIFICIAL URDU TEXT; NATURAL SCENE IMAGE; SCRIPT IDENTIFICATION; NEURAL-NETWORK; HYBRID APPROACH; LOCALIZATION; FEATURES; REPRESENTATION; SEGMENTATION; EXTRACTION;
D O I
10.1186/s13640-020-00523-5
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Textual content appearing in videos represents an interesting index for semantic retrieval of videos (from archives), generation of alerts (live streams), as well as high level applications like opinion mining and content summarization. The key components of such systems require detection and recognition of textual content which also make the subject of our study. This paper presents a comprehensive framework for detection and recognition of textual content in video frames. More specifically, we target cursive scripts taking Urdu text as a case study. Detection of textual regions in video frames is carried out by fine-tuning deep neural networks based object detectors for the specific case of text detection. Script of the detected textual content is identified using convoluational neural networks (CNNs), while for recognition, we propose a UrduNet, a combination of CNNs and long short- term memory (LSTM) networks. A benchmark dataset containing cursive text with more than 13,000 video frame is also developed. A comprehensive series of experiments is carried out reporting an F-measure of 88.3% for detection while a recognition rate of 87%.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Detection of artificial and scene text in images and video frames
    Anthimopoulos, Marios
    Gatos, Basilis
    Pratikakis, Ioannis
    PATTERN ANALYSIS AND APPLICATIONS, 2013, 16 (03) : 431 - 446
  • [22] Fast and robust text detection in images and video frames
    Ye, QX
    Huang, QM
    Gao, W
    Zhao, DB
    IMAGE AND VISION COMPUTING, 2005, 23 (06) : 565 - 576
  • [23] Detection of artificial and scene text in images and video frames
    Marios Anthimopoulos
    Basilis Gatos
    Ioannis Pratikakis
    Pattern Analysis and Applications, 2013, 16 : 431 - 446
  • [24] TEXT DETECTION IN VIDEO FRAMES USING HYBRID FEATURES
    Ji, Zhong
    Wang, Jian
    Su, Yu-Ting
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 318 - 322
  • [25] A robust text detection algorithm in images and video frames
    Ye, QX
    Gao, W
    Wang, WQ
    Zeng, W
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 802 - 806
  • [26] A new text detection algorithm in images/video frames
    Ye, QX
    Huang, QM
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004, 3332 : 858 - 865
  • [27] ResNet CNN with LSTM Based Tamil Text Detection from Video Frames
    Muthumani, I
    Malmurugan, N.
    Ganesan, L.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (02): : 917 - 928
  • [28] ICDAR2017 Competition on Arabic Text Detection and Recognition in Multi-resolution Video Frames
    Zayene, Oussama
    Hennebert, Jean
    Ingold, Rolf
    BenAmara, Najoua Essoukri
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1460 - 1465
  • [29] A Smart Approach for Text Detection, Localization and Extraction in Video Frames
    Shi, Shuicai
    Cheng, Tao
    Xiao, Shibin
    Lv, Xueqiang
    2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 158 - +
  • [30] A framework for improved video text detection and recognition
    Haojin Yang
    Bernhard Quehl
    Harald Sack
    Multimedia Tools and Applications, 2014, 69 : 217 - 245