Integrated Coarse-Fine Scheme for Text Information Extraction from Instructional Videos

被引:0
|
作者
Hamad, Ahmed
El-Ghonaimy, Said [1 ]
Soliman, Taysir
Afifi, Marwa [1 ]
机构
[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt
关键词
Text Detection; Text Extraction; Edge Detection; Morphological Operations; SVM; OCR; IMAGES; LOCALIZATION; FRAMES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an effective coarse-to-fine algorithm to detect and extract text in instructional videos. Firstly, in the coarse section, an edge-based algorithm is employed to detect all candidate regions of character edges. First in the detection step, an edge map is created using the canny edge detector. Then, morphological filtering is used, based on geometrical structure element, in order to connect the vertical edges and discard false alarms. A connected component analysis is performed to the filtered edge map in order to determine a bounding box for every candidate text area. Finally in the localization step, horizontal and vertical projections are calculated on the edge map of every box and a threshold is applied, refining the result and splitting text areas in text lines. Secondly, in the fine section, correct text regions are selected from candidate ones by support vector machine (SVM) model and texture features. Finally, we segment these regions and binarize them to be fed into the OCR engine to be recognized. Experimental results show that our algorithm achieves high performance and prove that our system is highly effective and efficient for text information extraction.
引用
收藏
页码:559 / 566
页数:8
相关论文
共 50 条
  • [1] An Efficient Coarse-to-Fine Scheme for Text Detection in Videos
    Wang, Liuan
    Huang, Lin-Lin
    Wu, Yang
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 475 - 479
  • [2] Coarse-Fine Networks for Temporal Activity Detection in Videos
    Kahatapitiya, Kumara
    Ryoo, Michael S.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8381 - 8390
  • [3] Multi-information Integrated Method for Text Extraction from Videos
    Jiang, Aiwen
    Zeng, Gaorong
    [J]. ADVANCED RESEARCH ON AUTOMATION, COMMUNICATION, ARCHITECTONICS AND MATERIALS, PTS 1 AND 2, 2011, 225-226 (1-2): : 827 - +
  • [4] A unified text extraction method for instructional videos
    Tang, LJ
    Kender, JR
    [J]. 2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 2893 - 2896
  • [5] Result-consistent counter sampling scheme for coarse-fine TDCs
    Michalik, P.
    Fernandez, D.
    Madrenas, J.
    [J]. ELECTRONICS LETTERS, 2012, 48 (19) : 1195 - U43
  • [6] A novel coarse-fine search scheme for digital image correlation method
    Zhang, Zhi-Feng
    Kang, Yi-Lan
    Wang, Huai-Wen
    Qin, Qlng-Hua
    Qiu, Yu
    Li, Xiao-Qi
    [J]. MEASUREMENT, 2006, 39 (08) : 710 - 718
  • [7] Fully integrated coarse-fine wideband distributed voltage controlled oscillator
    Cannone, F.
    Avitabile, G.
    Cascella, D.
    [J]. PRIME: 2008 PHD RESEARCH IN MICROELECTRONICS AND ELECTRONICS, PROCEEDINGS, 2008, : 165 - 168
  • [8] Real-time text information extraction from videos
    Ou, Guobin
    Zhang, Li
    Xie, Pan
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2002, 42 (07): : 869 - 872
  • [9] A 99.15% energy-reduced switching scheme based on HSRS coarse-fine architecture for SAR ADCs
    Yue, Peiyi
    Li, Yongyuan
    Liu, Shubin
    Zhu, Zhangming
    [J]. ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2021, 107 (01) : 1 - 14
  • [10] Text Extraction from videos using MapReduce
    Roshan, Chanchal Kumar
    Kaushal, Rajeet
    Alam, Sha
    Rai, Shashank
    Gholap, Yuvraj
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 431 - 434