Fractional Poisson enhancement model for text detection and recognition in video frames

Cited by: 28
Authors
Roy, Sangheeta [1]
Shivakumara, Palaiahnakote [1]
Jalab, Hamid A. [1]
Ibrahim, Rabha W. [2]
Pal, Umapada [3]
Lu, Tong [4]
Affiliations
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[2] Univ Malaya, Inst Math Sci, Kuala Lumpur, Malaysia
[3] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[4] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210008, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Text detection; Text recognition; Laplacian operation; Fractional Poisson model; Text enhancement; LAPLACIAN; BINARIZATION; FEATURES;
DOI
10.1016/j.patcog.2015.10.011
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Applying the Laplacian operation to video images is a common technique for improving image contrast in order to achieve good text detection and recognition accuracy. However, while the Laplacian operation enhances contrast, it also introduces a considerable amount of noise. To alleviate this, existing methods propose various enhancement methods and filters. In this paper, we propose a generalized enhancement model based on fractional calculus to increase the quality of images obtained by the Laplacian operation. The proposed method considers edges and their neighbor information to derive a mathematical model for enhancing low-contrast information in video as well as in scene images. Experimental results of text detection and recognition methods on different databases show that the proposed enhancement model improves their accuracy significantly. The enhancement model is also compared with standard enhancement models and is shown to outperform them in terms of quality measures. The usefulness of the proposed model is validated through text detection and recognition experiments. (C) 2015 Elsevier Ltd. All rights reserved.
Pages: 433-447
Number of pages: 15