Multi-Script-Oriented Text Detection and Recognition in Video/Scene/Born Digital Images

被引:38
|
作者
Raghunandan, K. S. [1 ]
Shivakumara, Palaiahnakote [2 ]
Roy, Sangheeta [2 ]
Kumar, G. Hemantha [1 ]
Pal, Umapada [3 ]
Lu, Tong [4 ]
机构
[1] Univ Mysore, Dept Studies Comp Sci, Mysore 57005, Karnataka, India
[2] Univ Malaya, Fac Comp Syst & Informat Technol, Kuala Lumpur 50603, Malaysia
[3] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, India
[4] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
关键词
Bit plane slicing; convex and concave deficiencies; wavelet sub-bands; arbitrarily-oriented text detection and recognition; hidden Markov model; multi-lingual text detection and recognition; VIDEO; SEGMENTATION; TRACKING;
D O I
10.1109/TCSVT.2018.2817642
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Achieving good text detection and recognition results for multi-script-oriented images is a challenging task. First, we explore bit plane slicing in order to utilize the advantage of the most significant bit information to identify text components. A new iterative nearest neighbor symmetry is then proposed based on shapes of convex and concave deficiencies of text components in bit planes to identify candidate planes. Further, we introduce a new concept called mutual nearest neighbor pair components based on gradient direction to identify representative pairs of texts in each candidate bit plane. The representative pairs are used to restore words with the help of edge image of the input one, which results in text detection results (words). Second, we propose a new idea by fixing window for character components of arbitrary oriented words based on angular relationship between sub-bands and a fused band. For each window, we extract features in contourlet wavelet domain to detect characters with the help of an SVM classifier. Further, we propose to explore HMM for recognizing characters and words of any orientation using the same feature vector. The proposed method is evaluated on standard databases such as ICDAR, YVT video, ICDAR, SVT, MSRA scene data, ICDAR born digital data, and multi-lingual data to show its superiority to the state of the art methods.
引用
收藏
页码:1145 / 1162
页数:18
相关论文
共 50 条
  • [1] Multi-oriented text detection and verification in video frames and scene images
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Roy, Partha Pratim
    Pal, Umapada
    [J]. NEUROCOMPUTING, 2018, 275 : 1531 - 1549
  • [2] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [3] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
  • [4] MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES
    Basavanna, M.
    Shivakumara, P.
    Srivatsa, S. K.
    Kumar, G. Hemantha
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [5] Unconstrained Scene Text and Video Text Recognition for Arabic Script
    Jain, Mohit
    Mathew, Minesh
    Jawahar, C. V.
    [J]. 2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 26 - 30
  • [6] Script independent approach for multi-oriented text detection in scene image
    Dey, Sounak
    Shivakumara, Palaiahnakote
    Raghunandan, K. S.
    Pal, Umapada
    Lu, Tong
    Kumar, G. Hemantha
    Chan, Chee Seng
    [J]. NEUROCOMPUTING, 2017, 242 : 96 - 112
  • [7] Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images
    Liang, Guozhu
    Shivakumara, Palaiahnakote
    Lu, Tong
    Tan, Chew Lim
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 4488 - 4501
  • [8] Fractals based multi-oriented text detection system for recognition in mobile video images
    Shivakumara, Palaiahnakote
    Wu, Liang
    Lu, Tong
    Tan, Chew Lim
    Blumenstein, Michael
    Anami, Basavaraj S.
    [J]. PATTERN RECOGNITION, 2017, 68 : 158 - 174
  • [9] Text Detection and Recognition in Natural Scene Images
    Pise, Amruta
    Ruikar, S. D.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [10] Text Detection and Recognition in Natural Scene Images
    Huang, Xiaoming
    Shen, Tao
    Wang, Run
    Gao, Chenqiang
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ESTIMATION, DETECTION AND INFORMATION FUSION ICEDIF 2015, 2015, : 44 - 49