Optimal Classification Model for Text Detection and Recognition in Video Frames

被引:1
|
作者
Eshwarappa, Laxmikant [1 ]
Rajput, G. G. [2 ]
机构
[1] Sharnbasva Univ, Dept Master Comp Applicat, Sharana Nagara Kalaburagi 585105, Karnataka, India
[2] Akkamahadevi Womens Univ, Dept Comp Sci, Vijayapur, Karnataka, India
关键词
Text detection; improved distance map; improved candidate text block; LSTM; SI-BESO; IMAGES; EXTRACTION; ALGORITHM;
D O I
10.1142/S0219467825500147
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Currently, the identification of text from video frames and normal scene images has got amplified awareness amongst analysts owing to its diverse challenges and complexities. Owing to a lower resolution, composite backdrop, blurring effect, color, diverse fonts, alternate textual placement among panels of photos and videos, etc., text identification is becoming complicated. This paper suggests a novel method for identifying texts from video with five stages. Initially, "video-to-frame conversion", is done during pre-processing. Further, text region verification is performed and keyframes are recognized using CNN. Then, improved candidate text block extraction is carried out using MSER. Subsequently, "DCT features, improved distance map features, and constant gradient-based features" are extracted. These characteristics subsequently provided "Long Short-Term Memory (LSTM)" for detection. Finally, OCR is done to recognize the texts in the image. Particularly, the Self-Improved Bald Eagle Search (SI-BESO) algorithm is used to adjust the LSTM weights. Finally, the superiority of the SI-BESO-based technique over many other techniques is demonstrated.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Text detection and recognition in images and video frames
    Chen, DT
    Odobez, JM
    Bourlard, H
    [J]. PATTERN RECOGNITION, 2004, 37 (03) : 595 - 608
  • [2] Detection and Recognition of Arabic Text in Video Frames
    Ohyama, Wataru
    Iwata, Seiya
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 20 - 24
  • [3] Fractional poisson enhancement model for text detection and recognition in video frames
    Roy, Sangheeta
    Shivakumara, Palaiahnakote
    Jalab, Hamid A.
    Ibrahim, Rabha W.
    Pal, Umapada
    Lu, Tong
    [J]. PATTERN RECOGNITION, 2016, 52 : 433 - 447
  • [4] Video Scene Text Frames Categorization for Text Detection and Recognition
    Qin, Longfei
    Shivakumara, Palaiahnakote
    Lu, Tong
    Pal, Umapada
    Tan, Chew Lim
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
  • [5] Detection and recognition of cursive text from video frames
    Mirza, Ali
    Zeshan, Ossama
    Atif, Muhammad
    Siddiqi, Imran
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [6] Detection and recognition of cursive text from video frames
    Ali Mirza
    Ossama Zeshan
    Muhammad Atif
    Imran Siddiqi
    [J]. EURASIP Journal on Image and Video Processing, 2020
  • [7] Multiresolution text detection in video frames
    Anthimopoulos, Marios
    Gatos, Basilis
    Pratikakis, Ioannis
    [J]. VISAPP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOLUME IU/MTSV, 2007, : 161 - +
  • [8] Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
    Zayene, Oussama
    Touj, Sameh Masmoudi
    Hennebert, Jean
    Ingold, Rolf
    Ben Amara, Najoua Essoukri
    [J]. JOURNAL OF IMAGING, 2018, 4 (02):
  • [9] VIDEO FRAMES TEXT DETECTION THROUGH BAYESIAN CLASSIFICATION AND BOUNDARY GROWING METHOD
    Nancy, A.
    Jayapriya, D.
    [J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [10] Caption text recognition in video frames by MAP matching
    Nakamura, A
    Yamamoto, K
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 650 - 654