Optimal Classification Model for Text Detection and Recognition in Video Frames

Cited by: 1

Authors:

Eshwarappa, Laxmikant [1]

Rajput, G. G. [2]

Affiliations:

[1] Sharnbasva Univ, Dept Master Comp Applicat, Sharana Nagara Kalaburagi 585105, Karnataka, India

[2] Akkamahadevi Womens Univ, Dept Comp Sci, Vijayapur, Karnataka, India

Source:

INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS | 2023

Keywords:

Text detection; improved distance map; improved candidate text block; LSTM; SI-BESO; IMAGES; EXTRACTION; ALGORITHM;

DOI:

10.1142/S0219467825500147

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Text identification in video frames and scene images has attracted growing attention from researchers because of its varied challenges. Low resolution, complex backgrounds, blur, color variation, diverse fonts, and varying text placement within images and video frames all make the task difficult. This paper proposes a novel five-stage method for identifying text in video. First, video-to-frame conversion is performed during pre-processing. Next, text region verification is performed and keyframes are recognized using a CNN. Then, improved candidate text block extraction is carried out using MSER. Subsequently, DCT features, improved distance map features, and constant gradient-based features are extracted. These features are then passed to a Long Short-Term Memory (LSTM) network for detection, with the LSTM weights tuned by the Self-Improved Bald Eagle Search (SI-BESO) algorithm. Finally, OCR is applied to recognize the text in the image. The SI-BESO-based technique is reported to outperform several other techniques.
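
The abstract names DCT features as one input to the LSTM but does not say how they are computed. As a rough illustration only, the sketch below computes an orthonormal 2-D DCT-II of a small grayscale block and keeps the low-frequency coefficients as a feature vector. The block size, the `keep` parameter, and the function names `dct_1d`, `dct_2d`, and `low_freq_features` are assumptions made for this example and are not taken from the paper.

```python
import math


def dct_1d(signal):
    """Orthonormal DCT-II of a 1-D sequence of numbers."""
    n = len(signal)
    out = []
    for k in range(n):
        # Orthonormal scaling: the k = 0 (DC) term uses a different factor.
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(
            x * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
            for i, x in enumerate(signal)
        )
        out.append(scale * total)
    return out


def dct_2d(block):
    """Separable 2-D DCT-II: transform every row, then every column."""
    rows = [dct_1d(row) for row in block]
    cols = [dct_1d(list(col)) for col in zip(*rows)]
    # cols holds transformed columns; transpose back to row-major order.
    return [list(r) for r in zip(*cols)]


def low_freq_features(block, keep=2):
    """Return the top-left keep x keep DCT coefficients as a flat list.

    Hypothetical feature choice: low frequencies summarize coarse
    intensity structure; the paper may select coefficients differently.
    """
    coeffs = dct_2d(block)
    return [coeffs[u][v] for u in range(keep) for v in range(keep)]


if __name__ == "__main__":
    # A 4 x 4 block with a vertical edge, loosely like a character stroke.
    patch = [[0.0, 0.0, 255.0, 255.0] for _ in range(4)]
    print(low_freq_features(patch, keep=2))
```

For a constant block, only the first (DC) coefficient is nonzero, which is why edge-rich text regions tend to stand out in the remaining coefficients. In practice a library routine such as an FFT-based DCT would replace the quadratic-time loops used here.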


Pages: 24

50 references in total

[1] Text detection and recognition in images and video frames
Chen, DT
Odobez, JM
Bourlard, H
[J]. PATTERN RECOGNITION, 2004, 37 (03) : 595 - 608
[2] Detection and Recognition of Arabic Text in Video Frames
Ohyama, Wataru
Iwata, Seiya
Wakabayashi, Tetsushi
Kimura, Fumitaka
[J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 20 - 24
[3] Fractional poisson enhancement model for text detection and recognition in video frames
Roy, Sangheeta
Shivakumara, Palaiahnakote
Jalab, Hamid A.
Ibrahim, Rabha W.
Pal, Umapada
Lu, Tong
[J]. PATTERN RECOGNITION, 2016, 52 : 433 - 447
[4] Video Scene Text Frames Categorization for Text Detection and Recognition
Qin, Longfei
Shivakumara, Palaiahnakote
Lu, Tong
Pal, Umapada
Tan, Chew Lim
[J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
[5] Detection and recognition of cursive text from video frames
Mirza, Ali
Zeshan, Ossama
Atif, Muhammad
Siddiqi, Imran
[J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
[6] Detection and recognition of cursive text from video frames
Ali Mirza
Ossama Zeshan
Muhammad Atif
Imran Siddiqi
[J]. EURASIP Journal on Image and Video Processing, 2020
[7] Multiresolution text detection in video frames
Anthimopoulos, Marios
Gatos, Basilis
Pratikakis, Ioannis
[J]. VISAPP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOLUME IU/MTSV, 2007, : 161 - +
[8] Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
Zayene, Oussama
Touj, Sameh Masmoudi
Hennebert, Jean
Ingold, Rolf
Ben Amara, Najoua Essoukri
[J]. JOURNAL OF IMAGING, 2018, 4 (02)
[9] VIDEO FRAMES TEXT DETECTION THROUGH BAYESIAN CLASSIFICATION AND BOUNDARY GROWING METHOD
Nancy, A.
Jayapriya, D.
[J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014
[10] Caption text recognition in video frames by MAP matching
Nakamura, A
Yamamoto, K
[J]. SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 650 - 654
