Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

被引:0
|
作者
Soni, Vishnu Kant [1 ]
Shukla, Vivek [1 ]
Tandan, S. R. [2 ]
Pimpalkar, Amit [3 ]
Nema, Neetesh Kumar [1 ]
Naik, Muskan [4 ]
机构
[1] Dr CV Raman Univ, Dept Comp Sci & Engn, Bilaspur, CG, India
[2] Govt RVRS Kanya Mahavidyalaya, Dept Comp Sci, Kawardha, CG, India
[3] Ramdeobaba Univ, Engn AIML Shri Ramdeobaba Coll Engn & Management, Dept Comp Sci, Nagpur, India
[4] Lakhmi Chand Inst Technol, Dept Comp Sci & Engn, Bilaspur, CG, India
关键词
Scene text recognition; optical character recognition; deep learning; feature extraction; scene text detection; ATTENTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
texts refer to arbitrary text found in images captured by cameras in real-world settings. The tasks of text detection and recognition are critical components of computer vision, with applications spanning scene understanding, information retrieval, robotics, and autonomous driving. Despite significant advancements in deep learning methods, achieving accurate text detection and recognition in complex images remains a formidable challenge for robust real-world applications. Several factors contribute to these challenges. First, the diversity of text shapes, fonts, colors, and styles complicates detection efforts. Second, the myriad combinations of characters, often with unstable attributes, make complete detection difficult, especially when background interruptions obscure character strokes and shapes. Finally, effective coordination of multiple sub-tasks in end-to-end learning is essential for success. This research aimed to tackle these challenges by enhancing text discriminative representation. This study focused on two interconnected problems: Scene Text Recognition (STR), which involves recognizing text from scene images, and Scene Text Detection (STD), which entails simultaneously detecting and recognizing multiple texts within those images. This research focuses on implementing and evaluating the Efficient and Accurate Scene Text Detector (EAST) algorithm for text detection and recognition in natural scene images. The study aims to compare the performance of three prominent Optical Character Recognition (OCR) techniques-TesseractOCR, PaddleOCR, and EasyOCR. The EAST model was applied to a series of sample test images, and the results were visually represented with bounding boxes highlighting the detected text regions. The inference times for each image were recorded, highlighting the algorithm's efficiency, with average times of 0.446, 0.439, and 0.440 seconds for the respective test images. These results indicate that the EAST algorithm is accurate and operates in real-time, making it suitable for applications requiring immediate text recognition.
引用
收藏
页码:445 / 453
页数:9
相关论文
共 50 条
  • [31] How Far Deep Learning Systems for Text Detection and Recognition in Natural Scenes are Affected by Occlusion?
    Soares, Aline Geovanna
    Dantas Bezerra, Byron Leite
    Lima, Estanislau Baptista
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 198 - 212
  • [32] An Algorithm for Natural Images Text Recognition Using Four Direction Features
    Zhang, Min
    Yan, Yujin
    Wang, Hai
    Zhao, Wei
    ELECTRONICS, 2019, 8 (09)
  • [33] End-to-End Analysis for Text Detection and Recognition in Natural Scene Images
    Alnefaie, Ahlam
    Gupta, Deepak
    Bhuyan, Monowar H.
    Razzak, Imran
    Gupta, Prashant
    Prasad, Mukesh
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [34] Text Detection on Natural Images Using Mnemonic Cellular Automata
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    JOURNAL OF CELLULAR AUTOMATA, 2014, 9 (2-3) : 183 - 194
  • [35] An Efficient Text Dependent Speaker Recognition using Fusion of MFCC and SBC
    Kishore, K. V. Krishna
    Sharrefaunnisa, Syed.
    Venkatramaphanikumar, S.
    2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 18 - 22
  • [36] Scene Based Text Recognition From Natural Images and Classification Based on Hybrid CNN Models with Performance Evaluation
    Dasari, Sunil Kumar
    Mehta, Shilpa
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (03) : 293 - 300
  • [37] Using spin images for efficient object recognition in cluttered 3D scenes
    Johnson, AE
    Hebert, M
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (05) : 433 - 449
  • [38] Convolutional Feature Fusion for Multi-Language Text Detection in Natural Scene Images
    Chandio, Asghar Ali
    Pickering, Mark
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING, MATHEMATICS AND ENGINEERING TECHNOLOGIES (ICOMET), 2019,
  • [39] Text Detection and Recognition from Scene Images using MSER and CNN
    Choudhary, Savita
    Singh, Nikhil Kumar
    Chichadwani, Sanjay
    2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2018,
  • [40] Text Detection in Natural Scenes Using Gradient Vector Flow-Guided Symmetry
    Trung Quy Phan
    Shivakumara, Palaiahnakote
    Chew Lim Tan
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3296 - 3299