Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

Authors
Soni, Vishnu Kant [1]
Shukla, Vivek [1]
Tandan, S. R. [2]
Pimpalkar, Amit [3]
Nema, Neetesh Kumar [1]
Naik, Muskan [4]
Affiliations
[1] Dr CV Raman Univ, Dept Comp Sci & Engn, Bilaspur, CG, India
[2] Govt RVRS Kanya Mahavidyalaya, Dept Comp Sci, Kawardha, CG, India
[3] Ramdeobaba Univ, Engn AIML Shri Ramdeobaba Coll Engn & Management, Dept Comp Sci, Nagpur, India
[4] Lakhmi Chand Inst Technol, Dept Comp Sci & Engn, Bilaspur, CG, India
Keywords
Scene text recognition; optical character recognition; deep learning; feature extraction; scene text detection; ATTENTION;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Scene texts refer to arbitrary text found in images captured by cameras in real-world settings. Text detection and recognition are critical components of computer vision, with applications spanning scene understanding, information retrieval, robotics, and autonomous driving. Despite significant advances in deep learning methods, accurate text detection and recognition in complex images remains a formidable challenge for robust real-world applications. Several factors contribute to this difficulty. First, the diversity of text shapes, fonts, colors, and styles complicates detection. Second, the myriad combinations of characters, often with unstable attributes, make complete detection difficult, especially when background interruptions obscure character strokes and shapes. Finally, end-to-end learning requires effective coordination of multiple sub-tasks. This research aims to tackle these challenges by enhancing discriminative text representation. It addresses two interconnected problems: Scene Text Recognition (STR), which involves recognizing text from scene images, and Scene Text Detection (STD), which entails simultaneously detecting and recognizing multiple texts within those images. The study implements and evaluates the Efficient and Accurate Scene Text Detector (EAST) algorithm for text detection and recognition in natural scene images, and compares the performance of three prominent Optical Character Recognition (OCR) techniques: TesseractOCR, PaddleOCR, and EasyOCR. The EAST model was applied to a series of sample test images, and the results were visualized with bounding boxes highlighting the detected text regions. The inference time for each image was recorded, with average times of 0.446, 0.439, and 0.440 seconds for the respective test images. These results indicate that the EAST algorithm is accurate and operates in real time, making it suitable for applications requiring immediate text recognition.
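The per-image inference timing described in the abstract could be measured along the following lines. This is only a minimal sketch and not the authors' code: `time_detector` and `dummy_detect` are hypothetical names, and `dummy_detect` is a placeholder that sleeps briefly instead of running an actual EAST forward pass.

```python
import time
from statistics import mean
from typing import Any, Callable, Iterable, List


def time_detector(
    detect: Callable[[Any], Any],
    images: Iterable[Any],
    repeats: int = 3,
) -> List[float]:
    """Return the mean wall-clock time in seconds per image over `repeats` runs."""
    timings = []
    for image in images:
        runs = []
        for _ in range(repeats):
            start = time.perf_counter()
            detect(image)  # one detection pass on this image
            runs.append(time.perf_counter() - start)
        timings.append(mean(runs))
    return timings


def dummy_detect(image: Any) -> list:
    # Hypothetical stand-in for a text detector such as EAST:
    # it only sleeps and returns no boxes.
    time.sleep(0.01)
    return []


if __name__ == "__main__":
    # Placeholder image identifiers; a real run would pass loaded images.
    for name, seconds in zip(["img1", "img2", "img3"],
                             time_detector(dummy_detect, ["img1", "img2", "img3"])):
        print(f"{name}: {seconds:.3f} s")
```

Averaging several runs per image, as assumed here, reduces the influence of one-off delays such as model warm-up; the abstract does not state how many runs were averaged.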
Pages: 445-453
Number of pages: 9