Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

被引:0
|
作者
Soni, Vishnu Kant [1 ]
Shukla, Vivek [1 ]
Tandan, S. R. [2 ]
Pimpalkar, Amit [3 ]
Nema, Neetesh Kumar [1 ]
Naik, Muskan [4 ]
机构
[1] Dr CV Raman Univ, Dept Comp Sci & Engn, Bilaspur, CG, India
[2] Govt RVRS Kanya Mahavidyalaya, Dept Comp Sci, Kawardha, CG, India
[3] Ramdeobaba Univ, Engn AIML Shri Ramdeobaba Coll Engn & Management, Dept Comp Sci, Nagpur, India
[4] Lakhmi Chand Inst Technol, Dept Comp Sci & Engn, Bilaspur, CG, India
关键词
Scene text recognition; optical character recognition; deep learning; feature extraction; scene text detection; ATTENTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
texts refer to arbitrary text found in images captured by cameras in real-world settings. The tasks of text detection and recognition are critical components of computer vision, with applications spanning scene understanding, information retrieval, robotics, and autonomous driving. Despite significant advancements in deep learning methods, achieving accurate text detection and recognition in complex images remains a formidable challenge for robust real-world applications. Several factors contribute to these challenges. First, the diversity of text shapes, fonts, colors, and styles complicates detection efforts. Second, the myriad combinations of characters, often with unstable attributes, make complete detection difficult, especially when background interruptions obscure character strokes and shapes. Finally, effective coordination of multiple sub-tasks in end-to-end learning is essential for success. This research aimed to tackle these challenges by enhancing text discriminative representation. This study focused on two interconnected problems: Scene Text Recognition (STR), which involves recognizing text from scene images, and Scene Text Detection (STD), which entails simultaneously detecting and recognizing multiple texts within those images. This research focuses on implementing and evaluating the Efficient and Accurate Scene Text Detector (EAST) algorithm for text detection and recognition in natural scene images. The study aims to compare the performance of three prominent Optical Character Recognition (OCR) techniques-TesseractOCR, PaddleOCR, and EasyOCR. The EAST model was applied to a series of sample test images, and the results were visually represented with bounding boxes highlighting the detected text regions. The inference times for each image were recorded, highlighting the algorithm's efficiency, with average times of 0.446, 0.439, and 0.440 seconds for the respective test images. These results indicate that the EAST algorithm is accurate and operates in real-time, making it suitable for applications requiring immediate text recognition.
引用
收藏
页码:445 / 453
页数:9
相关论文
共 50 条
  • [21] Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
    Wu, Xianyu
    Luo, Chao
    Zhang, Qian
    Zhou, Jiliu
    Yang, Hao
    Li, Yulian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 289 - 300
  • [22] Methods of Natural Image Preprocessing Supporting the Automatic Text Recognition Using the OCR Algorithms
    Lech, Piotr
    Okarma, Krzysztof
    IMAGE PROCESSING AND COMMUNICATIONS CHALLENGES 7, 2016, 389 : 143 - 150
  • [23] An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images
    Dong, Lanfang
    Chao, Zhongdi
    Wang, Jianfu
    WEARABLE SENSORS AND ROBOTS, 2017, 399 : 447 - 460
  • [24] Text Detection and Recognition Using Camera Based Images
    Darshan, H. Y.
    Gopalkrishna, M. T.
    Hanumantharaju, M. C.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 2, 2015, 328 : 573 - 579
  • [25] Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering
    Maryam, Hiba
    Fu, Ling
    Song, Jiajun
    Shafayet, Tajrian A. B. M.
    Luo, Qidi
    Bai, Xiang
    Liu, Yuliang
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 279 - 292
  • [26] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
  • [27] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [28] A Framework of Text Detection and Recognition from Natural Images for Mobile Device
    Selmi, Zied
    Ben Halima, Mohamed
    Wali, Ali
    Alimi, Adel M.
    NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [29] Fast and Accurate Text Detection in Natural Scene Images with User-intention
    Wang, Liuan
    Fan, Wei
    He, Yuan
    Sun, Jun
    Katsuyama, Yutaka
    Hotta, Yoshinobu
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2920 - 2925
  • [30] A multiscale feature fusion method for cursive text detection in natural scene images
    Chandio, Asghar Ali
    Leghari, Mehwish
    Soomro, Muhammad Ali
    Nizamani, Shah Zaman
    Memon, Saifullah
    IMAGING SCIENCE JOURNAL, 2021, 69 (5-8): : 302 - 318