Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion

Cited by: 0
Authors
Soni, Vishnu Kant [1]
Shukla, Vivek [1]
Tandan, S. R. [2]
Pimpalkar, Amit [3]
Nema, Neetesh Kumar [1]
Naik, Muskan [4]
Affiliations
[1] Dr CV Raman Univ, Dept Comp Sci & Engn, Bilaspur, CG, India
[2] Govt RVRS Kanya Mahavidyalaya, Dept Comp Sci, Kawardha, CG, India
[3] Ramdeobaba Univ, Engn AIML Shri Ramdeobaba Coll Engn & Management, Dept Comp Sci, Nagpur, India
[4] Lakhmi Chand Inst Technol, Dept Comp Sci & Engn, Bilaspur, CG, India
Keywords
Scene text recognition; optical character recognition; deep learning; feature extraction; scene text detection; attention
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Scene texts refer to arbitrary text found in images captured by cameras in real-world settings. Text detection and recognition are critical components of computer vision, with applications spanning scene understanding, information retrieval, robotics, and autonomous driving. Despite significant advances in deep learning methods, accurate text detection and recognition in complex images remains a formidable challenge for robust real-world applications. Several factors contribute to this difficulty. First, the diversity of text shapes, fonts, colors, and styles complicates detection. Second, the myriad combinations of characters, often with unstable attributes, make complete detection difficult, especially when background interruptions obscure character strokes and shapes. Finally, effective coordination of multiple sub-tasks in end-to-end learning is essential for success. This research aimed to tackle these challenges by enhancing discriminative text representation. The study addressed two interconnected problems: Scene Text Recognition (STR), which involves recognizing text from scene images, and Scene Text Detection (STD), which entails simultaneously detecting and recognizing multiple texts within those images. It implements and evaluates the Efficient and Accurate Scene Text Detector (EAST) algorithm for text detection and recognition in natural scene images, and compares the performance of three prominent Optical Character Recognition (OCR) techniques: TesseractOCR, PaddleOCR, and EasyOCR. The EAST model was applied to a series of sample test images, and the results were visualized with bounding boxes highlighting the detected text regions. Inference times were recorded for each image, with average times of 0.446, 0.439, and 0.440 seconds for the respective test images. These results indicate that the EAST algorithm is accurate and operates in real time, making it suitable for applications requiring immediate text recognition.
Pages: 445-453
Page count: 9
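The abstract reports average per-image inference times of 0.446, 0.439, and 0.440 seconds. As a rough illustration only, the sketch below shows one way such averages could be measured and converted to throughput. It is not the authors' code: `time_detector`, `throughput`, the `detect` callable, and the `repeats` parameter are all hypothetical names, and the detector is passed in as a plain function so that any EAST wrapper could be substituted.

```python
import time
from statistics import mean
from typing import Callable, Iterable, List, Tuple

# Hypothetical box format for illustration: (x, y, width, height).
Box = Tuple[int, int, int, int]


def time_detector(
    detect: Callable[[object], List[Box]],
    images: Iterable[object],
    repeats: int = 5,
) -> List[float]:
    """Return the mean wall-clock seconds per call of `detect` for each image."""
    averages = []
    for image in images:
        samples = []
        for _ in range(repeats):
            start = time.perf_counter()
            detect(image)  # the detector under test, e.g. an EAST wrapper
            samples.append(time.perf_counter() - start)
        averages.append(mean(samples))
    return averages


def throughput(seconds_per_image: float) -> float:
    """Convert an average latency in seconds into images per second."""
    return 1.0 / seconds_per_image


if __name__ == "__main__":
    # Average times quoted in the abstract, one per test image.
    reported = [0.446, 0.439, 0.440]
    for seconds in reported:
        print(f"{seconds:.3f} s/image -> {throughput(seconds):.2f} images/s")
```

Under this conversion, the reported averages correspond to roughly 2.2 to 2.3 images per second on whatever hardware the study used, which the record does not specify.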