An Algorithmic Approach for Text Recognition from Printed/Typed Text Images

被引:0
|
作者
Agrawal, Neha [1 ]
Kaur, Arashdeep [1 ]
机构
[1] Amity Univ Uttar Pradesh, Amity Sch Engn & Technol, Dept Comp Sci & Engn, Noida, India
关键词
OCR; Otsu's algorithm; Hough transform; English alphabets; skew detection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extraction of texts from scanned copies of documents and text images is an important task in the recent scenario. Optical Character Recognition (OCR) is used to analyze text in images. The proposed algorithm deals with taking scanned copy of a document as an input and extract texts from the image into a text format using Otsu's algorithm for segmentation and Hough transform method for skew detection. The system was confined to recognize English alphabets (A-Z, a-z) and numerals (0-9). OCR technique has been implemented to recognize characters. Validation tests were done on screenshots of typed texts and images of scanned document from Internet sources. Experimental results indicate that the proposed algorithm is able to recognize alphabets written in Verdana font style with size 14 and also showed good results with rotated images. The average accuracy to determine rotation angle correctly was calculated to be 90% and overall system accuracy was calculated to be 93%.
引用
收藏
页码:876 / 879
页数:4
相关论文
共 50 条
  • [31] Arabic Cursive Text Recognition from Natural Scene Images
    Bin Ahmed, Saad
    Naz, Saeeda
    Razzak, Muhammad Imran
    Yusof, Rubiyah
    APPLIED SCIENCES-BASEL, 2019, 9 (02):
  • [32] A discriminative approach for the retrieval of images from text queries
    Grangier, David
    Monay, Florent
    Bengio, Samy
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 162 - 173
  • [33] RECOGNITION OF TYPED TEXT CHARACTERS USING A 2-D FT FOR A LETTER DRIVEN TEXT READING SYSTEM
    GUMAHAD, AT
    BOURBAKIS, NG
    KOUTSOUPERAS, C
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 1993, 6 (05) : 473 - 478
  • [34] Identification of Handwritten Text in Machine Printed Document Images
    Banerjee, Sandipan
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2, 2013, 177 : 823 - 831
  • [35] Automatic Anonymization of Printed-Text Document Images
    Sanchez, Angel
    Velez, Jose F.
    Sanchez, Javier
    Belen Moreno, A.
    IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 145 - 152
  • [36] A MORPHOLOGICAL APPROACH TO TEXT STRING EXTRACTION FROM REGULAR PERIODIC OVERLAPPING TEXT BACKGROUND IMAGES
    SU, L
    AHMADI, M
    SHRIDHAR, M
    CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1994, 56 (05): : 402 - 413
  • [37] Text Detection and Recognition in Real World Images
    Saabni, Raid
    Zwilling, Moti
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 443 - 448
  • [38] Text Detection and Recognition in Natural Scene Images
    Huang, Xiaoming
    Shen, Tao
    Wang, Run
    Gao, Chenqiang
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ESTIMATION, DETECTION AND INFORMATION FUSION ICEDIF 2015, 2015, : 44 - 49
  • [39] Deep Learning Model for Text Recognition in Images
    Shrivastava, Anupriya
    Amudha, J.
    Gupta, Deepa
    Sharma, Kshitij
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [40] Ensemble Attention For Text Recognition In Natural Images
    Gao, Hongchao
    Li, Yujia
    Wang, Xi
    Han, Jizhong
    Li, Ruixuan
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,