An Algorithmic Approach for Text Recognition from Printed/Typed Text Images

被引:0
|
作者
Agrawal, Neha [1 ]
Kaur, Arashdeep [1 ]
机构
[1] Amity Univ Uttar Pradesh, Amity Sch Engn & Technol, Dept Comp Sci & Engn, Noida, India
关键词
OCR; Otsu's algorithm; Hough transform; English alphabets; skew detection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extraction of texts from scanned copies of documents and text images is an important task in the recent scenario. Optical Character Recognition (OCR) is used to analyze text in images. The proposed algorithm deals with taking scanned copy of a document as an input and extract texts from the image into a text format using Otsu's algorithm for segmentation and Hough transform method for skew detection. The system was confined to recognize English alphabets (A-Z, a-z) and numerals (0-9). OCR technique has been implemented to recognize characters. Validation tests were done on screenshots of typed texts and images of scanned document from Internet sources. Experimental results indicate that the proposed algorithm is able to recognize alphabets written in Verdana font style with size 14 and also showed good results with rotated images. The average accuracy to determine rotation angle correctly was calculated to be 90% and overall system accuracy was calculated to be 93%.
引用
收藏
页码:876 / 879
页数:4
相关论文
共 50 条
  • [21] Optical character recognition of arabic printed text
    Electrical and Electronics Engineering Department, University of Khartoum, Sudan
    SCOReD - IEEE Stud. Conf. Res. Dev., (235-240):
  • [22] COMPUTER RECOGNITION OF HAND-PRINTED TEXT
    MUNSON, JH
    JOURNAL OF TYPOGRAPHIC RESEARCH, 1969, 3 (01): : 31 - +
  • [23] MACHINE RECOGNITION AND CORRECTION OF PRINTED ARABIC TEXT
    AMIN, A
    MARI, JF
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05): : 1300 - 1306
  • [24] Database for Arabic Printed Text Recognition Research
    Jaiem, Faten Kallel
    Kanoun, Slim
    Khemakhem, Maher
    El Abed, Haikal
    Kardoun, Jihain
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT 1, 2013, 8156 : 251 - 259
  • [25] On the problem of individual character recognition in a printed text
    Ardalionov, L.V.
    Simaranov, S.Yu.
    Izvestiya Akademii Nauk. Teoriya i Sistemy Upravleniya, 1993, (06): : 61 - 75
  • [26] Handwritten and Machine Printed Text Separation from Kannada Document Images
    Pardeshi, Rajmohan
    Hangarge, Mallikarjun
    Doddamani, Srikanth
    Santosh, K. C.
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [27] Text string extraction from images of colour-printed documents
    Suen, HM
    Wang, JF
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1996, 143 (04): : 210 - 216
  • [28] Key Information Extraction and Recognition from Rich Text Images
    Do, Tien
    Doan, Thuyen Tran
    Le, Khiem
    Nguyen, Thua
    Le, Duy-Dinh
    Ngo, Thanh Duc
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2024, 11 (04) : 569 - 594
  • [29] Recognition based Text Localization from Natural Scene Images
    Ray, Anupama
    Shah, Archit
    Chaudhury, Santanu
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1177 - 1182
  • [30] Recognition of Apparent Personality Traits from Text and Handwritten Images
    Perez Costa, Ernesto
    Villasenor-Pienda, Luis
    Morales, Eduardo
    Jair Escalante, Hugo
    PATTERN RECOGNITION AND INFORMATION FORENSICS, 2019, 11188 : 146 - 152