RECOGNITION OF PRINTED TEXT UNDER REALISTIC CONDITIONS

被引:17
|
作者
PAVLIDIS, T
机构
[1] Department of Computer Science, SUNY, Stony Brook
关键词
CHARACTER RECOGNITION;
D O I
10.1016/0167-8655(93)90097-W
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Past research in OCR has focused on the shape analysis of binarized images, quite often assuming good quality document and isolated characters. Such assumptions are challenged by the conditions met in practice: binarization is difficult for low contrast documents, characters often touch each other, not only on the sides but also between lines, etc. After a brief review of past work we will describe current efforts to deal with OCR as a signal processing problem where the causes of noise and distortions as well the idealized images (definitions of typefaces) are modeled and subjected to a quantitative analysis. The key idea of the analysis is that while printed text images may be binary in an ideal state. the images seen by the sensors are gray scale because of convolution distortion and other causes. Therefore binarization should be carried out at the same time as feature extraction.
引用
收藏
页码:317 / 326
页数:10
相关论文
共 50 条
  • [31] Printed Text Recognition using BLSTM and MDLSTM for Indian languages
    Chavan, Vishal
    Malage, Abhijit
    Mehrotra, Kapil
    Gupta, Manish Kumar
    2017 FOURTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2017, : 345 - 350
  • [32] Modelling heterogeneous electrocatalysis under realistic conditions
    Steinmann, Stephan
    Michel, Carine
    Sautet, Philippe
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [33] Recognition of Hand written and Printed Text of Cursive Writing Utilizing Optical Character Recognition
    Duth, Sudharshan P.
    Amulya, B.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 576 - 581
  • [34] Baseline Isolated Printed Text Image Database for Pashto Script Recognition
    Siddiqu, Arfa
    Basit, Abdul
    Noor, Waheed
    Khan, Muhammad Asfandyar
    Kakar, M. Saeed H.
    Khan, Azam
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (01): : 875 - 885
  • [35] Segmentation-free optical character recognition for printed Urdu text
    Israr Ud Din
    Imran Siddiqi
    Shehzad Khalid
    Tahir Azam
    EURASIP Journal on Image and Video Processing, 2017
  • [36] Radial basis function and subspace approach for printed Kannada text recognition
    Vijaykumar, B
    Ramakrishnan, AG
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 321 - 324
  • [37] Recognition and translation of the Myanmar Printed text based on Hopfield Neural Network
    Swe, Thynzar
    Tin, Pike
    APSITT 2005: 6th Asia-Pacific Symposium on Information and Telecommunication Technologies, Proceedings, 2005, : 99 - 104
  • [38] RECOGNITION OF HANDWRITTEN AND MACHINE-PRINTED TEXT FOR POSTAL ADDRESS INTERPRETATION
    SRIHARI, SN
    PATTERN RECOGNITION LETTERS, 1993, 14 (04) : 291 - 302
  • [39] Printed Ottoman text recognition using synthetic data and data augmentation
    Esma F. Bilgin Tasdemir
    International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 273 - 287
  • [40] A novel approach for improving recognition accuracies in OCR of printed Telugu text
    Lakshmi, CV
    Patvardhan, C
    Prasad, M
    2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM), 2004, : 255 - 259