Methods of Natural Image Preprocessing Supporting the Automatic Text Recognition Using the OCR Algorithms

被引:0
|
作者
Lech, Piotr [1 ,2 ]
Okarma, Krzysztof [1 ,2 ]
机构
[1] West Pomeranian Univ Technol, Szczecin, Poland
[2] Fac Elect Engn, Dept Signal Proc & Multimedia Engn, 26 Kwietnia 10, PL-71126 Szczecin, Poland
关键词
Image binarization; Natural images; OCR; COLOR;
D O I
10.1007/978-3-319-23814-2_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reading text from natural images is much more difficult than from scanned text documents since the text may appear in all colors, different sizes and types, often with distorted geometry or textures applied. The paper presents the idea of high-speed image preprocessing algorithms utilizing the quasi-local histogram based methods such as binarization, ROI filtering, line and corners detection, etc. which can be helpful for this task. Their low computational cost is provided by a reduction of the amount of processed information carried out by means of a simple random sampling. The approach presented in the paper allows to minimize some problems with the implementation of the OCR algorithms operating on natural images on devices with low computing power (e.g. mobile or embedded). Due to relatively small computational effort it is possible to test multiple hypotheses e.g. related to the possible location of the text in the image. Their verification can be based on the analysis of images in various color spaces. An additional advantage of the discussed algorithms is their construction allowing an efficient parallel implementation further reducing the computation time.
引用
收藏
页码:143 / 150
页数:8
相关论文
共 50 条
  • [1] PREPROCESSING AND PRESORTING OF ENVELOPE IMAGES FOR AUTOMATIC SORTING USING OCR
    DOWNTON, AC
    LEEDHAM, CG
    PATTERN RECOGNITION, 1990, 23 (3-4) : 347 - 362
  • [2] Text Recognition for 2D Bridge Plans Using OCR-Algorithms
    Peng, Mengyan
    Kang, Chongjie
    Marx, Steffen
    EUROPEAN ASSOCIATION ON QUALITY CONTROL OF BRIDGES AND STRUCTURES, EUROSTRUCT 2023, VOL 6, ISS 5, 2023, : 661 - 666
  • [3] Searching Documentation using Text, OCR, and Image
    Yeh, Tom
    Katz, Boris
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 776 - 777
  • [4] Multimodal fine-grained grocery product recognition using image and OCR text
    Pettersson, Tobias
    Riveiro, Maria
    Lofstrom, Tuwe
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [5] Methods and Algorithms for Automatic Text Analysis
    Yatsko, V. A.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2011, 45 (05) : 224 - 231
  • [6] On the automatic text detection and recognition algorithms for maritime images
    Nita, Cornelia
    Vandewal, Marijke
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS III, 2021, 11870
  • [7] CONVERSION OF IMAGE TO TEXT TO SPEECH USING OCR AND TTS SYSTHESIS
    Dharshini, V
    Keerthana, P.
    Manju, T.
    Deepa, R.
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 432 - 438
  • [8] Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables
    Oropeza Rodriguez, Jose Luis
    Suarez Guerra, Sergio
    COMPUTACION Y SISTEMAS, 2006, 9 (03): : 270 - 286
  • [9] OCR Based Image Text To Speech Conversion Using MATLAB
    Madre, Sneha. C.
    Gundre, S. B.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 858 - 861
  • [10] Automatic Recognition Methods Supporting Pain Assessment: A Survey
    Werner, Philipp
    Lopez-Martinez, Daniel
    Walter, Steffen
    Al-Hamadi, Ayoub
    Gruss, Sascha
    Picard, Rosalind W.
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (01) : 530 - 552