Methods of Natural Image Preprocessing Supporting the Automatic Text Recognition Using the OCR Algorithms

Cited by: 0
Authors
Lech, Piotr [1 ,2 ]
Okarma, Krzysztof [1 ,2 ]
Affiliations
[1] West Pomeranian Univ Technol, Szczecin, Poland
[2] Fac Elect Engn, Dept Signal Proc & Multimedia Engn, 26 Kwietnia 10, PL-71126 Szczecin, Poland
Keywords
Image binarization; Natural images; OCR; COLOR;
DOI
10.1007/978-3-319-23814-2_17
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Reading text from natural images is much more difficult than reading it from scanned text documents, since the text may appear in any color, in different sizes and types, and often with distorted geometry or applied textures. The paper presents the idea of high-speed image preprocessing algorithms that use quasi-local histogram-based methods, such as binarization, ROI filtering, line and corner detection, etc., which can be helpful for this task. Their low computational cost results from reducing the amount of processed information by means of simple random sampling. The presented approach mitigates some of the problems of implementing OCR algorithms that operate on natural images on devices with low computing power (e.g. mobile or embedded devices). Thanks to the relatively small computational effort, it is possible to test multiple hypotheses, e.g. about the possible location of the text in the image. Their verification can be based on the analysis of images in various color spaces. An additional advantage of the discussed algorithms is that their construction allows an efficient parallel implementation, which further reduces the computation time.
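
The abstract's central idea, estimating a histogram from a simple random sample of pixels and using it for binarization, can be illustrated with a minimal sketch. This is not the authors' algorithm: the record does not say how the threshold is chosen or how the quasi-local regions are defined, so the sketch assumes a single global threshold picked with Otsu's method, and the function names (`sampled_histogram`, `otsu_threshold`, `binarize`) are hypothetical.

```python
import random


def sampled_histogram(image, n_samples, rng):
    """Build a 256-bin histogram from randomly sampled pixels (with replacement).

    `image` is a list of rows of 8-bit grayscale values (0..255).
    """
    height, width = len(image), len(image[0])
    hist = [0] * 256
    for _ in range(n_samples):
        y = rng.randrange(height)
        x = rng.randrange(width)
        hist[image[y][x]] += 1
    return hist


def otsu_threshold(hist):
    """Return the level t that maximizes the between-class variance.

    Assumption: Otsu's method stands in for whatever threshold rule
    the paper actually uses.
    """
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_score = 0, -1.0
    w0 = 0      # number of samples with level <= t
    sum0 = 0    # sum of levels of those samples
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0:
            continue
        if w1 == 0:
            break
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        score = w0 * w1 * (mean0 - mean1) ** 2
        if score > best_score:
            best_score, best_t = score, t
    return best_t


def binarize(image, threshold):
    """Map pixels above the threshold to 1 and all others to 0."""
    return [[1 if pixel > threshold else 0 for pixel in row] for row in image]


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic 20x20 image: dark left half, bright right half.
    img = [[40] * 10 + [200] * 10 for _ in range(20)]
    hist = sampled_histogram(img, 500, rng)
    t = otsu_threshold(hist)
    print("threshold:", t)
    print("first row:", binarize(img, t)[0])
```

The threshold search always costs 256 steps, and the histogram costs `n_samples` pixel reads instead of one read per pixel, which is where the saving on large images comes from. Only the final `binarize` pass touches every pixel, and it splits naturally across rows for parallel execution.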
Pages: 143 - 150
Number of pages: 8
Related papers
50 in total
  • [41] Automatic text recognition in natural scene and its translation into user defined language
    Bijalwan, Deepak Chandra
    Aggarwal, Alok
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 324 - 329
  • [42] Using linguistic cues for the automatic recognition of personality in conversation and text
    Mairesse, François
    Walker, Marilyn A.
    Mehl, Matthias R.
    Moore, Roger K.
    Journal of Artificial Intelligence Research, 2007, 30 : 457 - 500
  • [43] Using linguistic cues for the automatic recognition of personality in conversation and text
    Mairesse, Francois
    Walker, Marilyn A.
    Mehl, Matthias R.
    Moore, Roger K.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2007, 30 : 457 - 500
  • [44] Testing of energy minimizing methods in image preprocessing using the PICASSO system
    Gribkov, IV
    Koltsov, PP
    Kotovich, NV
    Kravchenko, AA
    Koutsaev, AS
    Nikolaev, VK
    Zakharov, AV
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 233 - 238
  • [45] Methods for automatic term recognition in domain-specific text collections: A survey
    Astrakhantsev, N. A.
    Fedorenko, D. G.
    Turdakov, D. Yu.
    PROGRAMMING AND COMPUTER SOFTWARE, 2015, 41 (06) : 336 - 349
  • [46] Methods for automatic term recognition in domain-specific text collections: A survey
    N. A. Astrakhantsev
    D. G. Fedorenko
    D. Yu. Turdakov
    Programming and Computer Software, 2015, 41 : 336 - 349
  • [47] Towards Computer Vision Based Ancient Coin Recognition in the Wild - Automatic Reliable Image Preprocessing and Normalization
    Conn, Brandon
    Arandjelovic, Ognjen
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1457 - 1464
  • [48] Table of Contents Recognition in OCR Documents using Image-based Machine Learning
    Kosaraju, Sai
    Tsaku, Nelson Zange
    Patel, Pritesh
    Bayramoglu, Tanju
    Modgil, Girish
    Kang, Mingon
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 186 - 189
  • [49] Improving Automatic Image Captioning Using Text Summarization Techniques
    Plaza, Laura
    Lloret, Elena
    Aker, Ahmet
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 165 - +
  • [50] Supporting efficient and reliable content analysis using automatic text processing technology
    Gweon, G
    Rosé, CP
    Wittwer, J
    Nueckles, M
    HUMAN-COMPUTER INTERACTION - INTERACT 2005, PROCEEDINGS, 2005, 3585 : 1112 - 1115