Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network

被引:0
|
作者
Hassan El Bahi
Abdelkarim Zatni
机构
[1] Ibnou Zohr University,Laboratory of Metrology and Information Processing
[2] Cadi Ayyad University,Laboratory of Applied Mathematics and Computer Science
来源
关键词
Text recognition; Document image; Smartphone; Convolutional neural network; Recurrent neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic text recognition in document images is an important task in many real-world applications. Several systems have been proposed to accomplish this task. However, a little attention has been given to document images obtained by mobile phones. To meet this need, we propose a new system that integrates preprocessing, features extraction and classification in order to recognize text contained in the document images acquired by a smartphone. The preprocessing phase is applied to locate the text region, and then segment that region into text line images. In the second phase, a sliding window divides the text-line image into a sequence of frames; afterwards a deep convolutional neural network (CNN) model is used to extract features from each frame. Finally, an architecture that combines the bidirectional recurrent neural network (RNN), the gated recurrent units (GRU) block and the connectionist temporal classification (CTC) layer is explored to ensure the classification phase. The proposed system has been tested on the ICDAR2015 Smartphone document OCR dataset and the experimental results show that the proposed system is capable to achieve promising recognition rates.
引用
收藏
页码:26453 / 26481
页数:28
相关论文
共 50 条
  • [1] Text recognition in document images obtained by a smartphone based on deep convolutional and recurrent neural network
    El Bahi, Hassan
    Zatni, Abdelkarim
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 26453 - 26481
  • [2] Cursive Text Recognition in Natural Scene Images Using Deep Convolutional Recurrent Neural Network
    Chandio, Asghar Ali
    Asikuzzaman, MD.
    Pickering, Mark R.
    Leghari, Mehwish
    [J]. IEEE ACCESS, 2022, 10 : 10062 - 10078
  • [3] Deep features based convolutional neural network model for text and non-text region segmentation from document images
    Umer, Saiyed
    Mondal, Ranjan
    Pandey, Hari Mohan
    Rout, Ranjeet Kumar
    [J]. APPLIED SOFT COMPUTING, 2021, 113
  • [4] Text Baseline Recognition Using a Recurrent Convolutional Neural Network
    Woedlinger, Matthias
    Sablatnig, Robert
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4673 - 4679
  • [5] Deep Convolutional Neural Network for Recognizing the Images of Text Documents
    Golovko, Vladimir
    Kroshchanka, Aliaksandr
    Mikhno, Egor
    Komar, Myroslav
    Sachenko, Anatoliy
    Bezobrazov, Sergei
    Shylinska, Inna
    [J]. MOMLET&DS-2019: MODERN MACHINE LEARNING TECHNOLOGIES AND DATA SCIENCE, 2019, 2386 : 297 - 306
  • [6] A Sketch Recognition Method Based on Deep Convolutional-Recurrent Neural Network
    [J]. 2018, Institute of Computing Technology (30):
  • [7] Jaundice detection by deep convolutional neural network using smartphone images
    Su, Tung-Hung
    Li, Jia-Wei
    Chen, Shann-Ching
    Jiang, Pei-Ying
    Kao, Jia-Horng
    Chou, Cheng-Fu
    [J]. JOURNAL OF HEPATOLOGY, 2021, 75 : S629 - S629
  • [8] Very Deep Recurrent Convolutional Neural Network for Object Recognition
    Brahimi, Sourour
    Ben Aoun, Najib
    Ben Amar, Chokri
    [J]. NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [9] Scene text recognition using residual convolutional recurrent neural network
    Lei, Zhengchao
    Zhao, Sanyuan
    Song, Hongmei
    Shen, Jianbing
    [J]. MACHINE VISION AND APPLICATIONS, 2018, 29 (05) : 861 - 871
  • [10] Scene text recognition using residual convolutional recurrent neural network
    Zhengchao Lei
    Sanyuan Zhao
    Hongmei Song
    Jianbing Shen
    [J]. Machine Vision and Applications, 2018, 29 : 861 - 871