Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Amar Jindal [1 ]
机构
[1] UPES,School of Computer Science
关键词
Script identification; Deep learning; Bayesian optimization; CNN-LSTM;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard.
引用
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [31] Script and language identification for handwritten document images
    Judith Hochberg
    Kevin Bowers
    Michael Cannon
    Patrick Kelly
    International Journal on Document Analysis and Recognition, 1999, 2 (2-3) : 45 - 52
  • [32] Radon and Wavelet Transforms for Handwritten Script Identification
    Veershetty, C.
    Pardeshi, Rajmohan
    Hangarge, Mallikarjun
    Dhawale, Chitra
    AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696 : 755 - 765
  • [33] Improved word-level handwritten Indic script identification by integrating small convolutional neural networks
    Ukil, Soumya
    Ghosh, Swarnendu
    Obaidullah, Sk Md
    Santosh, K. C.
    Roy, Kaushik
    Das, Nibaran
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 2829 - 2844
  • [34] Improved word-level handwritten Indic script identification by integrating small convolutional neural networks
    Soumya Ukil
    Swarnendu Ghosh
    Sk Md Obaidullah
    K. C. Santosh
    Kaushik Roy
    Nibaran Das
    Neural Computing and Applications, 2020, 32 : 2829 - 2844
  • [35] Script identification from Indian documents
    Joshi, GD
    Carg, S
    Sivaswamy, J
    DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS, 2006, 3872 : 255 - 267
  • [36] Writer Identification in Noisy Handwritten Documents
    Ni, Karl
    Callier, Patrick
    Hatch, Bradley
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1177 - 1186
  • [37] Script-independent text line segmentation in freestyle handwritten documents
    Li, Yi
    Zheng, Yefeng
    Doermann, David
    Jaeger, Stefan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) : 1313 - 1329
  • [38] Language Identification from Handwritten Documents
    Mioulet, Luc
    Garain, Utpal
    Chatelain, Clement
    Barlas, Philippine
    Paquet, Thierry
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 676 - 680
  • [39] A Comparison of Recognition Strategies for Printed/Handwritten Composite Documents
    Moysset, Bastien
    Messina, Ronaldo
    Kermorvant, Christopher
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 158 - 163
  • [40] Survey of Mathematical Expression Recognition for Printed and Handwritten Documents
    Aggarwal, Ridhi
    Pandey, Shilpa
    Tiwari, Anil Kumar
    Harit, Gaurav
    IETE TECHNICAL REVIEW, 2022, 39 (06) : 1245 - 1253