Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Amar Jindal [1 ]
机构
[1] UPES,School of Computer Science
关键词
Script identification; Deep learning; Bayesian optimization; CNN-LSTM;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard.
引用
下载
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [21] Identification of handwritten Gujarati alphanumeric script by integrating transfer learning and convolutional neural networks
    Krishn Limbachiya
    Ankit Sharma
    Priyank Thakkar
    Dipak Adhyaru
    Sādhanā, 2022, 47
  • [22] Identification of handwritten Gujarati alphanumeric script by integrating transfer learning and convolutional neural networks
    Limbachiya, Krishn
    Sharma, Ankit
    Thakkar, Priyank
    Adhyaru, Dipak
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2022, 47 (02):
  • [23] Script identification in a handwritten document image using texture features
    Hiremath, P. S.
    Shivashankar, S.
    Pujari, Jagdeesh D.
    Mouneswara, V.
    2010 IEEE 2ND INTERNATIONAL ADVANCE COMPUTING CONFERENCE, 2010, : 110 - +
  • [24] Writer identification in handwritten documents
    Siddiqi, Imran Ahmed
    Vincent, Nicole
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 108 - 112
  • [25] Dewarping Machine Printed Documents of Gurmukhi Script
    Sharma, Dharam Veer
    Wadhwa, Shilpi
    INFORMATION SYSTEMS FOR INDIAN LANGUAGES, 2011, 139 : 117 - 123
  • [26] Page Segmentation for Historical Handwritten Documents Using Fully Convolutional Networks
    Xu, Yue
    He, Wenhao
    Yin, Fei
    Liu, Cheng-Lin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 541 - 546
  • [27] Statistical script independent word spotting in offline handwritten documents
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    PATTERN RECOGNITION, 2014, 47 (03) : 1039 - 1050
  • [28] Script Identification of Multi-Script Documents: A Survey
    Ubul, Kurban
    Tursun, Gulzira
    Aysa, Alimjan
    Impedovo, Donato
    Pirlo, Giuseppe
    Yibulayin, Tuergen
    IEEE ACCESS, 2017, 5 : 6546 - 6559
  • [29] Script Identification from Offline Handwritten Characters Using Combination of Features
    Bhardwaj, Akshi
    Jindal, Simpel Rani
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2016, VOL 2, 2017, 547 : 170 - 177
  • [30] Writer Identification from Handwritten Devanagari Script
    Halder, Chayan
    Thakur, Kishore
    Phadikar, Santanu
    Roy, Kaushik
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, 2015, 340 : 497 - 505