Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Amar Jindal [1 ]
机构
[1] UPES,School of Computer Science
关键词
Script identification; Deep learning; Bayesian optimization; CNN-LSTM;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard.
引用
下载
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [1] A Review on Methods of Script Identification for Printed and Handwritten Documents
    Gaygole, Aditi
    Rojatkar, Dinesh
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [2] Script Identification from Handwritten Documents using SIFT Method
    Rajput, G. G.
    Ummapure, Suryakant Baburao
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 520 - 526
  • [3] Script identification in printed bilingual documents
    Dhanya, D
    Ramakrishnan, AG
    DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 13 - 24
  • [4] Script identification in printed bilingual documents
    D. Dhanya
    A. G. Ramakrishnan
    Peeta Basa Pati
    Sadhana, 2002, 27 : 73 - 82
  • [5] Script identification in printed bilingual documents
    Dhanya, D
    Ramakrishnan, AG
    Pati, PB
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1): : 73 - 82
  • [6] Script Identification for Printed and Handwritten Indian Documents: An Empirical Study of Different Feature Classifier Combinations
    Rani, Rajneesh
    Dhir, Renu
    Kakkar, Deepti
    Sharma, Nonita
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2022, 22 (03)
  • [7] Offline script recognition from handwritten and printed multilingual documents: a survey
    Deepak Sinwar
    Vijaypal Singh Dhaka
    Nitesh Pradhan
    Saumya Pandey
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 97 - 121
  • [8] Offline script recognition from handwritten and printed multilingual documents: a survey
    Sinwar, Deepak
    Dhaka, Vijaypal Singh
    Pradhan, Nitesh
    Pandey, Saumya
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021, 24 (1-2) : 97 - 121
  • [9] Segmentation of Merged Lines and Script Identification in Handwritten Bilingual Documents
    Zinjore, Ranjana S.
    Ramteke, R. J.
    Pathak, Varsha M.
    PROCEEDINGS OF THE 9TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION (FIRE 2017), 2017, : 29 - 32
  • [10] Statistical comparison of classifiers for script identification from multi-script handwritten documents
    Singh, Pawan Kumar
    Sarkar, Ram
    Das, Nibaran
    Basu, Subhadip
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2014, 1 (02) : 152 - 172