Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Amar Jindal [1 ]
机构
[1] UPES,School of Computer Science
关键词
Script identification; Deep learning; Bayesian optimization; CNN-LSTM;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard.
引用
下载
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [11] Word-Level Script Identification from Handwritten Multi-script Documents
    Singh, Pawan Kumar
    Mondal, Arafat
    Bhowmik, Showmik
    Sarkar, Ram
    Nasipuri, Mita
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 551 - 558
  • [12] Page-level Script Identification from Multi-script Handwritten Documents
    Singh, Pawan Kumar
    Dalal, Santu Kumar
    Sarkar, Ram
    Nasipuri, Mita
    2015 THIRD INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT), 2015,
  • [13] Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage
    Cheikhrouhou, Ahmed
    Kessentini, Yousri
    Kanoun, Slim
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 9201 - 9215
  • [14] Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage
    Ahmed Cheikhrouhou
    Yousri Kessentini
    Slim Kanoun
    Neural Computing and Applications, 2020, 32 : 9201 - 9215
  • [15] Extreme learning machine for handwritten Indic script identification in multiscript documents
    Obaidullah, Sk. Md.
    Bose, Amitava
    Mukherjee, Himadri
    Santosh, K. C.
    Das, Nibaran
    Roy, Kaushik
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [16] Separating Indic Scripts with matra for Effective Handwritten Script Identification in Multi-Script Documents
    Obaidullah, Sk Md
    Goswami, Chitrita
    Santosh, K. C.
    Das, Nibaran
    Halder, Chayan
    Roy, Kaushik
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (05)
  • [17] Fractal-Based System for Arabic/Latin, Printed/Handwritten Script Identification
    Ben Moussa, S.
    Zahour, A.
    Benabdelhafid, A.
    Alimi, A. M.
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2643 - 2646
  • [18] Robust shared feature learning for script and handwritten/machine-printed identification
    Feng, Ziyong
    Yang, Zhaoyang
    Jin, Lianwen
    Huang, Shuangping
    Sun, Jun
    PATTERN RECOGNITION LETTERS, 2017, 100 : 6 - 13
  • [19] A Texture based approach to Word-level Script Identification from Multi-script Handwritten Documents
    Singh, Pawan Kumar
    Khan, Aparajita
    Sarkar, Ram
    Nasipuri, Mita
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 228 - 232
  • [20] Scene Text Script Identification with Convolutional Recurrent Neural Networks
    Mei, Jieru
    Dai, Luo
    Shi, Baoguang
    Bai, Xiang
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4053 - 4058