Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Amar Jindal [1 ]
机构
[1] UPES,School of Computer Science
关键词
Script identification; Deep learning; Bayesian optimization; CNN-LSTM;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard.
引用
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [41] A Novel Technique for Line Segmentation in Offline Handwritten Gurmukhi Script Documents
    Kumar, Munish
    Jindal, M. K.
    Sharma, R. K.
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2017, 40 (04): : 273 - 277
  • [42] A Novel Technique for Line Segmentation in Offline Handwritten Gurmukhi Script Documents
    Munish Kumar
    M. K. Jindal
    R. K. Sharma
    National Academy Science Letters, 2017, 40 : 273 - 277
  • [43] Automatic discrimination between printed and handwritten text in documents
    da Silva, Lincoln Faria
    Conci, Aura
    Sanchez, Angel
    2009 XXII BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING (SIBGRAPI 2009), 2009, : 261 - +
  • [44] HMM Based Keyword Spotting System in Printed/Handwritten Arabic/Latin Documents with Identification Stage
    Rouhou, Ahmed Cheikh
    Kessentini, Yousri
    Kanoun, Slim
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 309 - 320
  • [45] Automatic Indic script identification from handwritten documents: page, block, line and word-level approach
    Obaidullah, Sk Md
    Santosh, K. C.
    Halder, Chayan
    Das, Nibaran
    Roy, Kaushik
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (01) : 87 - 106
  • [46] Handwritten Texts for Personality Identification Using Convolutional Neural Networks
    Valdez-Rodriguez, Jose E.
    Calvo, Hiram
    Felipe-Riveron, Edgardo M.
    PATTERN RECOGNITION AND INFORMATION FORENSICS, 2019, 11188 : 140 - 145
  • [47] Automatic Indic script identification from handwritten documents: page, block, line and word-level approach
    Sk Md Obaidullah
    K. C. Santosh
    Chayan Halder
    Nibaran Das
    Kaushik Roy
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 87 - 106
  • [48] Handwritten Indic Script Identification in Multi-Script Document Images: A Survey
    Obaidullah, Sk Md
    Santosh, K. C.
    Das, Nibaran
    Halder, Chayan
    Roy, Kaushik
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (10)
  • [49] Script Separation in Machine Printed Bilingual (Devnagari and Gurumukhi) Documents Using Morphological Approach
    Singh, Sukhvir
    Kumar, Anil
    Shaw, Dinesh Kr.
    Ghosh, D.
    2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
  • [50] Performance optimization for handwritten Gujarati alphanumeric script identification
    Limbachiya, Krishn
    Sharma, Ankit
    Thakkar, Priyank
    Adhyaru, Dipak
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2023, 48 (04):