Script Identification for Printed and Handwritten Indian Documents: An Empirical Study of Different Feature Classifier Combinations

被引:5
|
作者
Rani, Rajneesh [1 ]
Dhir, Renu [1 ]
Kakkar, Deepti [2 ]
Sharma, Nonita [1 ]
机构
[1] Dr BR Ambedkar Natl Inst Technol, Dept Comp Sci & Engn, Jalandhar 144011, Punjab, India
[2] Dr BR Ambedkar Natl Inst Technol, Dept Elect & Commun Engn, Jalandhar 144011, Punjab, India
关键词
Script identification; page level; texture features; machine learning; Gabor; wavelet; INVARIANT TEXTURE FEATURES; ROTATION;
D O I
10.1142/S0219467821400118
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The identification of script in a document page image is the first step for an OCR system processing multi-script documents. In this multilingual/multiscript world, document processing systems relying on the OCR that need human involvement to select the appropriate OCR package is definitely undesirable and inefficient. The development of robust and efficient methods for automatic script identification of a document is a subject of major importance for automatic document processing in a multilingual/multiscript environment. Thus, the basic objective is to come up with some intuitive methods having straightforward implementation without compromising with efficiency. The aim of this work is to evaluate state-of-the-art feature extraction and classification techniques in the field of automatic script identification of printed and handwritten documents and to propose the best combination for the same.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [1] A Review on Methods of Script Identification for Printed and Handwritten Documents
    Gaygole, Aditi
    Rojatkar, Dinesh
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [2] Writer Identification System for Handwritten Gurmukhi Characters: Study of Different Feature-Classifier Combinations
    Sakshi
    Garg, Naresh Kumar
    Kumar, Munish
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 125 - 131
  • [3] Script identification in handwritten and printed documents using convolutional recurrent connection
    Amar Jindal
    Multimedia Tools and Applications, 2025, 84 (9) : 5549 - 5563
  • [4] Robust shared feature learning for script and handwritten/machine-printed identification
    Feng, Ziyong
    Yang, Zhaoyang
    Jin, Lianwen
    Huang, Shuangping
    Sun, Jun
    PATTERN RECOGNITION LETTERS, 2017, 100 : 6 - 13
  • [5] Script identification in printed bilingual documents
    Dhanya, D
    Ramakrishnan, AG
    DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 13 - 24
  • [6] Script identification in printed bilingual documents
    D. Dhanya
    A. G. Ramakrishnan
    Peeta Basa Pati
    Sadhana, 2002, 27 : 73 - 82
  • [7] Script identification in printed bilingual documents
    Dhanya, D
    Ramakrishnan, AG
    Pati, PB
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1): : 73 - 82
  • [8] A Study of Different Classifier Combination Approaches for Handwritten Indic Script Recognition
    Mukhopadhyay, Anirban
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    JOURNAL OF IMAGING, 2018, 4 (02)
  • [9] Script identification from Indian documents
    Joshi, GD
    Carg, S
    Sivaswamy, J
    DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS, 2006, 3872 : 255 - 267
  • [10] Offline script recognition from handwritten and printed multilingual documents: a survey
    Deepak Sinwar
    Vijaypal Singh Dhaka
    Nitesh Pradhan
    Saumya Pandey
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 97 - 121