Character and numeral recognition for non-Indic and Indic scripts: a survey

被引:46
|
作者
Kumar, Munish [1 ]
Jindal, M. K. [2 ]
Sharma, R. K. [3 ]
Jindal, Simpel Rani [4 ]
机构
[1] GZS Campus Coll Engn & Technol, Dept Comp Applicat, Bathinda, Punjab, India
[2] Panjab Univ, Reg Ctr, Dept Comp Sci & Applicat, Muktsar, Punjab, India
[3] Thapar Univ, Dept Comp Sci & Engn, Patiala, Punjab, India
[4] Yadavindra Coll Engn, Comp Sci & Engn, Bathinda, Punjab, India
关键词
OCR; Character recognition; Non-Indic scripts; Indic scripts; JAPANESE TEXT RECOGNITION; HANDWRITING RECOGNITION; ONLINE; OCR; SEGMENTATION; EXTRACTION; FUSION; BANGLA; SYSTEM;
D O I
10.1007/s10462-017-9607-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A collection of different scripts is employed in writing languages throughout the world. Character and numeral recognition of a particular script is a key area in the field of pattern recognition. In this paper, we have presented a comprehensive survey on character and numeral recognition of non-Indic and Indic scripts. Many researchers have done work on character and numeral recognition from the most recent couple of years. In perspective of this, few strategies for character/numeral have been developed so far. There are an immense number of frameworks available for printed and handwritten character recognition for non-Indic scripts. But, only a limited number of systems are offered for character/numeral recognition of Indic scripts. However, few endeavors have been made on the recognition of Bangla, Devanagari, Gurmukhi, Kannada, Oriya and Tamil scripts. In this paper, we have additionally examined major challenges/issues for character/numeral recognition. The efforts in two directions (non-Indic and Indic scripts) are reflected in this paper. When compared with non-Indic scripts, the research on character recognition of Indic scripts has not achieved that perfection yet. The techniques used for recognition of non-Indic scripts may be used for recognition of Indic scripts (printed/handwritten text) and vice versa to improve the recognition rates. It is also noticed that the research in this field is quietly thin and still more research is to be done, particularly in the case of handwritten Indic scripts documents.
引用
收藏
页码:2235 / 2261
页数:27
相关论文
共 50 条
  • [41] A survey on optical character recognition for Bangla and Devanagari scripts
    Bag, Soumen
    Harit, Gaurav
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2013, 38 (01): : 133 - 168
  • [42] A new dataset of word-level offline handwritten numeral images from four official Indic scripts and its benchmarking using image transform fusion
    Obaidullah, Sk Md
    Halder, Chayan
    Das, Nibaran
    Roy, Kaushik
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2016, 4 (01) : 1 - 20
  • [43] A survey on optical character recognition for Bangla and Devanagari scripts
    SOUMEN BAG
    GAURAV HARIT
    Sadhana, 2013, 38 : 133 - 168
  • [44] Indic script family and its offline handwriting recognition for characters/digits and words: a comprehensive survey
    Singh, Sukhdeep
    Sharma, Anuj
    Chauhan, Vinod Kumar
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL3) : S3003 - S3055
  • [45] IndicSTR12: A Dataset for Indic Scene Text Recognition
    Lunia, Harsh
    Mondal, Ajoy
    Jawahar, C.V.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14193 LNCS : 233 - 250
  • [46] Recognition of Handwritten Indic Script Numerals Using Mojette Transform
    Singh, Pawan Kumar
    Das, Supratim
    Sarkar, Ram
    Nasipuri, Mita
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COMMUNICATION, 2017, 458 : 459 - 466
  • [47] Separating Indic Scripts with 'matra'-A Precursor to Script Identification in Multi-script Documents
    Obaidullah, Sk. Md.
    Goswami, Chitrita
    Santosh, K. C.
    Halder, Chayan
    Das, Nibaran
    Roy, Kaushik
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2016, VOL 1, 2017, 459 : 205 - 214
  • [48] Recognition of handwritten indic script using clonal selection algorithm
    Garain, Utpal
    Chakraborty, Mangal P.
    Dasgupta, Dipankar
    ARTIFICIAL IMMUNE SYSTEMS, PROCEEDINGS, 2006, 4163 : 256 - 266
  • [49] An AI-Based Detection System for Mudrabharati: A Novel Unified Fingerspelling System for Indic Scripts
    Ashwin, F. Amal Jude
    Chakravarthy, V. Srinivasa
    Kopparapu, Sunil Kumar
    TEXT, SPEECH, AND DIALOGUE, TSD 2021, 2021, 12848 : 425 - 434
  • [50] A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts
    Sarkhel, Ritesh
    Das, Nibaran
    Das, Aritra
    Kundu, Mahantapas
    Nasipuri, Mita
    PATTERN RECOGNITION, 2017, 71 : 78 - 93