Character and numeral recognition for non-Indic and Indic scripts: a survey

被引:46
|
作者
Kumar, Munish [1 ]
Jindal, M. K. [2 ]
Sharma, R. K. [3 ]
Jindal, Simpel Rani [4 ]
机构
[1] GZS Campus Coll Engn & Technol, Dept Comp Applicat, Bathinda, Punjab, India
[2] Panjab Univ, Reg Ctr, Dept Comp Sci & Applicat, Muktsar, Punjab, India
[3] Thapar Univ, Dept Comp Sci & Engn, Patiala, Punjab, India
[4] Yadavindra Coll Engn, Comp Sci & Engn, Bathinda, Punjab, India
关键词
OCR; Character recognition; Non-Indic scripts; Indic scripts; JAPANESE TEXT RECOGNITION; HANDWRITING RECOGNITION; ONLINE; OCR; SEGMENTATION; EXTRACTION; FUSION; BANGLA; SYSTEM;
D O I
10.1007/s10462-017-9607-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A collection of different scripts is employed in writing languages throughout the world. Character and numeral recognition of a particular script is a key area in the field of pattern recognition. In this paper, we have presented a comprehensive survey on character and numeral recognition of non-Indic and Indic scripts. Many researchers have done work on character and numeral recognition from the most recent couple of years. In perspective of this, few strategies for character/numeral have been developed so far. There are an immense number of frameworks available for printed and handwritten character recognition for non-Indic scripts. But, only a limited number of systems are offered for character/numeral recognition of Indic scripts. However, few endeavors have been made on the recognition of Bangla, Devanagari, Gurmukhi, Kannada, Oriya and Tamil scripts. In this paper, we have additionally examined major challenges/issues for character/numeral recognition. The efforts in two directions (non-Indic and Indic scripts) are reflected in this paper. When compared with non-Indic scripts, the research on character recognition of Indic scripts has not achieved that perfection yet. The techniques used for recognition of non-Indic scripts may be used for recognition of Indic scripts (printed/handwritten text) and vice versa to improve the recognition rates. It is also noticed that the research in this field is quietly thin and still more research is to be done, particularly in the case of handwritten Indic scripts documents.
引用
收藏
页码:2235 / 2261
页数:27
相关论文
共 50 条
  • [21] Identification of Indic Scripts on Torn-Documents
    Chanda, Sukalpa
    Franke, Katrin
    Pal, Umapada
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 713 - 717
  • [22] Towards a Robust OCR System for Indic Scripts
    Krishnan, Praveen
    Sankaran, Naveen
    Singh, Ajeet Kumar
    Jawahar, C. V.
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 141 - 145
  • [23] Strike off removal in Indic scripts with transfer learning
    Nigam, Shivangi
    Behera, Adarsh Prasad
    Gogoi, Manas
    Verma, Shekhar
    Nagabhushan, P.
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17): : 12927 - 12943
  • [24] A survey of Early Middle Indic
    Jamison, SW
    JOURNAL OF THE AMERICAN ORIENTAL SOCIETY, 2003, 123 (02) : 467 - 467
  • [25] IIIT-INDIC-HW-WORDS: A Dataset for Indic Handwritten Text Recognition
    Gongidi, Santhoshini
    Jawahar, C., V
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 444 - 459
  • [26] Strike off removal in Indic scripts with transfer learning
    Shivangi Nigam
    Adarsh Prasad Behera
    Manas Gogoi
    Shekhar Verma
    P. Nagabhushan
    Neural Computing and Applications, 2023, 35 : 12927 - 12943
  • [27] Scene text recognition: an Indic perspective
    Vijayan, Vasanthan P.
    Chanda, Sukalpa
    Doermann, David
    Krishnan, Narayanan C.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024,
  • [28] An Empirical Study of Effectiveness of Post-processing in Indic Scripts
    Vinitha, V. S.
    Mathew, Minesh
    Jawahar, C. V.
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 7, 2017, : 32 - 36
  • [29] Word-level Script Identification for Handwritten Indic scripts
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    Doermann, David
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1106 - 1110
  • [30] Writer Identification in Indic Scripts: A Stroke Distribution Based Approach
    Reddy, Santhoshini
    Andrew, Chris
    Pal, Umapada
    Alaei, Alireza
    Viswanath, P.
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 947 - 952