Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition

被引:0
|
作者
Sanasam Inunganbi
Prakash Choudhary
Khumanthem Manglem
机构
[1] National Institute of Technology,
[2] National Institute of Technology,undefined
来源
The Visual Computer | 2021年 / 37卷
关键词
Meitei Mayek; Indian language; Character recognition; Segmentation;
D O I
暂无
中图分类号
学科分类号
摘要
A peculiar Indian Script Meitei Mayek has experienced a resurgence in the last few years and gets very little attention in handwriting research due to recently insurgence and limited sources. The objective of this paper is two folds; firstly, develop two different datasets: Mayek27 having 4900 isolated Meitei Mayek alphabets and MM (Meitei Mayek) dataset of 189 full-length handwritten text page. Secondly, develop a recognition system on the Mayek27 dataset using convolutional neural network and segmentation algorithms (text-lines, words, and characters) on the full-length Meitei Mayek handwritten text. A recognition rate of 99.02%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$99.02\%$$\end{document} is achieved using three layers of convolutional layers with a filter size of 3×3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 3$$\end{document} with 16, 32, and 96 kernels. In MM text dataset, the text-line and word segmentation are performed concurrently on 809 lines by tracking space between lines in a novel approach based on horizontal projection histogram and monitoring vertical projection histogram along the run-length of segmentation. Various constraints like skew, curve, close, and touching text-lines are incorporated, and the segmentation algorithm results are 91.84% and 88.96% for text-line and word, respectively. Furthermore, characters are segmented by headline removal, and connected component analysis achieves an accuracy of 91.12%.
引用
收藏
页码:291 / 305
页数:14
相关论文
共 50 条
  • [1] Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition
    Inunganbi, Sanasam
    Choudhary, Prakash
    Manglem, Khumanthem
    [J]. VISUAL COMPUTER, 2021, 37 (02): : 291 - 305
  • [2] Towards a Complete Character Set Meitei Mayek Handwritten Character Recognition
    Hijam, Deena
    Saharia, Sarat
    Nirmal, Yumnam
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [3] Convolutional Neural Network Based Meitei Mayek Handwritten Character Recognition
    Hijam, Deena
    Saharia, Sarat
    [J]. INTELLIGENT HUMAN COMPUTER INTERACTION, 2018, 11278 : 207 - 219
  • [4] On developing complete character set Meitei Mayek handwritten character database
    Hijam, Deena
    Saharia, Sarat
    [J]. VISUAL COMPUTER, 2022, 38 (02): : 525 - 539
  • [5] Local texture descriptors and projection histogram based handwritten Meitei Mayek character recognition
    Inunganbi, Sanasam
    Choudhary, Prakash
    Singh, Khumanthem Manglem
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) : 2813 - 2836
  • [6] A Neural Network Based Handwritten Meitei Mayek Alphabet Optical Character Recognition System
    Laishram, Romesh
    Singh, Pheiroijam Bebison
    Singh, Thokchom Suka Deba
    Anilkumar, Sapam
    Singh, Angom Umakanta
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 997 - 1001
  • [7] On developing complete character set Meitei Mayek handwritten character database
    Deena Hijam
    Sarat Saharia
    [J]. The Visual Computer, 2022, 38 : 525 - 539
  • [8] Local texture descriptors and projection histogram based handwritten Meitei Mayek character recognition
    Sanasam Inunganbi
    Prakash Choudhary
    Khumanthem Manglem Singh
    [J]. Multimedia Tools and Applications, 2020, 79 : 2813 - 2836
  • [9] Bangla-Meitei Mayek scripts handwritten character recognition using Convolutional Neural Network
    Abhishek Hazra
    Prakash Choudhary
    Sanasam Inunganbi
    Mainak Adhikari
    [J]. Applied Intelligence, 2021, 51 : 2291 - 2311
  • [10] Bangla-Meitei Mayek scripts handwritten character recognition using Convolutional Neural Network
    Hazra, Abhishek
    Choudhary, Prakash
    Inunganbi, Sanasam
    Adhikari, Mainak
    [J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2291 - 2311