DNN-HMM for Large Vocabulary Mongolian Offline Handwriting Recognition

Cited by: 0
Authors
Fan Daoerji [1]
Gao Guanglai [1]
Affiliations
[1] Inner Mongolia Univ, Coll Comp Sci, Hohhot, Peoples R China
Keywords
Mongolian; Handwriting Recognition; HMM; DNN
DOI
10.1109/ICFHR.2016.23
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a large-vocabulary Mongolian offline handwriting recognition system based on a hybrid hidden Markov model (HMM)-deep neural network (DNN) architecture, an approach that has shown superior performance on automatic speech recognition (ASR) tasks. We select 50 sub-characters from all shapes of Mongolian letters as the smallest modeling units. First, a set of intensity features is extracted from each segmented word using a sliding window that moves across the word image. Then, multiple context-dependent Gaussian mixture model (GMM)-HMMs are trained on these features. Finally, a DNN with 4 hidden layers is trained as a frame classifier, where the class labels are the state labels assigned to each input frame through forced alignment with the context-dependent model. To validate the proposed model, extensive experiments were carried out on the MHW database, which contains 100,000 handwritten words in the training set, 5,000 in Test set I, and 14,085 in Test set II. The DNN-HMM trained on raw image pixels yields the best performance, with an accuracy of 97.61% on Test set I and 94.14% on Test set II.
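The sliding-window step in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name, the window height, the step size, and the choice to slide along the image rows (on the assumption that this follows the vertical writing direction of Mongolian script) are all hypothetical. Plain Python lists are used to keep the example dependency-free.

```python
def sliding_window_frames(image, window=3, step=1):
    """Cut a word image into overlapping frames along its first axis.

    image  : list of rows, each row a list of pixel intensities
    window : number of consecutive rows per frame (hypothetical value)
    step   : rows to advance between frames (hypothetical value)
    Returns a list of flat feature vectors, one per frame.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    frames = []
    # Stop once the window would run past the last row of the image.
    for start in range(0, len(image) - window + 1, step):
        frame = []
        for row in image[start:start + window]:
            frame.extend(row)  # flatten the window into one raw-pixel vector
        frames.append(frame)
    return frames


# Toy 5-row, 2-column "image" standing in for a segmented word.
toy_image = [[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]
frames = sliding_window_frames(toy_image, window=3, step=1)
```

Each resulting vector would play the role of one input frame; in a hybrid system of the kind described, frames like these are what receive state labels through forced alignment before the frame classifier is trained.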
Pages: 72-77
Number of pages: 6