Language model of Chinese character recognition and its application

被引:0
|
作者
Zhang, S [1 ]
Wu, XL [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Engn Ctr Character Recognit, Beijing 100080, Peoples R China
关键词
character recognition; Markov language model; combined model; cache-based model language model; trigram model; 3g-gram model;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a 5-gram combined model that can reflect features of Chinese and Chinese character recognition based on introducing several kinds of Markov language models. The major feature of this model is that it captures both forward and backward statistical characters of one word. The model contains three traditional "trigram components", a "cache component" which reflects short-term patterns of word use, and a "3g-gram component" based on a new classification method that is fast and automatic. Experiment on a 1,500,000-word corpus shows significant improvement achieved by the proposed model.
引用
收藏
页码:1507 / 1513
页数:7
相关论文
共 50 条
  • [21] Video degradation model and its application to character recognition in e-Learning videos
    Sun, J
    Katsuyama, Y
    Naoi, S
    [J]. DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 555 - 558
  • [22] Chinese character structure models for handwritten Chinese character recognition
    Liu, Xia-Bi
    Jia, Yun-De
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2003, 23 (03): : 322 - 326
  • [23] A static candidates generation technique and its application in two-stage LDA chinese character recognition
    Liu Zhibin
    Jin Lianwen
    [J]. PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, 2007, : 571 - +
  • [24] Language Modeling of Chinese Personal Names Based on Character Units for Continuous Chinese Speech Recognition
    Hu, Xinhui
    Yamamoto, Hirofumi
    Kikui, Genichiro
    Sagisaka, Yoshinori
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1874 - +
  • [25] Chinese Character Recognition Based on Character Reconstruction
    Yun Li
    Mei Xie
    [J]. 2009 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLUMES I & II: COMMUNICATIONS, NETWORKS AND SIGNAL PROCESSING, VOL I/ELECTRONIC DEVICES, CIRUITS AND SYSTEMS, VOL II, 2009, : 460 - 463
  • [26] HMMRF: A stochastic model for offline handwritten Chinese character recognition
    Wang, Q
    Zhao, RC
    Chi, ZR
    Feng, DD
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1475 - 1478
  • [27] A hybrid post-processing system for offline handwritten Chinese character recognition based on a statistical language model
    Xu, RF
    Yeung, DS
    Sh, DM
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (03) : 415 - 428
  • [28] APPROACHES TO CHINESE CHARACTER RECOGNITION
    STALLINGS, W
    [J]. PATTERN RECOGNITION, 1976, 8 (02) : 87 - 98
  • [29] MODIFIED QUADRATIC DISCRIMINANT FUNCTIONS AND THE APPLICATION TO CHINESE CHARACTER-RECOGNITION
    KIMURA, F
    TAKASHINA, K
    TSURUOKA, S
    MIYAKE, Y
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (01) : 149 - 153
  • [30] FUZZY-ATTRIBUTE GRAPH WITH APPLICATION TO CHINESE CHARACTER-RECOGNITION
    CHAN, KP
    CHEUNG, YS
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1992, 22 (01): : 153 - 160