Topic Language Model Adaption for Recognition of Homologous Offline Handwritten Chinese Text Image

被引:3
|
作者
Wang, Yanwei [1 ]
Ding, Xiaoqing [1 ]
Liu, Changsong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
Character based bi-gram; offline handwritten Chinese text image recognition; over-segmentation and merging; topic language model;
D O I
10.1109/LSP.2014.2308572
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As the content of a full text page usually focuses on a specific topic, a topic language model adaption method is proposed to improve the recognition performance of homologous offline handwritten Chinese text image. Firstly, the text images are recognized with a character based bi-gram language model. Secondly, the topic of the text image is matched adaptively. Finally, the text image is recognized again with the best matched topic language model. To obtain a tradeoff between the recognition performance and computational complexity, a restricted topic language model adaption method is further presented. The methods have been evaluated on 100 offline Chinese text images. Compared to the general language model, the topic language model adaption has reduced the relative error rate by 11.94%. The restricted topic language model has lessened the running time by 19.22% at the cost of losing 0.35% of the accuracy.
引用
收藏
页码:550 / 553
页数:4
相关论文
共 50 条
  • [1] Parsimonious HMMs for Offline Handwritten Chinese Text Recognition
    Wang, Wenchao
    Du, Jun
    Wang, Zi-Rui
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 145 - 150
  • [2] Unsupervised language model adaptation for handwritten Chinese text recognition
    Wang, Qiu-Feng
    Yin, Fei
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2014, 47 (03) : 1202 - 1216
  • [3] Optimizing the integration of a statistical language model in HMM based offline handwritten text recognition
    Zimmermann, M
    Bunke, H
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 541 - 544
  • [4] Offline Recognition of Malayalam Handwritten Text
    Shanjana, C.
    James, Ajay
    8TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING, INTER-ENG 2014, 2015, 19 : 772 - 779
  • [5] N-gram language models for offline handwritten text recognition
    Zimmermann, M
    Bunke, H
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 203 - 208
  • [6] A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition
    Li, Nanxi
    Jin, Lianwen
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3664 - 3668
  • [7] Deep Neural Network based Hidden Markov Model for Offline Handwritten Chinese Text Recognition
    Du, Jun
    Wang, Zi-Rui
    Zhai, Jian-Fang
    Hu, Jin-Shui
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3428 - 3433
  • [8] Searching from the Prediction of Visual and Language Model for Handwritten Chinese Text Recognition
    Liu, Brian
    Sun, Weicong
    Kang, Wenjing
    Xu, Xianchao
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 274 - 288
  • [9] Retrieval-based language model adaptation for handwritten Chinese text recognition
    Hu, Shuying
    Wang, Qiufeng
    Huang, Kaizhu
    Wen, Min
    Coenen, Frans
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2023, 26 (02) : 109 - 119
  • [10] Retrieval-based language model adaptation for handwritten Chinese text recognition
    Shuying Hu
    Qiufeng Wang
    Kaizhu Huang
    Min Wen
    Frans Coenen
    International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 109 - 119