OPEN VOCABULARY HANDWRITING RECOGNITION USING COMBINED WORD-LEVEL AND CHARACTER-LEVEL LANGUAGE MODELS

Citations: 0
Authors
Kozielski, Michal [1]
Rybach, David [1]
Hahn, Stefan [1]
Schlueter, Ralf [1]
Ney, Hermann [1]
Affiliation
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Aachen, Germany
Keywords
open vocabulary recognition; handwriting recognition; character-based language models; NORMALIZATION; COMBINATION;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206; 082403;
Abstract
In this paper, we present a unified search strategy for open vocabulary handwriting recognition using weighted finite state transducers. In addition to a standard word-level language model, we introduce a separate n-gram character-level language model for out-of-vocabulary word detection and recognition. The probabilities assigned by these two models are combined into one Bayes decision rule. We evaluate the proposed method on the IAM database of English handwriting. The word error rate improves from 22.2% to 17.3% compared to the closed-vocabulary scenario and the best published result.
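The abstract does not give the exact formulation, so the following is only an illustrative sketch of one common way such a combination can work: the word-level model reserves probability mass for an out-of-vocabulary class, and the character-level model scores the spelling of any word that falls into that class. All names and probabilities here are hypothetical, and a uniform character model stands in for the n-gram character model to keep the example short.

```python
import math

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

# Hypothetical toy word-level unigram model with an out-of-vocabulary class.
# These probabilities are made up for illustration, not taken from the paper.
WORD_LM = {"the": 0.5, "cat": 0.3, "<unk>": 0.2}


def char_logprob(word):
    """Log-probability of a spelling under a uniform character model.

    Each position emits one of the letters or an end-of-word symbol with
    equal probability; a real system would use an n-gram model instead.
    """
    p = 1.0 / (len(ALPHABET) + 1)  # letters plus the end-of-word symbol
    return (len(word) + 1) * math.log(p)  # characters plus end-of-word


def word_logprob(word):
    """In-vocabulary words use the word model; others use <unk> times spelling."""
    if word in WORD_LM and word != "<unk>":
        return math.log(WORD_LM[word])
    return math.log(WORD_LM["<unk>"]) + char_logprob(word)


def sentence_logprob(words):
    """Combined language model score of a word sequence (unigram assumption)."""
    return sum(word_logprob(w) for w in words)


if __name__ == "__main__":
    for hypothesis in (["the", "cat"], ["the", "zebra"]):
        print(hypothesis, round(sentence_logprob(hypothesis), 3))
```

In a recognizer, a score of this kind would be added to the log-likelihood from the optical model, and the hypothesis with the highest total would be chosen, which is the role the Bayes decision rule plays in the abstract.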
Pages: 8257-8261
Page count: 5