OPEN VOCABULARY HANDWRITING RECOGNITION USING COMBINED WORD-LEVEL AND CHARACTER-LEVEL LANGUAGE MODELS

Cited by: 0
Authors
Kozielski, Michal [1 ]
Rybach, David [1 ]
Hahn, Stefan [1 ]
Schlueter, Ralf [1 ]
Ney, Hermann [1 ]
Affiliations
[1] Rhein Westfal TH Aachen, Dept Comp Sci, Aachen, Germany
Keywords
open vocabulary recognition; handwriting recognition; character-based language models; NORMALIZATION; COMBINATION;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In this paper, we present a unified search strategy for open vocabulary handwriting recognition using weighted finite state transducers. In addition to a standard word-level language model, we introduce a separate n-gram character-level language model for out-of-vocabulary word detection and recognition. The probabilities assigned by these two models are combined into one Bayes decision rule. We evaluate the proposed method on the IAM database of English handwriting. Compared to the closed-vocabulary scenario and the best published result, the word error rate improves from 22.2% to 17.3%.
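The idea of combining a word-level and a character-level language model can be illustrated with a minimal sketch. This is not the paper's weighted finite state transducer search; it assumes one common hybrid scheme in which an in-vocabulary word is scored by the word-level model, while an out-of-vocabulary word is scored as the probability of an unknown-word token times the character-level probability of its spelling. The function name, the `<unk>`, `<s>` and `</s>` symbols, and all probabilities below are hypothetical and chosen only for illustration.

```python
import math


def word_log_prob(word, word_lm, char_lm, unk="<unk>"):
    """Log-probability of one word under a combined word/character model.

    Assumption (not taken from the paper): in-vocabulary words use the
    word-level unigram probability; out-of-vocabulary words use
    P(<unk>) multiplied by a character-bigram probability of the spelling.
    """
    if word in word_lm:
        return math.log(word_lm[word])
    # Spell out the unknown word with start/end markers.
    chars = ["<s>"] + list(word) + ["</s>"]
    log_p = math.log(word_lm[unk])
    for prev, cur in zip(chars, chars[1:]):
        # A real model would smooth unseen bigrams; this toy table
        # raises KeyError for character pairs it does not contain.
        log_p += math.log(char_lm[(prev, cur)])
    return log_p


# Toy, made-up probabilities purely for illustration.
WORD_LM = {"the": 0.5, "cat": 0.3, "<unk>": 0.2}
CHAR_LM = {("<s>", "a"): 0.5, ("a", "b"): 0.5, ("b", "</s>"): 0.5}

in_vocab = word_log_prob("cat", WORD_LM, CHAR_LM)
out_of_vocab = word_log_prob("ab", WORD_LM, CHAR_LM)
```

Here `in_vocab` is the log of the word-level probability alone, while `out_of_vocab` adds the character-level terms to the log-probability of the unknown-word token, so both kinds of hypotheses can be compared in a single decision rule.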
Pages: 8257-8261
Page count: 5
Related papers
50 in total
  • [31] Sign Pose-based Transformer for Word-level Sign Language Recognition
    Bohacek, Matyas
    Hruz, Marek
    [J]. 2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 182 - 191
  • [32] Character-Level Language Modeling with Recurrent Highway Hypernetworks
    Suarez, Joseph
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [33] Development of a Word-Level Classification and Vocabulary Learning (WCVL) System
    Baha, Kamal
    Shishido, Makoto
    [J]. 2022 JOINT 12TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 23RD INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS&ISIS), 2022,
  • [34] Character-Level Neural Language Modelling in the Clinical Domain
    Kreuzthaler, Markus
    Oleynik, Michel
    Schulz, Stefan
    [J]. DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 83 - 87
  • [35] Modeling word-level rate-of-speech variation in large vocabulary conversational speech recognition
    Zheng, J
    Franco, H
    Stolcke, A
    [J]. SPEECH COMMUNICATION, 2003, 41 (2-3) : 273 - 285
  • [36] Knowledge sources for word-level translation models
    Koehn, P
    Knight, K
    [J]. PROCEEDINGS OF THE 2001 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2001, : 27 - 35
  • [37] HDP-CNN: Highway deep pyramid convolution neural network combining word-level and character-level representations for phishing website detection
    Zheng, Faan
    Yan, Qiao
    Leung, Victor C. M.
    Yu, F. Richard
    Ming, Zhong
    [J]. COMPUTERS & SECURITY, 2022, 114
  • [38] Word-level Sign Language Recognition Using Linguistic Adaptation of 77 GHz FMCW Radar Data
    Rahman, M. Mahbubur
    Mdrafi, Robiulhossain
    Gurbuz, Ali C.
    Malaia, Evie
    Crawford, Chris
    Griffin, Darrin
    Gurbuz, Sevgi Z.
    [J]. 2021 IEEE RADAR CONFERENCE (RADARCONF21): RADAR ON THE MOVE, 2021,
  • [39] Experiments in Character-level Neural Network Models for Punctuation
    Gale, William
    Parthasarathy, Sarangarajan
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2794 - 2798
  • [40] A Multilingual and Multidomain Study on Dialog Act Recognition Using Character-Level Tokenization
    Ribeiro, Eugenio
    Ribeiro, Ricardo
    de Matos, David Martins
    [J]. INFORMATION, 2019, 10 (03)