A comparison of character n-grams and dictionaries used for script recognition

Authors
Brakensiek, A [1]
Rigoll, G [1]
Affiliations
[1] Univ Duisburg Gesamthsch, Fac Elect Engn, Dept Comp Sci, D-47057 Duisburg, Germany
Keywords
DOI
10.1109/ICDAR.2001.953791
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes an off-line script recognition system that uses a language model consisting of backoff character n-grams. The performance of this open-vocabulary recognition is compared with that of closed dictionaries. The system is based on Hidden Markov Models (HMMs) with a hybrid modeling technique that depends on a neural vector quantizer. The recognition results refer to the SEDAL database of degraded English documents, such as photocopies or faxes, and to a writer-dependent database of cursive German handwriting samples. The resulting character recognition system yields significantly better results for an unlimited vocabulary when language models are used.
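To make the idea of a backoff character n-gram model concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not say which backoff scheme the paper uses, so this sketch assumes a simple "stupid backoff" style score (relative frequency with a fixed penalty per backoff step), which is not a normalized probability. The function names `train_char_ngrams` and `backoff_score` and the penalty `alpha=0.4` are hypothetical choices made for illustration only.

```python
from collections import Counter


def train_char_ngrams(text, max_order=3):
    """Count every character n-gram of order 1..max_order in `text`."""
    counts = Counter()
    for n in range(1, max_order + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def backoff_score(counts, total_chars, history, char, alpha=0.4):
    """Score `char` after `history`, backing off to shorter histories.

    Assumption: a fixed penalty `alpha` is applied for each backoff step
    (stupid-backoff style), so the result is a score, not a probability.
    """
    penalty = 1.0
    while history:
        ngram = history + char
        if counts[ngram] > 0:
            # The n-gram was seen: use its relative frequency.
            return penalty * counts[ngram] / counts[history]
        # Unseen n-gram: drop the oldest character and penalize.
        history = history[1:]
        penalty *= alpha
    # No history left: fall back to the unigram relative frequency.
    return penalty * counts[char] / total_chars


text = "abab"
counts = train_char_ngrams(text, max_order=2)
seen = backoff_score(counts, len(text), "a", "b")    # "ab" occurs in the text
unseen = backoff_score(counts, len(text), "b", "b")  # "bb" does not, so back off
```

Because every character sequence receives a nonzero score as long as its characters were seen in training, such a model does not restrict recognition to a fixed word list, which is the open-vocabulary property the abstract contrasts with closed dictionaries.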
Pages: 241-245
Page count: 3