OCRSpell: An interactive spelling correction system for OCR errors in text

Cited by: 31
Authors
Taghva K. [1]
Stofsky E. [1]
Affiliation
[1] Information Science Research Institute, University of Nevada, Las Vegas, Las Vegas
Keywords
Error correction; Information retrieval; OCR-Spell checkers; Scanning
DOI
10.1007/PL00013558
Abstract
In this paper, we describe a spelling correction system designed specifically for OCR-generated text that selects candidate words through the use of information gathered from multiple knowledge sources. This system for text correction is based on static and dynamic device mappings, approximate string matching, and n-gram analysis. Our statistically based, Bayesian system incorporates a learning feature that collects confusion information at the collection and document levels. An evaluation of the new system is presented as well. © 2001 Springer-Verlag Berlin Heidelberg.
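The abstract names device mappings and a statistically based, Bayesian ranking without giving details. As a rough illustration only, the sketch below ranks lexicon candidates for a misrecognized token by multiplying a mapping probability by a word prior. It is not the OCRSpell algorithm: the tables `MAPPINGS` and `WORD_FREQ`, their values, and the functions `candidates` and `rank` are all hypothetical, and the approximate string matching, n-gram analysis, and learning components mentioned in the abstract are left out.

```python
# Hypothetical static device mappings: (OCR output, intended text) -> P(confusion).
# The values are made up for illustration and are not taken from the paper.
MAPPINGS = {
    ("m", "rn"): 0.05,   # OCR printed "m" where the source had "rn"
    ("b", "h"): 0.03,
    ("b", "o"): 0.001,
    ("1", "l"): 0.04,
}

# Hypothetical lexicon with word counts, used as a prior over intended words.
WORD_FREQ = {"modern": 50, "the": 500, "toe": 5, "clear": 40}


def candidates(token):
    """Yield (word, mapping_prob) for lexicon words reachable by one mapping."""
    for (ocr_sub, true_sub), prob in MAPPINGS.items():
        start = token.find(ocr_sub)
        while start != -1:
            cand = token[:start] + true_sub + token[start + len(ocr_sub):]
            if cand in WORD_FREQ:
                yield cand, prob
            start = token.find(ocr_sub, start + 1)


def rank(token):
    """Rank candidates by P(word) * P(mapping), highest score first."""
    total = sum(WORD_FREQ.values())
    scored = {}
    for word, prob in candidates(token):
        score = prob * WORD_FREQ[word] / total
        scored[word] = max(score, scored.get(word, 0.0))
    return sorted(scored, key=scored.get, reverse=True)


if __name__ == "__main__":
    print(rank("modem"))
    print(rank("tbe"))
```

In an interactive setting, a ranked list of this kind would be shown to the user as suggestions. A dynamic mapping table could in principle be updated from the corrections the user accepts, though how the paper does this is not stated in the abstract.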
Pages: 125-137
Page count: 12