A statistical approach to the generation of a database for evaluating OCR software

被引:0
|
作者
Brundick F.S. [1 ]
Brodeen A.E.M. [1 ]
Taylor M.S. [2 ]
机构
[1] US Army Research Laboratory, ATTN: AMSRL-CI-CT, Aberdeen Proving Ground
[2] University of Maryland, Maryland
关键词
Bootstrap; OCR evaluation; Statistics M.S. Taylor presently at OAO Corporation;
D O I
10.1007/s100320200067
中图分类号
学科分类号
摘要
In this paper we consider a statistical approach to augment a limited database of groundtruth documents for use in evaluation of optical character recognition software. A modified moving-blocks bootstrap procedure is used to construct surrogate documents for this purpose which prove to serve effectively and, in some regards, indistinguishably from groundtruth. The proposed method is validated through a rigorous statistical procedure. © 2002 Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:170 / 176
页数:6
相关论文
共 50 条
  • [1] EVALUATING DATABASE SOFTWARE FOR PCS
    HOHNER, G
    MACHINE DESIGN, 1984, 56 (20) : 129 - 132
  • [2] Semi-Automated OCR Database Generation for Nabataean Scripts
    Ul-Hasan, Adnan
    Bukhari, Syed Sagib
    Rashid, Sheikh Faisal
    Shafait, Faisal
    Breuel, Thomas M.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1667 - 1670
  • [3] Statistical database modeling for privacy preserving database generation
    Wu, XT
    Wang, YG
    Zheng, YL
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, 3488 : 382 - 390
  • [4] Evaluating the Next Generation of Multimedia Software
    Adams, Ray
    NEW DIRECTIONS IN INTELLIGENT INTERACTIVE MULTIMEDIA, 2008, 142 : 605 - 614
  • [5] A Chinese OCR spelling check approach based on statistical language models
    Li, Z
    Bao, T
    Zhu, XY
    Wang, CH
    Naoi, SS
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 4727 - 4732
  • [6] The constraint database approach to software verification
    Revesz, Peter
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, PROCEEDINGS, 2007, 4349 : 329 - 345
  • [7] Statistical and fuzzy approach for database security
    Lu, Gang
    Yi, Junkai
    Lue, Kevin
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 670 - +
  • [8] OCR and other valuable software
    Abelson, MN
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 1998, 113 (01) : 118 - 120
  • [9] OCR for printed Kannada text to Machine editable format using Database approach
    Sagar, B. M.
    Shobha, G.
    Ramakanth, P. Kumar
    PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON AUTOMATION AND INFORMATION, 2008, : 322 - +
  • [10] An engineering approach to evaluating simulation software
    Nowak, A
    DIE CASTING ENGINEER, 2000, 44 (01): : 28 - +