Open Source Historical OCR: The OCRopodium Project

Authors
Bryant, Michael [1 ]
Blanke, Tobias [1 ]
Hedges, Mark [1 ]
Palmer, Richard [1 ]
Affiliations
[1] Kings Coll London, Ctr E Res, London, England
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper we present some initial results of the OCRopodium project, which aims to build a scalable workflow for OCR of historical collections. Large-scale digitisation projects dealing with text-based historical material face challenges that commercial software does not cater for well. Open source tools allow for better customisation to match these requirements, particularly with regard to character model training and per-project language modelling.
Pages: 522-525
Number of pages: 4