Semi-automated workflow for recognition of printed documents with heterogeneous content

被引:0
|
作者
Colesnicov, Alexandru [1 ]
Malahov, Ludmila [1 ]
Cojocaru, Svetlana [1 ]
Burtseva, Lyudmila [1 ]
机构
[1] Vladimir Andrunachievici Inst Math & Comp Sci, 5 Acad Str, MD-2028 Kishinev, Moldova
关键词
platform for heterogeneous document recognition; page layout analysis; non-textual content recognition;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The paper discusses problems of heterogeneous texts digitization. The archives of scanned printed documents grow dramatically by results of projects concerning cultural heritage preserving. Manual annotations of scanned document images and per page screen reading make the usage of these archives difficult and, sometimes, impossible. Existing document processing systems cannot automatically display content correctly due to the presence of heterogeneous content. We proposed a Web platform to maximize the support of semi-automated work of all used tools for recognition of heterogeneous documents. Maximizing support means both creating the convenient "single window" access to all tools, and reducing the manual part of the process as much as possible. For implementation, the convergent technology is used, which assembles complex software systems from ready-made heterogeneous modules on a single platform.
引用
收藏
页码:223 / 240
页数:18
相关论文
共 50 条
  • [1] A Comprehensive Semi-Automated Incident Handling Workflow
    Hashemi, Sayed Hadi
    Babaeizadeh, Mohammad
    Nowruzi, Mohsen
    Jazi, Hossein Hadian
    Shahmoradi, Mohammad
    Samani, Elaheh Biglar Beigi
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 1065 - 1070
  • [2] Evaluation of a Semi-Automated Workflow for Fragment Growing
    Pirard, Bernard
    Ertl, Peter
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (01) : 180 - 193
  • [3] A Semi-Automated Workflow Solution for Data Set Publication
    Vannan, Suresh
    Beaty, Tammy W.
    Cook, Robert B.
    Wright, Daine M.
    Devarakonda, Ranjeet
    Wei, Yaxing
    Hook, Les A.
    McMurry, Benjamin F.
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2016, 5 (03):
  • [4] A Robust and Semi-Automated HIPSC Gene Editing Workflow
    Richards, Claire
    Manos, Philip D.
    Taylor, Ian
    MOLECULAR THERAPY, 2021, 29 (04) : 375 - 375
  • [5] Semi-automated recognition of protozoa by image analysis
    Amaral, AL
    Baptiste, C
    Pons, MN
    Nicolau, A
    Lima, N
    Ferreira, EC
    Mota, M
    Vivier, H
    BIOTECHNOLOGY TECHNIQUES, 1999, 13 (02) : 111 - 118
  • [6] The SALIX Method: A semi-automated workflow for herbarium specimen digitization
    Barber, Anne
    Lafferty, Daryl
    Landrum, Leslie R.
    TAXON, 2013, 62 (03) : 581 - 590
  • [7] Semi-automated contour recognition using DICOMautomaton
    Clark, H.
    Wu, J.
    Moiseenko, V.
    Lee, R.
    Gill, B.
    Duzenli, C.
    Thomas, S.
    XVII INTERNATIONAL CONFERENCE ON THE USE OF COMPUTERS IN RADIATION THERAPY (ICCR 2013), 2014, 489
  • [8] A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences
    Ammar, Ammar
    Bonaretti, Serena
    Winckers, Laurent
    Quik, Joris
    Bakker, Martine
    Maier, Dieter
    Lynch, Iseult
    van Rijn, Jeaphianne
    Willighagen, Egon
    NANOMATERIALS, 2020, 10 (10) : 1 - 14
  • [9] Semi-automated content zoning of spam emails
    Brucks, Claudine
    Hilker, Michael
    Schommer, Christoph
    Wagner, Cynthia
    Weires, Ralph
    WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2008, 8 : 35 - 44
  • [10] A semi-automated, KNIME-based workflow for biofilm assays
    Leinweber, Katrin
    Mueller, Silke
    Kroth, Peter G.
    BMC MICROBIOLOGY, 2016, 16