Semi-automated workflow for recognition of printed documents with heterogeneous content

被引:0
|
作者
Colesnicov, Alexandru [1 ]
Malahov, Ludmila [1 ]
Cojocaru, Svetlana [1 ]
Burtseva, Lyudmila [1 ]
机构
[1] Vladimir Andrunachievici Inst Math & Comp Sci, 5 Acad Str, MD-2028 Kishinev, Moldova
关键词
platform for heterogeneous document recognition; page layout analysis; non-textual content recognition;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The paper discusses problems of heterogeneous texts digitization. The archives of scanned printed documents grow dramatically by results of projects concerning cultural heritage preserving. Manual annotations of scanned document images and per page screen reading make the usage of these archives difficult and, sometimes, impossible. Existing document processing systems cannot automatically display content correctly due to the presence of heterogeneous content. We proposed a Web platform to maximize the support of semi-automated work of all used tools for recognition of heterogeneous documents. Maximizing support means both creating the convenient "single window" access to all tools, and reducing the manual part of the process as much as possible. For implementation, the convergent technology is used, which assembles complex software systems from ready-made heterogeneous modules on a single platform.
引用
收藏
页码:223 / 240
页数:18
相关论文
共 50 条
  • [31] A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking
    Korybski, Tomasz
    Davitti, Elena
    Orasan, Constantin
    Braun, Sabine
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4405 - 4413
  • [32] A new semi-automated workflow for chemical data retrieval and quality checking for modeling applications
    Domenico Gadaleta
    Anna Lombardo
    Cosimo Toma
    Emilio Benfenati
    Journal of Cheminformatics, 10
  • [33] A semi-automated workflow solution for multimodal neuroimaging: application to patients with traumatic brain injury
    Wong K.-P.
    Bergsneider M.
    Glenn T.C.
    Kepe V.
    Barrio J.R.
    Hovda D.A.
    Vespa P.M.
    Huang S.-C.
    Brain Informatics, 2016, 3 (1) : 1 - 15
  • [34] A Semi-Automated Workflow for Brain Slice Histology Alignment, Registration, and Cell Quantification (SHARCQ)
    Lauridsen, Kristoffer
    Ly, Annie
    Prevost, Emily D.
    McNulty, Connor
    McGovern, Dillon J.
    Tay, Jian Wei
    Dragavon, Joseph
    Root, David H.
    ENEURO, 2022, 9 (02)
  • [35] Construction of biological networks from unstructured information based on a semi-automated curation workflow
    Szostak, Justyna
    Ansari, Sam
    Madan, Sumit
    Fluck, Juliane
    Talikka, Marja
    Iskandar, Anita
    De Leon, Hector
    Hofmann-Apitius, Martin
    Peitsch, Manuel C.
    Hoeng, Julia
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2015,
  • [36] Flowct: A Semi-Automated Workflow for Deconvolution of Immunophenotypic Data and Objective Reporting on Large Datasets
    Botta, Cirino
    Maia, Catarina
    Perez Ruiz, Cristina
    Manrique, Irene
    Jose Garces, Juan
    Rodriguez, Sara
    Burgos, Leire
    Merino, Juana
    Lopez Lopez, Aitziber
    Puig, Noemi
    Teresa Cedena, Maria
    Paiva, Artur
    Rossi, Marco
    Tagliaferri, Pierosandro
    Tassone, Piefrancesco
    Gentile, Massimo
    Borrello, Ivan M.
    Rosinol Dachs, Laura
    Mateos, Maria-Victoria
    Lahuerta, Juan-Jose
    Blade, Joan
    San-Miguel, Jesus
    Paiva, Bruno
    BLOOD, 2019, 134
  • [37] iBio-GATS-A Semi-Automated Workflow for Structural Modelling of Insect Odorant Receptors
    Thanu, Vaanathi Chidambara
    Jabeen, Amara
    Ranganathan, Shoba
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (05)
  • [38] A new semi-automated workflow for chemical data retrieval and quality checking for modeling applications
    Gadaleta, Domenico
    Lombardo, Anna
    Toma, Cosimo
    Benfenati, Emilio
    JOURNAL OF CHEMINFORMATICS, 2018, 10
  • [39] Content Enrichment for Semi-Automated Production of Added Value Content for Free Press and Web
    Bettega, S. M.
    Fioravanti, F.
    Gigli, L.
    Grassi, G.
    Spinu, M. B.
    FOURTH INTERNATIONAL CONFERENCE ON AUTOMATED SOLUTIONS FOR CROSS MEDIA CONTENT AND MULTI-CHANNEL DISTRIBUTION, PROCEEDINGS, 2008, : 57 - 62
  • [40] Semi-automated relevance feedback for distributed content based image retrieval
    Lee, I
    Guan, L
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1871 - 1874