Automatic generation of structured hyperdocuments from document images

被引:4
|
作者
Lee, JY
Park, JS
Byun, H
Moon, J
Lee, SW [1 ]
机构
[1] Korea Univ, Ctr Artificial Vis Res, Dept Comp Sci & Engn, Seongbuk Ku, Seoul 136701, South Korea
[2] Yonsei Univ, Dept Comp Sci, Seodaemoon Ku, Seoul 120749, South Korea
[3] Korea Univ, Dept Elect & Informat Engn, Chungnam 339800, South Korea
关键词
structured hyperdocument; multi-column document; document conversion; document image understanding; logical structure analysis;
D O I
10.1016/S0031-3203(01)00026-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As sharing documents through the World Wide Web has been recently and constantly increasing, the need for creating hyperdocuments to make them accessible and retrievable via the internet, in formats such as HTML and SGML/XML, has also been rapidly rising. Nevertheless, only a few works have been done on the conversion of paper documents into hyperdocuments. Moreover, most of these studies have concentrated on the direct conversion of single-column document images that include only text and image objects. In this paper, we propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents page based on the logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that, by using the proposed methods, their corresponding HTML documents can be generated in the same visual layout as that of the document images, and their structured table of contents page can be also produced with the hierarchically ordered section titles hyperlinked to the contents. (C) 2001 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:485 / 503
页数:19
相关论文
共 50 条
  • [41] Automatic script identification from document images using cluster-based templates
    Hochberg, J
    Kelly, P
    Thomas, T
    Kerns, L
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) : 176 - 181
  • [42] Page Object Detection from PDF Document Images by Deep Structured Prediction and Supervised Clustering
    Li, Xiao-Hui
    Yin, Fei
    Liu, Cheng-Lin
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3627 - 3632
  • [43] TasvirEt: A Benchmark Dataset for Automatic Turkish Description Generation from Images
    Unal, Mesut Erhan
    Citamak, Begum
    Yagcioglu, Semih
    Erdem, Aykut
    Erdem, Erkut
    Cinbis, Nazli Ikizler
    Cakici, Ruket
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1977 - 1980
  • [44] Automatic Generation of 3D Animations from Text and Images
    Cannavo, Alberto
    Gatteschi, Valentina
    Macis, Luca
    Lamberti, Fabrizio
    EXTENDED REALITY, XR SALENTO 2022, PT I, 2022, 13445 : 77 - 91
  • [45] Construction of an automatic document generation model and its application
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    Jisuanji Jicheng Zhizao Xitong, 2008, 7 (1297-1305): : 1297 - 1305
  • [46] Automatic Schema Generation for Document-Oriented Systems
    Gomez, Paola
    Casallas, Rubby
    Roncancio, Claudia
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT I, 2020, 12391 : 152 - 163
  • [47] Automatic link generation and repair mechanism for document management
    Shimada, T
    Futakata, A
    THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL. II: DIGITAL DOCUMENTS TRACK, 1998, : 226 - 235
  • [48] An integrated approach for automatic semantic structure extraction in document images
    Berardi, M
    Lapi, M
    Malerba, D
    DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 179 - 190
  • [49] AUTOMATIC TEXT EXTRACTION, REMOVAL AND INPAINTING OF COMPLEX DOCUMENT IMAGES
    Chen, Yen-Lin
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (1A): : 303 - 327
  • [50] Automatic dewarping of camera-captured comic document images
    Garai, Arpan
    Dutta, Arpita
    Biswas, Samit
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 1537 - 1552