A hierarchical representation of form documents for identification and retrieval

Cited by: 25
Authors
Pınar Duygulu
Volkan Atalay
Affiliation
Department of Computer Engineering, Middle East Technical University, Ankara, 06531 Turkey; e-mail: {duygulu,volkan}@ceng.metu.edu.tr
Keywords
Form document processing; Logical layout extraction; Retrieval; Data processing
DOI
10.1007/s100320100077
Abstract
In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the structure of a form by using lines and the XY-tree approach. The approach is top-down, and no domain knowledge such as the preprinted data or filled-in data is used. Geometrical modifications and slight variations are handled by this representation. Logically identical forms are associated with the same or similar hierarchical structures. Identification and retrieval of similar forms are performed by computing the edit distances between the generated trees.
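
To make the last step of the abstract concrete, the following is a minimal sketch of an edit distance between two ordered trees. It is not the authors' algorithm: it assumes a simplified top-down variant in which a node can be relabeled at cost 1 and whole subtrees can be inserted or deleted at a cost equal to their node count. The `Node` class, the labels "H", "V" and "B" (horizontal cut, vertical cut, leaf block), and the function names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    # Hypothetical labels: "H"/"V" for a horizontal/vertical cut, "B" for a leaf block.
    label: str
    children: List["Node"] = field(default_factory=list)


def size(t: Node) -> int:
    """Number of nodes in the subtree rooted at t."""
    return 1 + sum(size(c) for c in t.children)


def tree_distance(a: Node, b: Node) -> int:
    """Top-down edit distance between ordered trees (illustrative assumption).

    Relabeling a node costs 1; inserting or deleting a whole subtree
    costs its node count. Child sequences are aligned with a standard
    string-edit dynamic program.
    """
    root_cost = 0 if a.label == b.label else 1
    m, n = len(a.children), len(b.children)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + size(a.children[i - 1])   # delete subtree of a
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + size(b.children[j - 1])   # insert subtree of b
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            ca, cb = a.children[i - 1], b.children[j - 1]
            d[i][j] = min(
                d[i - 1][j] + size(ca),                   # delete ca
                d[i][j - 1] + size(cb),                   # insert cb
                d[i - 1][j - 1] + tree_distance(ca, cb),  # match ca with cb
            )
    return root_cost + d[m][n]


# Two toy form trees that differ by one extra block in the lower region.
form_a = Node("H", [Node("B"), Node("V", [Node("B"), Node("B")])])
form_b = Node("H", [Node("B"), Node("V", [Node("B"), Node("B"), Node("B")])])
print(tree_distance(form_a, form_b))
```

Under these assumed costs, retrieval could rank stored form trees by their distance to the tree of a query form, with the smallest distance taken as the best match.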
Pages: 17 - 27
Number of pages: 10