A hierarchical representation of form documents for identification and retrieval

被引:25
|
作者
Pınar Duygulu
Volkan Atalay
机构
[1] Department of Computer Engineering,
[2] Middle East Technical University,undefined
[3] Ankara,undefined
[4] 06531 Turkey; e-mail: {duygulu,undefined
[5] volkan}@ceng.metu.edu.tr ,undefined
关键词
Keywords: Form document processing – Logical layout extraction – Retrieval – Data processing;
D O I
10.1007/s100320100077
中图分类号
学科分类号
摘要
In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the structure of a form by using lines and the XY-tree approach. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Geometrical modifications and slight variations are handled by this representation. Logically identical forms are associated to the same or similar hierarchical structure. Identification and the retrieval of similar forms are performed by computing the edit distances between the generated trees.
引用
收藏
页码:17 / 27
页数:10
相关论文
共 50 条
  • [21] A conceptual representation of documents and queries for information retrieval systems by using light ontologies
    Dragoni, Mauro
    Pereira, Celia da Costa
    Tettamanzi, Andrea G. B.
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 10376 - 10388
  • [22] Content-Based Retrieval of Aurora Images Based on the Hierarchical Representation
    Kim, Soo K.
    Ranganath, Heggere S.
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT II, 2010, 6475 : 249 - +
  • [23] Writer Identification and Writer Retrieval Using Vision Transformer for Forensic Documents
    Koepf, Michael
    Kleber, Florian
    Sablatnig, Robert
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 352 - 366
  • [24] A Complete Path Representation Method with a Modified Inverted Index for Efficient Retrieval of XML Documents
    Chang, Hsu-Kuang
    Hung, King-Chu
    Jou, I-Chang
    WSEAS Transactions on Computers, 2011, 10 (10): : 321 - 331
  • [25] ADAPTIVE INFORMATION-RETRIEVAL - USING A CONNECTIONIST REPRESENTATION TO RETRIEVE AND LEARN ABOUT DOCUMENTS
    BELEW, RK
    PROCEEDINGS OF THE TWELFTH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1989, 23 : 11 - 20
  • [26] Hierarchical histograms - A new representation scheme for image-based data retrieval
    Kumar, S
    Seetharaman, G
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2000, : 69 - 72
  • [27] Deep Multigraph Hierarchical Enhanced Semantic Representation for Cross-Modal Retrieval
    Zhu, Lei
    Zhang, Chengyuan
    Song, Jiayu
    Zhang, Shichao
    Tian, Chunwei
    Zhu, Xinghui
    IEEE MULTIMEDIA, 2022, 29 (03) : 17 - 26
  • [28] A geometric reasoning approach to hierarchical representation for B-rep model retrieval
    Li, Zhi
    Zhou, Xionghui
    Liu, Wei
    COMPUTER-AIDED DESIGN, 2015, 62 : 190 - 202
  • [29] Semantic retrieval and ranking of Semantic Web documents using free-form queries
    Spiliopoulos, Vassilis
    Kotis, Konstantinos
    Vouros, George A.
    International Journal of Metadata, Semantics and Ontologies, 2008, 3 (02) : 95 - 108
  • [30] DSRIM: A Deep Neural Information Retrieval Model Enhanced by a Knowledge Resource Driven Representation of Documents
    Gia-Hung Nguyen
    Soulier, Laure
    Tamine, Lynda
    Bricon-Souf, Nathalie
    ICTIR'17: PROCEEDINGS OF THE 2017 ACM SIGIR INTERNATIONAL CONFERENCE THEORY OF INFORMATION RETRIEVAL, 2017, : 19 - 26