Document image retrieval through word shape coding

被引:48
|
作者
Lu, Shijian [1 ]
Li, Linlin [2 ]
Tan, Chew Lim [2 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 119613, Singapore
[2] Natl Univ Singapore, Sch Comp, Dept Comp Sci, Singapore 117543, Singapore
关键词
document image retrieval; document image analysis; word shape coding;
D O I
10.1109/TPAMI.2008.89
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a document retrieval technique that is capable of searching document images without optical character recognition (OCR). The proposed technique retrieves document images by a new word shape coding scheme, which captures the document content through annotating each word image by a word shape code. In particular, we annotate word images by using a set of topological shape features including character ascenders/descenders, character holes, and character water reservoirs. With the annotated word shape codes, document images can be retrieved by either query keywords or a query document image. Experimental results show that the proposed document image retrieval technique is fast, efficient, and tolerant to various types of document degradation.
引用
收藏
页码:1913 / 1918
页数:6
相关论文
共 50 条
  • [41] Enhancing Document Image Retrieval in Education: Leveraging Ensemble-Based Document Image Retrieval Systems for Improved Precision
    Alzoubi, Yehia Ibrahim
    Topcu, Ahmet Ercan
    Ozdemir, Erdem
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [42] Using character shape coding for information retrieval
    Smeaton, AF
    Spitz, AL
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 974 - 978
  • [43] AGE AND INTELLIGENCE DIFFERENCES IN CODING AND RETRIEVAL OF WORD LISTS
    CRAIK, FIM
    MASANI, PA
    BRITISH JOURNAL OF PSYCHOLOGY, 1969, 60 : 315 - &
  • [44] A survey of document image word spotting techniques
    Giotis, Angelos P.
    Sfikas, Giorgos
    Gatos, Basilis
    Nikou, Christophoros
    PATTERN RECOGNITION, 2017, 68 : 310 - 332
  • [45] Impact of image organizations on multimedia document retrieval
    Haque, N
    Chowdhury, M
    Rahman, SM
    FOURTH ANNUAL ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2005, : 340 - 343
  • [46] Signature Detection and Matching for Document Image Retrieval
    Zhu, Guangyu
    Zheng, Yefeng
    Doermann, David
    Jaeger, Stefan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (11) : 2015 - 2031
  • [47] Structural similarity for document image classification and retrieval
    Kumar, Jayant
    Ye, Peng
    Doermann, David
    PATTERN RECOGNITION LETTERS, 2014, 43 : 119 - 126
  • [48] Attribute-based document image retrieval
    Melissa Cote
    Alexandra Branzan Albu
    International Journal on Document Analysis and Recognition (IJDAR), 2024, 27 : 57 - 71
  • [49] Evaluation of Gist Operator for Document Image Retrieval
    Alaei, Fahimeh
    Alaei, Alireza
    Pal, Umapada
    Blumenstein, Michael
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 369 - 374
  • [50] Document Retrieval Using SIFT Image Features
    Smith, Dan
    Harvey, Richard
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2011, 17 (01) : 3 - 15