Content-Based Document Image Retrieval Based on Document Modeling

被引:0
|
作者
Chwan-Yi Shiah
机构
[1] Fo Guang University,Department of Applied Informatics
关键词
Document modeling; Language model; Document image retrieval; Multinomial distribution; -gram model;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, language models have gained importance in the field of information retrieval. In this paper, we propose a generic language model to improve a content-based document retrieval system. In this approach, character images are extracted, clustered, and analyzed to form high-level semantic terms using a statistical document model. This model simulates the long-term relationships between characters. Documents are then indexed according to these terms, and a query document is proposed to retrieve the relevant documents. The query document can be a single keyword, or it can be synthesized from a text string. The aim is to generate a semantic representation from low-level image pixels through pattern matching and document modeling. The conventional approach of generating semantic terms in document retrieval includes every possible symbol sequence in the feature representation. Comparatively, our approach can considerably reduce the dimensions of the feature space while producing retrieval results comparable to those of the conventional and state-of-the-art approaches.
引用
收藏
页码:287 / 306
页数:19
相关论文
共 50 条
  • [21] Content-based multi-document summarizer
    Raman, S
    Sharma, R
    Raj, PCR
    Saravanan, M
    Murty, VS
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMPUTER SCIENCE II, 2002, : 139 - 143
  • [22] Semantic modeling of natural scenes for content-based image retrieval
    Vogel, Julia
    Schiele, Bernt
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 72 (02) : 133 - 157
  • [23] Improved stochastic modeling of shapes for content-based image retrieval
    Müller, S
    Rigoll, G
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES (CBAIVL'99) - PROCEEDINGS, 1999, : 23 - 27
  • [24] Semantic Modeling of Natural Scenes for Content-Based Image Retrieval
    Julia Vogel
    Bernt Schiele
    International Journal of Computer Vision, 2007, 72 : 133 - 157
  • [25] Content-based Image Retrieval for Medical Image
    Zheng, Kaimei
    2015 11TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2015, : 219 - 222
  • [26] HIERARCHICAL CONTENT-BASED IMAGE RETRIEVAL
    俞勇
    施鹏飞
    JournalofShanghaiJiaotongUniversity, 1999, (01) : 9 - 13
  • [27] Survey on content-based image retrieval
    Liu Huailiang
    Wavelet Active Media Technology and Information Processing, Vol 1 and 2, 2006, : 930 - 935
  • [28] Content-Based Image Retrieval in Astronomy
    A. Csillaghy
    H. Hinterberger
    A.O. Benz
    Information Retrieval, 2000, 3 : 229 - 241
  • [29] CONTENT-BASED VESSEL IMAGE RETRIEVAL
    Mukherjee, Satabdi
    Cohen, Samuel
    Gertner, Izidor
    AUTOMATIC TARGET RECOGNITION XXVI, 2016, 9844
  • [30] Content-based image retrieval methods
    N. S. Vassilieva
    Programming and Computer Software, 2009, 35 : 158 - 180