Content-Based Document Image Retrieval Based on Document Modeling

被引:0
|
作者
Chwan-Yi Shiah
机构
[1] Fo Guang University,Department of Applied Informatics
关键词
Document modeling; Language model; Document image retrieval; Multinomial distribution; -gram model;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, language models have gained importance in the field of information retrieval. In this paper, we propose a generic language model to improve a content-based document retrieval system. In this approach, character images are extracted, clustered, and analyzed to form high-level semantic terms using a statistical document model. This model simulates the long-term relationships between characters. Documents are then indexed according to these terms, and a query document is proposed to retrieve the relevant documents. The query document can be a single keyword, or it can be synthesized from a text string. The aim is to generate a semantic representation from low-level image pixels through pattern matching and document modeling. The conventional approach of generating semantic terms in document retrieval includes every possible symbol sequence in the feature representation. Comparatively, our approach can considerably reduce the dimensions of the feature space while producing retrieval results comparable to those of the conventional and state-of-the-art approaches.
引用
下载
收藏
页码:287 / 306
页数:19
相关论文
共 50 条
  • [41] Content-based image retrieval speedup
    Fadaei, Sadegh
    Rashno, Abdolreza
    Rashno, Elyas
    2019 5TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS 2019), 2019,
  • [42] Localized content-based image retrieval
    Rahmani, Rouhollah
    Goldman, Sally A.
    Zhang, Hui
    Cholleti, Sharath R.
    Fritts, Jason E.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (11) : 1902 - 1912
  • [43] Image coding for content-based retrieval
    Swanson, MD
    Hosur, S
    Tewfik, AH
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '96, 1996, 2727 : 4 - 15
  • [44] Content-based image retrieval in astronomy
    Csillaghy, A
    Hinterberger, H
    Benz, AO
    INFORMATION RETRIEVAL, 2000, 3 (03): : 229 - 241
  • [45] Content-based image retrieval methods
    Vassilieva, N. S.
    PROGRAMMING AND COMPUTER SOFTWARE, 2009, 35 (03) : 158 - 180
  • [46] A content-based image retrieval system
    Huang, CL
    Huang, DH
    IMAGE AND VISION COMPUTING, 1998, 16 (03) : 149 - 163
  • [47] Learning in content-based image retrieval
    Huang, TS
    Zhou, XS
    Nakazato, M
    Wu, Y
    Cohen, I
    2ND INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, PROCEEDINGS, 2002, : 155 - 162
  • [48] Gaps in content-based image retrieval
    Deserno, Thomas M.
    Antani, Sameer
    Long, Rodney
    MEDICAL IMAGING 2007: PACS AND IMAGING INFORMATICS, 2007, 6516
  • [49] Content-Based Relevance Estimation in Retrieval Settings with Ranking-Incentivized Document Manipulations
    Vasilisky, Ziv
    Kurland, Oren
    Tennenholtz, Moshe
    Raiber, Fiana
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 205 - 214
  • [50] Interactive Content-Based Document Retrieval Using Fuzzy Attributed Relational Graph Matching
    Chaieb, Ramzi
    Kalti, Karim
    Ben Amara, Najoua Essoukri
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 921 - 925