Unifying Content and Context Similarities of the Textual and Visual Information in an Image Clustering Framework

被引:0
|
作者
Tahayna, Bashar [1 ]
Alashmi, Saadat M. [1 ]
Belkhatir, Mohammed
Abbas, Khaled
Wang, Yandan [1 ]
机构
[1] Monash Univ, Clayton, Vic 3800, Australia
关键词
Clustering; Classification; Content-based; Similarity measures; bipitrate graphs;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Content-based image retrieval (CBIR) has been a challenging problem and its performance relies on the efficiency in modeling the underlying content and the similarity measure between the query and the retrieved images. Most existing metrics evaluate pairwise image similarity based only on image content, which is denoted as content similarity. However, other schemes utilize the annotations and the surrounding text to improve the retrieval results. In this study we refer to content as the visual and the textual information belonging to an image. We propose a representation of an image surrounding text in terms of concepts by utilizing an online knowledge source, e.g., Wikipedia, and propose a similarity metric that takes into account the new conceptual representation of the text. Moreover, we combine the content information with the contexts of an image to improve the retrieval percentage. The context of an image is built by constructing a vector with each dimension representing the content (visual and textual/conceptual) similarity between the image and any image in the collection. The context similarity between two images is obtained by computing the similarity between the corresponding context vectors using the vector similarity functions. Then, we fuse the similarity measures into a unified measure to evaluate the overall image similarity. Experimental results demonstrate that the new text representation and the use of the context similarity can significantly improve the retrieval performance.
引用
收藏
页码:515 / 526
页数:12
相关论文
共 50 条
  • [1] Impact of visual information to image retrieval by content and context
    Moulin, Christophe
    Largeron, Christine
    Géry, Mathias
    [J]. CORIA 2010: Actes de la COnference en Recherche d'Information et Applications - Proceedings of the Conference on Information Retrieval and Applications, 2010, : 179 - 193
  • [2] Unifying textual and visual cues for content-based image retrieval on the World Wide Web
    Sclaroff, Stan
    Cascia, Marco La
    Sethi, Saratendu
    Taycher, Leonid
    [J]. Computer Vision and Image Understanding, 1999, 75 (01): : 86 - 98
  • [3] Unifying textual and visual cues for content-based image retrieval on the World Wide Web
    Sclaroff, S
    La Cascia, M
    Sethi, S
    Taycher, L
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 75 (1-2) : 86 - 98
  • [4] Combining Content and Context Similarities for Image Retrieval
    Wan, Xiaojun
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 749 - 754
  • [5] Integrating Visual and Textual Features for Web Image Clustering
    Xia, D. S.
    Xiang, Z. Q.
    Zou, Y. X.
    [J]. 2015 1ST IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2015, : 116 - 123
  • [6] Unifying Pictorial and Textual Features for Screen Content Image Quality Evaluation
    Chen, Yihua
    Liang, Xiaoping
    Yu, Mengzhu
    Tang, Zhenjun
    [J]. PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1099 - 1103
  • [7] Visual and textual information fusion using Kernel method for content based image retrieval
    Unar, Salahuddin
    Wang, Xingyuan
    Zhang, Chuan
    [J]. INFORMATION FUSION, 2018, 44 : 176 - 187
  • [8] Multi-view content-context information bottleneck for image clustering
    Hu, Shizhe
    Wang, Bo
    Lou, Zhengzheng
    Ye, Yangdong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [9] Semantic indexing of multimedia content using textual and visual information
    [J]. Amrane, A. (amrane@mail.cerist.dz), 1600, Inderscience Enterprises Ltd. (05): : 2 - 3
  • [10] Social Image Search exploiting Joint Visual-Textual information within a Fuzzy Hypergraph Framework
    Pliakos, Konstantinos
    Kotropoulos, Constantine
    [J]. 2014 IEEE 16TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2014,