Information Fusion for Combining Visual and Textual Image Retrieval in ImageCLEF@ICPR

被引:0
|
作者
Zhou, Xin [1 ]
Depeursinge, Adrien [1 ]
Mueller, Henning [1 ]
机构
[1] Univ Hosp Geneva, Geneva, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the ImageCLEF image retrieval competition multimodal image retrieval has been evaluated over the past seven years. For ICPR 2010 a contest was organized for the fusion of visual and textual retrieval as this was one task where most participants had problems. In this paper, classical approaches such as the maximum combinations (combMAX), the sum combinations (combSUM) and the multiplication of the sum and the number of non-zero scores (combMNZ) were employed and the trade-off between two fusion effects (chorus and dark horse effects) was studied based on the sum of n maxima. Various normalization strategies were tried out. The fusion algorithms are evaluated using the best four visual and textual runs of the ImageCLEF medical image retrieval task 2008 and 2009. The results show that fused runs outperform the best original runs and multi-modality fusion statistically outperforms single modality fusion. The logarithmic rank penalization shows to be the most stable normalization. The dark horse effect is in competition with the chorus effect and each of them can produce best fusion performance depending on the nature of the input data.
引用
收藏
页码:129 / 137
页数:9
相关论文
共 50 条
  • [1] The ImageCLEF Medical Retrieval Task at ICPR 2010-Information Fusion to Combine Visual and Textual Information
    Mueller, Henning
    Kalpathy-Cramer, Jayashree
    [J]. RECOGNIZING PATTERNS IN SIGNALS, SPEECH, IMAGES, AND VIDEOS, 2010, 6388 : 99 - +
  • [2] FIRE in ImageCLEF 2005: Combining content-based image retrieval with textual information retrieval
    Deselaers, Thomas
    Weyand, Tobias
    Keysers, Daniel
    Macherey, Wolfgang
    Ney, Hermann
    [J]. ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 652 - 661
  • [3] The University of Surrey Visual Concept Detection System at ImageCLEF@ICPR: Working Notes
    Tahir, M. A.
    Yan, F.
    Barnard, M.
    Awais, M.
    Mikolajczyk, K.
    Kittler, J.
    [J]. RECOGNIZING PATTERNS IN SIGNALS, SPEECH, IMAGES, AND VIDEOS, 2010, 6388 : 162 - 170
  • [4] Combining textual and visual features for image retrieval
    Martinez-Fernandez, J. L.
    Villena Roman, Julio
    Garcia-Serrano, Ana M.
    Gonzalez-Cristobal, Jose Carlos
    [J]. ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 680 - 691
  • [5] Visual and textual information fusion using Kernel method for content based image retrieval
    Unar, Salahuddin
    Wang, Xingyuan
    Zhang, Chuan
    [J]. INFORMATION FUSION, 2018, 44 : 176 - 187
  • [6] Improving Performance of Medical Images Retrieval by Combining Textual and Visual Information
    Diaz-Galiano, M. C.
    Martin-Valdivia, M. T.
    Montejo-Raez, A.
    Urena-Lopez, L. A.
    [J]. MICAI 2007: SIXTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, : 185 - 192
  • [7] Combining textual and visual features for cross-language medical image retrieval
    Cheng, Pei-Cheng
    Chien, Been-Chian
    Ke, Hao-Ren
    Yang, Wei-Pang
    [J]. ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 712 - 723
  • [8] An Integrated Approach for Medical Image Retrieval through Combining Textual and Visual Features
    Ye, Zheng
    Huang, Xiangji
    Hu, Qinmin
    Lin, Hongfei
    [J]. MULTILINGUAL INFORMATION ACCESS EVALUATION II: MULTIMEDIA EXPERIMENTS, PT II, 2010, 6242 : 195 - +
  • [9] Retrieval of multimedia objects by combining semantic information from visual and textual descriptors
    Sjoberg, Mats
    Laaksonen, Jorma
    Polla, Matti
    Honkela, Timo
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 75 - 83
  • [10] Integrating textual and visual information for cross-language image retrieval
    Lin, WC
    Chang, YC
    Chen, HH
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 454 - 466