Multi-modal browsing of images in Web documents

Cited by: 1
Authors
Chen, F [1]
Gargi, U [1]
Niles, L [1]
Schütze, H [1]
Affiliations
[1] Xerox Corp, Palo Alto Res Ctr, Palo Alto, CA 94304 USA
Source
Keywords
multi-modal information access; image/document browsing and retrieval; clustering; Web documents;
DOI
10.1117/12.335809
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we describe a system for performing browsing and retrieval on a collection of web images and associated text on an HTML page. Browsing is combined with retrieval to help a user locate interesting portions of the corpus, without the need to formulate a query well matched to the corpus. Multi-modal information, in the form of text surrounding an image and some simple image features, is used in this process. Using the system, a user progressively narrows a collection to a small number of elements of interest, similar to the Scatter/Gather system (1) developed for text browsing. We have extended the Scatter/Gather method to use multi-modal features. With the use of multiple features, some collection elements may have unknown or undefined values for some features; we present a method for incorporating these elements into the result set. This method also provides a way to handle the case when a search is narrowed to a part of the space near a boundary between two clusters. A number of examples illustrating our system are provided.
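
The abstract does not spell out how the multi-modal Scatter/Gather step works, so the following is only a minimal Python sketch of the general idea, not the authors' method. It assumes each element carries a `text` vector and a `color` vector, either of which may be `None` when the feature is undefined; it assumes similarity is the average cosine over the features both sides define; and it assumes elements that cannot be compared with any cluster seed are set aside in an `unknown` list that can be folded back into the result set. All names (`similarity`, `scatter`, `gather`, `text`, `color`) are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def similarity(item, seed):
    """Average cosine over features defined in both item and seed.

    Assumption: an undefined feature (None) is skipped rather than treated
    as zero. Returns None when no feature is shared.
    """
    scores = [cosine(item[f], seed[f])
              for f in item
              if item[f] is not None and seed.get(f) is not None]
    return sum(scores) / len(scores) if scores else None

def scatter(items, seeds):
    """Assign each item index to its most similar seed (one cluster per seed).

    Items that share no defined feature with any seed go to `unknown`.
    """
    clusters = [[] for _ in seeds]
    unknown = []
    for idx, item in enumerate(items):
        scored = [(similarity(item, s), k) for k, s in enumerate(seeds)]
        scored = [(s, k) for s, k in scored if s is not None]
        if not scored:
            unknown.append(idx)
            continue
        best = max(scored, key=lambda pair: pair[0])[1]
        clusters[best].append(idx)
    return clusters, unknown

def gather(clusters, chosen, unknown=()):
    """Merge the clusters the user selected, optionally adding unknown items."""
    merged = [idx for k in chosen for idx in clusters[k]]
    return sorted(set(merged) | set(unknown))

# Toy collection: text vectors have 3 dimensions, color vectors have 2.
items = [
    {"text": [1.0, 0.0, 0.0], "color": [1.0, 0.0]},
    {"text": [0.9, 0.1, 0.0], "color": None},   # color undefined
    {"text": [0.0, 1.0, 0.0], "color": [0.0, 1.0]},
    {"text": None, "color": [0.0, 1.0]},        # no surrounding text
    {"text": None, "color": None},              # nothing to compare
]
seeds = [items[0], items[2]]

clusters, unknown = scatter(items, seeds)
narrowed = gather(clusters, chosen=[1], unknown=unknown)
print(clusters, unknown, narrowed)
```

In a full browsing loop, the elements in `narrowed` would be scattered again with fresh seeds, which is how the collection shrinks step by step. Averaging only over shared features is one simple choice; weighting features differently or assigning borderline elements to more than one cluster would be equally plausible variants.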
Pages: 122-133
Page count: 12