Multi-modal browsing of images in Web documents

Cited by: 1

Authors:

Chen, F [1]

Gargi, U [1]

Niles, L [1]

Schütze, H [1]

Affiliations:

[1] Xerox Corp, Palo Alto Res Ctr, Palo Alto, CA 94304 USA

Source:

DOCUMENT RECOGNITION AND RETRIEVAL VI | 1999 / Vol. 3651

Keywords:

multi-modal information access; image/document browsing and retrieval; clustering; Web documents

DOI:

10.1117/12.335809

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we describe a system for browsing and retrieving a collection of Web images together with the text associated with them on their HTML pages. Browsing is combined with retrieval to help a user locate interesting portions of the corpus without having to formulate a query well matched to the corpus. Multi-modal information, in the form of the text surrounding an image and some simple image features, is used in this process. Using the system, a user progressively narrows a collection to a small number of elements of interest, as in the Scatter/Gather system (1) developed for text browsing. We have extended the Scatter/Gather method to use multi-modal features. When multiple features are used, some collection elements may have unknown or undefined values for some features; we present a method for incorporating these elements into the result set. This method also provides a way to handle the case in which a search is narrowed to a part of the space near a boundary between two clusters. A number of examples illustrating our system are provided.
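The narrowing loop described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the clustering algorithm, the feature representations, or how elements with undefined features are merged into the result set. The sketch below assumes plain k-means on one feature at a time, two-dimensional toy vectors for a hypothetical "text" feature and a hypothetical "color" feature, and a simple policy of carrying elements with an undefined feature value along into the gathered subset. The names `scatter`, `gather`, and `corpus` are illustrative only.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Vector = Sequence[float]
# Each item maps a feature name to a vector, or to None when the value is undefined.
Item = Dict[str, Optional[Vector]]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(vectors: List[Vector]) -> List[float]:
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def scatter(items: Dict[str, Item], feature: str, k: int,
            iters: int = 10) -> Tuple[List[List[str]], List[str]]:
    """Cluster item ids on one feature with plain k-means (an assumption).

    Returns the clusters plus the ids whose value for this feature is undefined.
    """
    defined = [i for i in items if items[i].get(feature) is not None]
    undefined = [i for i in items if items[i].get(feature) is None]
    if not defined:
        return [], undefined
    k = min(k, len(defined))
    # Deterministic initialisation: the first k defined items seed the centroids.
    centroids = [list(items[i][feature]) for i in defined[:k]]
    clusters: List[List[str]] = [[] for _ in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for i in defined:
            nearest = min(range(k),
                          key=lambda c: sq_dist(items[i][feature], centroids[c]))
            clusters[nearest].append(i)
        # Recompute each centroid; an empty cluster keeps its old centroid.
        centroids = [
            mean([items[i][feature] for i in members]) if members else centroids[c]
            for c, members in enumerate(clusters)
        ]
    return clusters, undefined


def gather(items: Dict[str, Item], clusters: List[List[str]],
           undefined: List[str], chosen: List[int],
           keep_undefined: bool = True) -> Dict[str, Item]:
    """Union the chosen clusters; optionally carry undefined-valued items along."""
    ids = [i for c in chosen for i in clusters[c]]
    if keep_undefined:
        ids += undefined  # assumed policy, not taken from the paper
    return {i: items[i] for i in ids}


# Toy corpus: item "b" has no usable value for the hypothetical "color" feature.
corpus: Dict[str, Item] = {
    "a": {"text": [1.0, 0.0], "color": [0.9, 0.1]},
    "b": {"text": [0.9, 0.1], "color": None},
    "c": {"text": [0.0, 1.0], "color": [0.1, 0.9]},
    "d": {"text": [0.1, 0.9], "color": [0.8, 0.2]},
    "e": {"text": [0.8, 0.2], "color": [0.1, 0.9]},
}

# Round 1: scatter on text, then gather the first text cluster.
text_clusters, text_undefined = scatter(corpus, "text", k=2)
subset = gather(corpus, text_clusters, text_undefined, chosen=[0])

# Round 2: scatter the subset on color; "b" is reported as undefined.
color_clusters, color_undefined = scatter(subset, "color", k=2)
result = gather(subset, color_clusters, color_undefined, chosen=[0])
print(sorted(result))
```

Switching the feature between rounds is what makes the narrowing multi-modal in this sketch. Keeping undefined-valued items in the gathered subset is one possible policy; setting `keep_undefined=False` drops them instead.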


Pages: 122-133

Page count: 12
