Multi-modal browsing of images in Web documents

被引:1
|
作者
Chen, F [1 ]
Gargi, U [1 ]
Niles, L [1 ]
Schütze, H [1 ]
机构
[1] Xerox Corp, Palo Alto Res Ctr, Palo Alto, CA 94304 USA
来源
关键词
multi-modal information access; image/document browsing and retrieval; clustering; Web documents;
D O I
10.1117/12.335809
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we describe a system for performing browsing and retrieval on a collection of web images and associated text on an HTML page. Browsing is combined with retrieval to help a user locate interesting portions of the corpus, without the need to formulate a query well matched to the corpus. Multi-modal information, in the form of text surrounding an image and some simple image features, is used in this process. Using the system, a user progressively narrows a collection to a small number of elements of interest, similar to the Scatter/Gather system(1) developed for text browsing. We have extended the Scatter/Gather method to use multi-modal features. With the use of multiple features, some collection elements may have unknown or undefined values for some features; we present a method for incorporating these elements into the result set. This method also provides a way to handle the case when a search is narrowed to a part of the space near a boundary between two clusters. A number of examples illustrating our system are provided.
引用
收藏
页码:122 / 133
页数:12
相关论文
共 50 条
  • [1] Extractive summarization of documents with images based on multi-modal RNN
    Chen, Jingqiang
    Hai Zhuge
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 99 : 186 - 196
  • [2] Multi-modal web-browsing - An empirical approach to improve the browsing process of Internet retrieved results
    Rigas, Dimitrios
    Ciuffreda, Antonio
    [J]. SIGMAP 2006: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2006, : 269 - +
  • [3] Metaknowledge Extraction Based on Multi-Modal Documents
    Liu, Shu-Kan
    Xu, Rui-Lin
    Geng, Bo-Ying
    Sun, Qiao
    Duan, Li
    Liu, Yi-Ming
    [J]. IEEE ACCESS, 2021, 9 : 50050 - 50060
  • [4] MULTI-MODAL TRAVEL INFORMATION ON THE WEB
    Pun-Cheng, Lilian S. C.
    Shea, Geoffrey Y. K.
    Mok, Esmond C. M.
    [J]. TRANSPORTATION AND LOGISTICS, 2003, : 285 - 290
  • [5] Loosely-coupled approach towards multi-modal browsing
    Jan Kleindienst
    Ladislav Seredi
    Pekka Kapanen
    Janne Bergman
    [J]. Universal Access in the Information Society, 2003, 2 (2) : 173 - 188
  • [6] AMM: Towards adaptive ranking of multi-modal documents
    Akbari M.
    Nie L.
    Chua T.-S.
    [J]. International Journal of Multimedia Information Retrieval, 2015, 4 (4) : 233 - 245
  • [7] How Web Design influences User Experience: a Multi-modal Method for Real-Time Assessment during Web Browsing
    Caldiroli, Cristina Liviana
    Garbo, Roberta
    Pallavicini, Federica
    Antonietti, Alessandro
    Mangiatordi, Andrea
    Mantovani, Fabrizia
    [J]. 2017 14TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2017, : 1063 - 1066
  • [8] Inequality: multi-modal equation entry on the web
    Franceschini, Andrea
    Sharkey, James P.
    Beresford, Alastair R.
    [J]. L@S '19: PROCEEDINGS OF THE SIXTH (2019) ACM CONFERENCE ON LEARNING @ SCALE, 2019,
  • [9] A framework for creating customized multi-modal interfaces for XML documents
    Rollins, S
    Sundaresan, N
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 933 - 936
  • [10] Multi-modal and Multi-spectral Registration for Natural Images
    Shen, Xiaoyong
    Xu, Li
    Zhang, Qi
    Jia, Jiaya
    [J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 309 - 324