Deep Multimodal Learning for Information Retrieval

Authors
Ji, Wei [1 ]
Wei, Yinwei [2 ]
Zheng, Zhedong [1 ]
Fei, Hao [1 ]
Chua, Tat-Seng [1 ]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
[2] Monash Univ, Clayton, Vic, Australia
Keywords
Information retrieval; Multi-modal; CLIP;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Information retrieval (IR) is a fundamental technique that aims to acquire information from a collection of documents, web pages, or other sources. While traditional text-based IR has achieved great success, the under-utilization of data sources in different modalities (e.g., text, images, audio, and video) keeps IR techniques from reaching their full potential and thus limits their real-world applications. In recent years, the rapid development of deep multimodal learning has paved the way for advancing IR with multiple modalities. Drawing on a variety of data types and modalities, recent techniques such as CLIP, ChatGPT, and GPT-4 have greatly facilitated multimodal and IR learning. In the context of IR, deep multimodal learning has shown strong potential to improve the performance of retrieval systems by enabling them to better understand and process the diverse types of data they encounter. Despite the great potential shown by multimodal-empowered IR, unsolved challenges and open questions remain in the related directions. With this workshop, we aim to provide a platform for discussion about multimodal IR among scholars, practitioners, and other interested parties.
Pages: 9739-9741
Page count: 3
Related papers
50 in total
  • [21] Explainable Information Retrieval using Deep Learning for Medical images
    Singh, Apoorva
    Pannu, Husanbir
    Malhi, Avleen
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2022, 19 (01) : 277 - 307
  • [22] Revisiting Information Retrieval and Deep Learning Approaches for Code Summarization
    Zhu, Tingwei
    Li, Zhong
    Pan, Minxue
    Shi, Chaoxuan
    Zhang, Tian
    Pei, Yu
    Li, Xuandong
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS, ICSE-COMPANION, 2023, : 328 - 329
  • [23] Neural Bookmarks: Information Retrieval with Deep Learning and EEG Data
    Bruns, Glenn
    Haidar, Michael
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 22864 - 22870
  • [24] A Deep Learning Algorithm for Music Information Retrieval Recommendation System
    Liu H.
    Zhao C.
    Computer-Aided Design and Applications, 2024, 21 (S13): : 1 - 16
  • [25] MULTI-LINGUAL INFORMATION RETRIEVAL USING DEEP LEARNING
    Dodal, Sonam Sanjogkumar
    Kulkarni, Pallavi V.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [26] A Retrieval Method for Spatiotemporal Information of Chorography Based on Deep Learning
    Huan S.
    Recent Advances in Computer Science and Communications, 2023, 16 (02) : 30 - 36
  • [27] Multimodal Deep Learning using Images and Text for Information Graphic Classification
    Kim, Edward
    McCoy, Kathleen F.
    ASSETS'18: PROCEEDINGS OF THE 20TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2018, : 143 - 148
  • [28] EFFECT OF TERRAIN INFORMATION ON MULTIMODAL DEEP LEARNING FOR FLOOD DISASTER DETECTION
    Miyamoto, Takashi
    Stricker, Marco
    Ogishima, Jun
    Iselborn, Kevin
    Nuske, Marlon
    Dengel, Andreas
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 448 - 451
  • [29] Research on Online Review Information Classification Based on Multimodal Deep Learning
    Liu, Jingnan
    Sun, Yefang
    Zhang, Yueyi
    Lu, Chenyuan
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [30] Course video recommendation with multimodal information in online learning platforms: A deep learning framework
    Xu, Wei
    Zhou, Yuhan
    BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2020, 51 (05) : 1734 - 1747