Deep Multimodal Learning for Information Retrieval

Authors
Ji, Wei [1]
Wei, Yinwei [2]
Zheng, Zhedong [1]
Fei, Hao [1]
Chua, Tat-Seng [1]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
[2] Monash Univ, Clayton, Vic, Australia
Keywords
Information retrieval; Multi-modal; CLIP
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Information retrieval (IR) is a fundamental technique that aims to acquire information from a collection of documents, web pages, or other sources. While traditional text-based IR has achieved great success, the under-utilization of data sources in different modalities (i.e., text, images, audio, and video) keeps IR techniques from reaching their full potential and thus limits their real-world applications. In recent years, the rapid development of deep multimodal learning has paved the way for advancing IR with multiple modalities. Drawing on a variety of data types and modalities, recent techniques such as CLIP, ChatGPT, and GPT-4 have greatly facilitated both multimodal learning and IR. In the context of IR, deep multimodal learning has shown strong potential to improve the performance of retrieval systems by enabling them to better understand and process the diverse types of data they encounter. Despite the great potential shown by multimodal-empowered IR, unsolved challenges and open questions remain in related directions. With this workshop, we aim to provide a platform for discussion about multimodal IR among scholars, practitioners, and other interested parties.
Pages: 9739-9741
Page count: 3