Multimodal Representation Learning For Real-World Applications

被引:0
|
作者
Joshi, Abhinav [1 ]
机构
[1] Indian Inst Technol Kanpur, Kanpur, Uttar Pradesh, India
关键词
Multimodal Representations; Multimodal Fusion; Cross-modal Processing; Deep Learning Architectures; Machine Learning; RECOGNITION;
D O I
10.1145/3536221.3557030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal representation learning has shown tremendous improvements in recent years. An extensive set of works for fusing multiple modalities have shown promising results on the public benchmarks. However, most famous works target unrealistic settings or toy datasets, and a considerable gap exists between the real-world implications of the existing methods. In this work, we aim to bridge the gap between the well-defned benchmark settings and the real-world use cases. We aim to explore architectures inspired by existing promising approaches that have the potential to be implemented in real-world instances. Moreover, we also try to move the research forward by addressing questions that can be solved using multimodal approaches and have a considerable impact on the community. With this work, we attempt to leverage the multimodal representation learning methods, which directly apply to real-world settings.
引用
收藏
页码:717 / 723
页数:7
相关论文
共 50 条
  • [1] Learning Color Names for Real-World Applications
    van de Weijer, Joost
    Schmid, Cordelia
    Verbeek, Jakob
    Larlus, Diane
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) : 1512 - 1523
  • [2] Overview of transformative learning II: real-world applications
    Anand, Tejwansh S.
    Anand, Shohreh V.
    Welch, Marguerite
    Marsick, Victoria J.
    Langer, Arthur
    [J]. REFLECTIVE PRACTICE, 2020, 21 (06) : 744 - 758
  • [3] Reinforcement Learning in Robotics: Applications and Real-World Challenges
    Kormushev, Petar
    Calinon, Sylvain
    Caldwell, Darwin G.
    [J]. ROBOTICS, 2013, 2 (03): : 122 - 148
  • [4] REAL-WORLD APPLICATIONS
    SMITH, M
    [J]. COMMUNICATIONS OF THE ACM, 1992, 35 (07) : 20 - &
  • [5] Time Series Representation for Real-World Applications of Deep Neural Networks
    Levasseur, Guillaume
    Bersini, Hugues
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Machine Learning: Algorithms, Real-World Applications and Research Directions
    Sarker I.H.
    [J]. SN Computer Science, 2021, 2 (3)
  • [7] A Review on Machine Learning Strategies for Real-World Engineering Applications
    Jhaveri, Rutvij H.
    Revathi, A.
    Ramana, Kadiyala
    Raut, Roshani
    Dhanaraj, Rajesh Kumar
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [8] Real-World ISAR Object Recognition Using Deep Multimodal Relation Learning
    Xue, Bin
    Tong, Ningning
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (10) : 4256 - 4267
  • [9] Foreword of the Special Issue on Real-World Applications of Machine Learning
    Rizvi, Syed Tahir Hussain
    Arif, Arslan
    [J]. ELECTRONICS, 2024, 13 (08)
  • [10] Real-World Computer Vision for Real-World Applications: Challenges and Directions
    Tabkhi, Hamed
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2023, 542 : 727 - 750