Possibilities of the Latest AI Models in Production – Multi-Modal Foundation Models in Production

被引:0
|
作者
Behnen, H. [1 ]
Woltersmann, J.-H. [2 ,3 ]
Wolfschläger, D. [2 ,3 ]
Schmitt, R.H. [2 ,3 ]
机构
[1] RWTH AachenUniversity, Germany
[2] WZL | RWTH Aachen University, Germany
[3] Intelligence in Quality Sensing (IQS) Lehrstuhl für Informations -, Qualitätsund Sensorsysteme in der Produktion, Campus-Boulevard 30, Aachen,52074, Germany
来源
WT Werkstattstechnik | 2024年 / 114卷 / 11-12期
关键词
D O I
10.37544/1436-4980-2024-11-12-43
中图分类号
学科分类号
摘要
Current challenges in production, such as shortage of skilled workers, increase the need to automate processes and increase productivity. Multi-modal foundation models address this automation demand for a variety of applications by deriving decisions based on heterogeneous information sources. However, applications around this technology are currently rare. This article therefore provides an overview of the potential and challenges of these models in production. © 2024, VDI Fachmedien GmBH & Co. KG. All rights reserved.
引用
收藏
页码:747 / 754
相关论文
共 50 条
  • [41] MIS '24: 1st ACM Multimedia Workshop on Multi-modal Misinformation Governance in the Era of Foundation Models
    Fan, Shaojing
    Wang, Zheng
    Shao, Rui
    Bai, Song
    Zhu, Hongyuan
    Nie, Liqiang
    Satoh, Shin'ichi
    PROCEEDINGS OF THE 1ST ACM MULTIMEDIA WORKSHOP ON MULTI-MODAL MISINFORMATION GOVERNANCE IN THE ERA OF FOUNDATION MODELS, MIS 2024, 2024, : 1 - 2
  • [42] Review on the usage of deep learning models in multi-modal sentiment analysis
    Naga Durga Saile K.
    Venkatramaphanikumar S.
    Venkata Krishna Kishore K.
    Bhattacharyya D.
    Venkatramaphanikumar, S. (svrphanikumar@yahoo.com), 1600, Institute of Electronics Engineers of Korea (09): : 435 - 444
  • [43] Multi-modal Terminology Management Corpora, Data Models, and Implementations in TermStar
    Giai, Enrico
    Poeta, Nicola
    Turnbull, David
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 1752 : 219 - 226
  • [44] Dataset and Models for Item Recommendation Using Multi-Modal User Interactions
    Bruun, Simone Borg
    Balog, Krisztian
    Maistro, Maria
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 709 - 718
  • [45] Exploring Fusion Strategies in Deep Learning Models for Multi-Modal Classification
    Zhang, Duoyi
    Nayak, Richi
    Bashar, Md Abul
    DATA MINING, AUSDM 2021, 2021, 1504 : 102 - 117
  • [46] The future of action recognition: are multi-modal visual language models the key?
    Gumuskaynak, Enes
    Eken, Suleyman
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (04)
  • [47] Evaluation of Random Field Models in Multi-modal Unsupervised Tampering Localization
    Korus, Pawel
    Huang, Jiwu
    2016 8TH IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS 2016), 2016,
  • [48] Improving Zero-shot Generalization and Robustness of Multi-modal Models
    Ge, Yunhao
    Ren, Jie
    Gallagher, Andrew
    Wang, Yuxiao
    Yang, Ming-Hsuan
    Adam, Hartwig
    Itti, Laurent
    Lakshminarayanan, Balaji
    Zhao, Raping
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11093 - 11101
  • [49] Multi-modal large language models in radiology: principles, applications, and potential
    Shen, Yiqiu
    Xu, Yanqi
    Ma, Jiajian
    Rui, Wushuang
    Zhao, Chen
    Heacock, Laura
    Huang, Chenchan
    ABDOMINAL RADIOLOGY, 2024,