Possibilities of the Latest AI Models in Production – Multi-Modal Foundation Models in Production

被引:0
|
作者
Behnen, H. [1 ]
Woltersmann, J.-H. [2 ,3 ]
Wolfschläger, D. [2 ,3 ]
Schmitt, R.H. [2 ,3 ]
机构
[1] RWTH AachenUniversity, Germany
[2] WZL | RWTH Aachen University, Germany
[3] Intelligence in Quality Sensing (IQS) Lehrstuhl für Informations -, Qualitätsund Sensorsysteme in der Produktion, Campus-Boulevard 30, Aachen,52074, Germany
来源
WT Werkstattstechnik | 2024年 / 114卷 / 11-12期
关键词
D O I
10.37544/1436-4980-2024-11-12-43
中图分类号
学科分类号
摘要
Current challenges in production, such as shortage of skilled workers, increase the need to automate processes and increase productivity. Multi-modal foundation models address this automation demand for a variety of applications by deriving decisions based on heterogeneous information sources. However, applications around this technology are currently rare. This article therefore provides an overview of the potential and challenges of these models in production. © 2024, VDI Fachmedien GmBH & Co. KG. All rights reserved.
引用
收藏
页码:747 / 754
相关论文
共 50 条
  • [31] Spline regression models for complex multi-modal regulatory networks
    Ozmen, A.
    Kropat, E.
    Weber, G. -W.
    OPTIMIZATION METHODS & SOFTWARE, 2014, 29 (03): : 515 - 534
  • [32] MMA: Multi-Modal Adapter for Vision-Language Models
    Yang, Lingxiao
    Zhang, Ru-Yuan
    Wang, Yanchen
    Xie, Xiaohua
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23826 - +
  • [33] Incorporating Concreteness in Multi-Modal Language Models with Curriculum Learning
    Sezerer, Erhan
    Tekir, Selma
    APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [34] Multi-Modal Estimation with Kernel Embeddings for Learning Motion Models
    McCalman, Lachlan
    O'Callaghan, Simon
    Ramos, Fabio
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 2845 - 2852
  • [35] Generative Multi-Modal Knowledge Retrieval with Large Language Models
    Long, Xinwei
    Zeng, Jiali
    Meng, Fandong
    Ma, Zhiyuan
    Zhang, Kaiyan
    Zhou, Bowen
    Zhou, Jie
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18733 - 18741
  • [36] Multi-Modal Generative Models for Learning Epistemic Active Sensing
    Korthals, Timo
    Rudolph, Daniel
    Leitner, Juergen
    Hesse, Marc
    Rueckert, Ulrich
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 3319 - 3325
  • [37] MULTI-MODAL BIG-DATA MANAGEMENT FOR FILM PRODUCTION
    Kim, Hansung
    Pabst, Simon
    Sneddon, Justin
    Waine, Ted
    Clifford, Jeff
    Hilton, Adrian
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4833 - 4837
  • [38] Multi-Modal Experience Inspired AI Creation
    Cao, Qian
    Chen, Xu
    Song, Ruihua
    Jiang, Hao
    Yang, Guang
    Cao, Zhao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1445 - 1454
  • [39] Comparison of Multi-Modal Large Language Models with Deep Learning Models for Medical Image Classification
    Than, Joel Chia Ming
    Vong, Wan Tze
    Yong, Kelvin Sheng Chek
    2024 IEEE 8TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS, ICSIPA, 2024,
  • [40] Foundation Models, Generative AI, and Large Language Models
    Ross, Angela
    McGrow, Kathleen
    Zhi, Degui
    Rasmy, Laila
    CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 377 - 387