Possibilities of the Latest AI Models in Production – Multi-Modal Foundation Models in Production

被引：0

作者：

Behnen, H. ^{[1
]}

Woltersmann, J.-H. ^{[2
,3
]}

Wolfschläger, D. ^{[2
,3
]}

Schmitt, R.H. ^{[2
,3
]}

机构：

[1] RWTH AachenUniversity, Germany

[2] WZL | RWTH Aachen University, Germany

[3] Intelligence in Quality Sensing (IQS) Lehrstuhl für Informations -, Qualitätsund Sensorsysteme in der Produktion, Campus-Boulevard 30, Aachen,52074, Germany

来源：

WT Werkstattstechnik | 2024年 / 114卷 / 11-12期

关键词：

D O I：

10.37544/1436-4980-2024-11-12-43

中图分类号：

学科分类号：

摘要：

Current challenges in production, such as shortage of skilled workers, increase the need to automate processes and increase productivity. Multi-modal foundation models address this automation demand for a variety of applications by deriving decisions based on heterogeneous information sources. However, applications around this technology are currently rare. This article therefore provides an overview of the potential and challenges of these models in production. © 2024, VDI Fachmedien GmBH & Co. KG. All rights reserved.

引用

页码：747 / 754

共 50 条

[41] MIS '24: 1st ACM Multimedia Workshop on Multi-modal Misinformation Governance in the Era of Foundation Models
Fan, Shaojing
Wang, Zheng
Shao, Rui
Bai, Song
Zhu, Hongyuan
Nie, Liqiang
Satoh, Shin'ichi
PROCEEDINGS OF THE 1ST ACM MULTIMEDIA WORKSHOP ON MULTI-MODAL MISINFORMATION GOVERNANCE IN THE ERA OF FOUNDATION MODELS, MIS 2024, 2024, : 1 - 2
[42] Review on the usage of deep learning models in multi-modal sentiment analysis
Naga Durga Saile K.
Venkatramaphanikumar S.
Venkata Krishna Kishore K.
Bhattacharyya D.
Venkatramaphanikumar, S. (svrphanikumar@yahoo.com), 1600, Institute of Electronics Engineers of Korea (09): : 435 - 444
[43] Multi-modal Terminology Management Corpora, Data Models, and Implementations in TermStar
Giai, Enrico
Poeta, Nicola
Turnbull, David
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 1752 : 219 - 226
[44] Dataset and Models for Item Recommendation Using Multi-Modal User Interactions
Bruun, Simone Borg
Balog, Krisztian
Maistro, Maria
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 709 - 718
[45] Exploring Fusion Strategies in Deep Learning Models for Multi-Modal Classification
Zhang, Duoyi
Nayak, Richi
Bashar, Md Abul
DATA MINING, AUSDM 2021, 2021, 1504 : 102 - 117
[46] The future of action recognition: are multi-modal visual language models the key?
Gumuskaynak, Enes
Eken, Suleyman
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (04)
[47] Evaluation of Random Field Models in Multi-modal Unsupervised Tampering Localization
Korus, Pawel
Huang, Jiwu
2016 8TH IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS 2016), 2016,
[48] Improving Zero-shot Generalization and Robustness of Multi-modal Models
Ge, Yunhao
Ren, Jie
Gallagher, Andrew
Wang, Yuxiao
Yang, Ming-Hsuan
Adam, Hartwig
Itti, Laurent
Lakshminarayanan, Balaji
Zhao, Raping
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11093 - 11101
[49] Multi-modal large language models in radiology: principles, applications, and potential
Shen, Yiqiu
Xu, Yanqi
Ma, Jiajian
Rui, Wushuang
Zhao, Chen
Heacock, Laura
Huang, Chenchan
ABDOMINAL RADIOLOGY, 2024,
[50] Large multi-modal models - the present or future of artificial intelligence in medicine?
Sonicki, Zdenko
CROATIAN MEDICAL JOURNAL, 2024, 65 (01) : 1 - 2

← 1 2 3 4 5 →