共 50 条
- [31] Multi-Modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18180 - 18187
- [33] Task-Oriented Multi-Modal Mutual Learning for Vision-Language Models 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21902 - 21912
- [34] Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 26753 - 26763
- [35] Multi-modal Adapter for Medical Vision-and-Language Learning MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 393 - 402
- [37] Demonstrating CAESURA: Language Models as Multi-Modal Query Planners COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 472 - 475
- [38] Multi-modal Language Models for Human-Robot Interaction COMPANION OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 COMPANION, 2024, : 109 - 111
- [39] MMA: Multi-Modal Adapter for Vision-Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23826 - +
- [40] Gait Image Classification Using Deep Learning Models for Medical Diagnosis CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 6039 - 6063