50 entries in total
- [42] A Comprehensive Benchmark and Evaluation of Thai Finger Spelling in Multi-Modal Deep Learning Models IEEE ACCESS, 2024, 12 : 158079 - 158093
- [43] MULTI-MODAL DEEP LEARNING ON IMAGING GENETICS FOR SCHIZOPHRENIA CLASSIFICATION 2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023
- [47] Depicting Beyond Scores: Advancing Image Quality Assessment Through Multi-modal Language Models COMPUTER VISION - ECCV 2024, PT XLVII, 2025, 15105 : 259 - 276
- [48] Fusion of Deep Learning Models for Multi-View Image Classification SIGNAL PROCESSING, SENSOR/INFORMATION FUSION, AND TARGET RECOGNITION XXXII, 2023, 12547
- [49] VirtuWander: Enhancing Multi-modal Interaction for Virtual Tour Guidance through Large Language Models PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024
- [50] An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (12): 4333 - 4336