共 50 条
- [1] Vision-Language Model for Visual Question Answering in Medical Imagery BIOENGINEERING-BASEL, 2023, 10 (03):
- [3] SELF-SUPERVISED VISION-LANGUAGE PRETRAINING FOR MEDIAL VISUAL QUESTION ANSWERING 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
- [4] BIVL-Net: Bidirectional Vision-Language Guidance for Visual Question Answering PATTERN RECOGNITION AND COMPUTER VISION, PT III, PRCV 2024, 2025, 15033 : 481 - 495
- [6] Vision-language models for medical report generation and visual question answering: a review FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
- [7] Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6859 - 6865
- [8] Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 513 - 529
- [9] VISION AND TEXT TRANSFORMER FOR PREDICTING ANSWERABILITY ON VISUAL QUESTION ANSWERING 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 934 - 938
- [10] Faster, Stronger, and More Interpretable: Massive Transformer Architectures for Vision-Language Tasks ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2023, 3 (03): : 1369 - 1388