共 50 条
- [31] Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 349 - 357
- [32] Interpretable Complex Question Answering WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2455 - 2457
- [34] Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [35] VinVL: Revisiting Visual Representations in Vision-Language Models 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5575 - 5584
- [38] BRAVE: Broadening the Visual Encoding of Vision-Language Models COMPUTER VISION - ECCV 2024, PT XVI, 2025, 15074 : 113 - 132
- [39] SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IX, 2023, 14228 : 281 - 290