50 items in total
- [1] A Transformer-based Medical Visual Question Answering Model [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022: 1712 - 1718
- [2] TRANS-VQA: Fully Transformer-Based Image Question-Answering Model Using Question-guided Vision Attention [J]. INTELIGENCIA ARTIFICIAL-IBEROAMERICAN JOURNAL OF ARTIFICIAL INTELLIGENCE, 2024, 27 (73): 111 - 128
- [3] VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 21525 - 21535
- [4] Transformer-Based Neural Network for Answer Selection in Question Answering [J]. IEEE ACCESS, 2019, 7 : 26146 - 26156
- [5] Multimodal Learning and Reasoning for Visual Question Answering [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [7] Transformer-based Sparse Encoder and Answer Decoder for Visual Question Answering [J]. 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022: 120 - 123
- [8] Surgical-VQA: Visual Question Answering in Surgical Scenes Using Transformer [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 33 - 43
- [9] Multimodal Knowledge Reasoning for Enhanced Visual Question Answering [J]. 2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022: 224 - 230
- [10] MUREL: Multimodal Relational Reasoning for Visual Question Answering [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 1989 - 1998