50 entries in total
- [41] Local self-attention in transformer for visual question answering. Applied Intelligence, 2023, 53: 16706-16723
- [42] A Transformer-based Medical Visual Question Answering Model. 2022 26th International Conference on Pattern Recognition (ICPR), 2022: 1712-1718
- [45] TRAR: Routing the Attention Spans in Transformer for Visual Question Answering. 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021: 2054-2064
- [48] An Empirical Study on the Language Modal in Visual Question Answering. Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 2023: 4109-4117
- [49] Language Transformers for Remote Sensing Visual Question Answering. 2022 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2022), 2022: 4855-4858
- [50] Target-Driven Structured Transformer Planner for Vision-Language Navigation. Proceedings of the 30th ACM International Conference on Multimedia, MM 2022, 2022: 4194-4203