共 50 条
- [1] QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document [J]. DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 659 - 673
- [4] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering [J]. IEEE ACCESS, 2020, 8 : 35662 - 35671
- [6] Multimodal Cross-guided Attention Networks for Visual Question Answering [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTER MODELING, SIMULATION AND ALGORITHM (CMSA 2018), 2018, 151 : 347 - 353
- [8] Multimodal Bi-direction Guided Attention Networks for Visual Question Answering [J]. Neural Processing Letters, 2023, 55 : 11921 - 11943
- [9] An Improved Attention for Visual Question Answering [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1653 - 1662
- [10] Differential Attention for Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7680 - 7688