50 entries in total
- [41] Multimodal Local Perception Bilinear Pooling for Visual Question Answering [J]. IEEE ACCESS, 2018, 6 : 57923 - 57932
- [42] Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 1839 - 1848
- [44] Multi-Channel Co-Attention Network for Visual Question Answering [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
- [45] CRA-Net: Composed Relation Attention Network for Visual Question Answering [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019 : 1202 - 1210
- [47] Efficient Multi-step Reasoning Attention Network for Visual Question Answering [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
- [49] Compound-Attention Network with Original Feature injection for visual question and answering [J]. Signal, Image and Video Processing, 2021, 15 : 1853 - 1861
- [50] Affective Visual Question Answering Network [J]. IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018 : 170 - 173