50 entries in total
- [23] Visual question answering model based on the fusion of multimodal features by a two-way co-attention mechanism [J]. IMAGING SCIENCE JOURNAL, 2021, 69 (1-4): 177-189
- [24] Feature Fusion Attention Visual Question Answering [J]. ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019: 412-416
- [25] Bi-direction Co-Attention Network on Visual Question Answering for Blind People [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
- [27] Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 1839-1848
- [28] Adaptive Attention Fusion Network for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017: 997-1002
- [30] SPCA-Net: a based on spatial position relationship co-attention network for visual question answering [J]. VISUAL COMPUTER, 2022, 38 (9-10): 3097-3108