共 50 条
- [48] Word-to-region attention network for visual question answering [J]. Multimedia Tools and Applications, 2019, 78 : 3843 - 3858
- [49] Deep Attention Neural Tensor Network for Visual Question Answering [J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 21 - 37