- [51] Li N N, Chen Z Z., Learning compact reward for image captioning, (2020)
- [52] Chen H G, Zhang H, Chen P Y, et al., Attacking visual language grounding with adversarial examples: A case study on neural image captioning, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2587-2597, (2018)
- [53] Shekhar R, Pezzelle S, Klimovich Y, et al., FOIL it! Find One mismatch between Image and Language caption, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 255-265, (2017)
- [54] Dai B, Lin D H., Contrastive learning for image captioning, (2017)
- [55] Feng Y, Ma L, Liu W, et al., Unsupervised image captioning, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4120-4129, (2019)
- [56] Bhargava S, Forsyth D., Exposing and correcting the gender bias in image captioning datasets and models, (2019)
- [57] Shuster K, Humeau S, Hu H X, et al., Engaging image captioning via personality, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12508-12518, (2019)
- [58] Kim D J, Choi J, Oh T H, et al., Dense relational captioning: Triple-stream networks for relationship-based captioning, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6264-6273, (2019)
- [59] Biten A F, Gomez L, Rusinol M, et al., Good news, everyone! Context driven entity-aware captioning for news images, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12458-12467, (2019)
- [60] Guo L T, Liu J, Yao P, et al., MSCap: Multi-style image captioning with unpaired stylized text, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4199-4208, (2019)