共 12 条
- [1] Chen S, Jin Q, Wang P., Say as you wish: fine grained control of image caption generation with abstract scene graphs, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9962-9971, (2020)
- [2] Shi J, Zhang H, Li J., Explainable and explicit visual reasoning over scene graphs, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8376-8384, (2019)
- [3] Rennie S J, Marcheret E, Mroueh Y., Self critical sequence training for image captioning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7008-7024, (2017)
- [4] Lu J, Yang J, Batra D., Neural baby talk, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7219-7228, (2018)
- [5] Anderson P, He X, Buehler C., Bottom up and top down attention for image captioning and visual question answering, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6077-6086, (2018)
- [6] Deshpande A, Aneja J, Wang L., Fast, diverse and accurate image captioning guided by part of speech, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10695-10704, (2019)
- [7] Yang X, Tang K, Zhang H., Auto encoding scene graphs for image captioning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 10685-10694, (2019)
- [8] Chen L, Zhang H, Xiao J., SCA CNN: spatial and channel wise attention in convolutional networks for image captioning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5659-5667, (2017)
- [9] Feng Y, Ma L, Liu W., Unsupervised image captioning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4125-4134, (2019)
- [10] Jiang W, Ma L, Jiang Y., Recurrent fusion network for image captioning, Proceedings of the European Conference on Computer Vision, pp. 499-515, (2018)