64 entries in total
- [1] Quan Y, Li Z X, Zhang C L, et al., Fusing deep dilated convolutions network and light-weight network for object detection, Acta Electronica Sinica, 48, 2, pp. 390-397, (2020)
- [2] Liu Y, Liu H Y, Fan J L, et al., A survey of research and application of small object detection based on deep learning, Acta Electronica Sinica, 48, 3, pp. 590-601, (2020)
- [3] A brief survey of the development history and latest work in image captioning (2010-2018)
- [4] Vinyals O, Toshev A, Bengio S, et al., Show and tell: Lessons learned from the 2015 MSCOCO image captioning challenge, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 4, pp. 652-663, (2017)
- [5] Tan X, Ren Y, He D, et al., Multilingual neural machine translation with knowledge distillation, (2019)
- [6] Karpathy A, Li F F., Deep visual-semantic alignments for generating image descriptions, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 4, pp. 664-676, (2017)
- [7] Simonyan K, Zisserman A., Very deep convolutional networks for large-scale image recognition, (2014)
- [8] Fang H, Gupta S, Iandola F, et al., From captions to visual concepts and back, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1473-1482, (2015)
- [9] Li N, Chen Z., Image captioning with visual-semantic LSTM, Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, pp. 793-799, (2018)
- [10] Anderson P, He X D, Buehler C, et al., Bottom-up and top-down attention for image captioning and visual question answering, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6077-6086, (2018)