31 references in total
- [1] Kulkarni G, Premraj V, Ordonez V, Dhar S, Li S M, Choi Y, et al., BabyTalk: Understanding and generating simple image descriptions, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 12, pp. 2891-2903, (2013)
- [2] Mao J H, Xu W, Yang Y, Wang J, Yuille A L., Deep captioning with multimodal recurrent neural networks (m-RNN), Proceedings of the 3rd International Conference on Learning Representations, (2015)
- [3] Tang Peng-Jie, Wang Han-Li, Xu Kai-Sheng, Multi-objective layer-wise optimization and multi-level probability fusion for image description generation using LSTM, Acta Automatica Sinica, 44, 7, pp. 1237-1249, (2018)
- [4] Cho K, Van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, et al., Learning phrase representations using RNN encoder-decoder for statistical machine translation, (2014)
- [5] Bahdanau D, Cho K, Bengio Y., Neural machine translation by jointly learning to align and translate, Proceedings of the 3rd International Conference on Learning Representations, (2015)
- [6] Sutskever I, Vinyals O, Le Q V., Sequence to sequence learning with neural networks, Proceedings of the 27th International Conference on Neural Information Processing Systems, (2014)
- [7] Vinyals O, Toshev A, Bengio S, Erhan D., Show and tell: A neural image caption generator, Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3156-3164, (2015)
- [8] Zhang Xue-Song, Zhuang Yan, Yan Fei, Wang Wei, Status and development of transfer learning based category-level object recognition and detection, Acta Automatica Sinica, 45, 7, pp. 1224-1243, (2019)
- [9] You Q Z, Jin H L, Wang Z W, Fang C, Luo J B., Image captioning with semantic attention, Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4651-4659, (2016)
- [10] Hochreiter S, Schmidhuber J., Long short-term memory, Neural Computation, 9, 8, pp. 1735-1780, (1997)