共 23 条
- [1] Ren M., Kiros R., Zemel R., Exploring models and data for image question answering, Proc of the 29th Conf on Advances in Neural Information Processing Systems, pp. 2953-2961, (2015)
- [2] Agrawal A., Lu J., Antol S., Et al., VQA: Visual question answering, International Journal of Computer Vision, 123, 1, pp. 1-28, (2017)
- [3] Jiang S., Min W., Wang S., Surver and prospect of intelligent interaction-oriented image recognition techniques, Journal of Computer Research and Development, 53, 1, pp. 113-122, (2016)
- [4] Lecun Y., Boser B.E., Denker J.S., Et al., Backpropagation applied to handwritten zip code recognition, Neural Computation, 1, 4, pp. 541-551, (2014)
- [5] Elman J.L., Finding structure in time, Cognitive Science, 14, 2, pp. 179-211, (1990)
- [6] Simonyan K., Zisserman A., Very deep convolutional networks for large-scale image recognition, (2014)
- [7] He K., Zhang X., Ren S., Et al., Deep residual learning for image recognition, Proc of the 29th IEEE Conf on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
- [8] Girshick R., Donahue J., Darrell T., Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, Proc of the 27th IEEE Conf on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
- [9] Cho K., Van Merrienboer B., Gulcehre C., Et al., Learning phrase representations using RNN encoder-decoder for statistical machine translation, Proc of the 19th Conf on Empirical Methods in Natural Language, pp. 1724-1734, (2014)
- [10] Donahue J., Anne H.L., Guadarrama S., Et al., Long-term recurrent convolutional networks for visual recognition and description, Proc of the 28th IEEE Conf on Computer Vision and Pattern Recognition, pp. 2625-2634, (2015)