共 41 条
- [1] Shou Z., Wang D., Chang S.F., Temporal action localization in untrimmed videos via multi-stage CNNs, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1049-1058, (2016)
- [2] Hochreiter S., Schmidhuber J., Long short-term memory, Neural Computation, 9, 8, pp. 1735-1780, (1997)
- [3] Russakovsky O., Deng J., Su H., Et al., ImageNet large scale visual recognition challenge, International Journal of Computer Vision, 115, 3, pp. 211-252, (2015)
- [4] Lin T.Y., Maire M., Belongie S., Et al., Microsoft COCO: common objects in context, Proceedings of European Conference on Computer Vision, pp. 740-755, (2014)
- [5] Laptev I., Marszalek M., Schmid C., Et al., Learning realistic human actions from movies, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, (2008)
- [6] Soomro K., Zamir A.R., Shah M., UCF101: a dataset of 101 human actions classes from videos in the wild
- [7] Jiang Y.G., Liu J., Zamir A.R., Et al., THUMOS challenge: action recognition with a large number of classes
- [8] Kay W., Carreira J., Simonyan K., Et al., The kinetics human action video dataset
- [9] Abu-El-Haija S., Kothari N., Lee J., Et al., YouTube-8M: a large-scale video classification benchmark
- [10] Chen D.L., Dolan W.B., Collecting highly parallel data for paraphrase evaluation, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 1, pp. 190-200, (2011)