共 50 条
- [32] Semantic Tag Augmented XlanV Model for Video Captioning PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4818 - 4822
- [33] Visual to Text: Survey of Image and Video Captioning IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2019, 3 (04): : 297 - 312
- [38] Semantic analysis based on fusion of audio/visual features for soccer video PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY, 2021, 183 : 563 - 571
- [39] Combining caption and visual features for semantic event classification of baseball video 2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 1255 - 1258
- [40] When Visual Object-Context Features Meet Generic and Specific Semantic Priors in Image Captioning TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069