50 entries in total
- [1] Multimodal Deep Neural Network with Image Sequence Features for Video Captioning [J]. 2018 International Joint Conference on Neural Networks (IJCNN), 2018
- [3] MIVCN: Multimodal interaction video captioning network based on semantic association graph [J]. Applied Intelligence, 2022, 52 : 5241 - 5260
- [5] Multimodal Semantic Attention Network for Video Captioning [J]. 2019 IEEE International Conference on Multimedia and Expo (ICME), 2019 : 1300 - 1305
- [7] Video Captioning with Temporal and Region Graph Convolution Network [J]. 2020 IEEE International Conference on Multimedia and Expo (ICME), 2020
- [9] Multimodal-enhanced hierarchical attention network for video captioning [J]. Multimedia Systems, 2023, 29 : 2469 - 2482
- [10] Multirate Multimodal Video Captioning [J]. Proceedings of the 2017 ACM Multimedia Conference (MM'17), 2017 : 1877 - 1882