共 50 条
- [21] Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2615 - 2624
- [24] Image Captioning Based on Visual Relevance and Context Dual Attention Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):
- [25] Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning INTERSPEECH 2023, 2023, : 4164 - 4168
- [28] A Concise and Varied Visual Features-Based Image Captioning Model with Visual Selection CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (02): : 2873 - 2894
- [30] Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning International Journal of Computer Vision, 2022, 130 : 1920 - 1937