共 50 条
- [41] Dense Video Captioning with Hierarchical Attention-Based Encoder-Decoder Networks 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
- [42] Video Captioning with Guidance of Multimodal Latent Topics PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1838 - 1846
- [43] Attend to Knowledge: Memory-Enhanced Attention Network for Image Captioning ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 161 - 171
- [44] Critic-based Attention Network for Event-based Video Captioning PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 811 - 817
- [45] Semantic Enhanced Encoder-Decoder Network (SEN) for Video Captioning PROCEEDINGS OF THE 2ND WORKSHOP ON MULTIMEDIA FOR ACCESSIBLE HUMAN COMPUTER INTERFACES (MAHCI '19), 2019, : 25 - 32
- [46] Leveraging Weighted Fine-Grained Cross-Graph Attention for Visual and Semantic Enhanced Video Captioning Network THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2465 - 2473
- [47] Reconstruction Network for Video Captioning 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7622 - 7631
- [49] Video Captioning via Hierarchical Reinforcement Learning 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4213 - 4222
- [50] RESTHT: relation-enhanced spatial-temporal hierarchical transformer for video captioning VISUAL COMPUTER, 2025, 41 (01): : 591 - 604