50 entries in total
- [31] Deep multimodal embedding for video captioning. Multimedia Tools and Applications, 2019, 78: 31793-31805
- [32] Multimodal Pretraining for Dense Video Captioning. 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2020), 2020: 470-490
- [35] Hierarchical Context-aware Network for Dense Video Event Captioning. 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Vol 1 (ACL-IJCNLP 2021), 2021: 2004-2013
- [37] Gated Hierarchical Attention for Image Captioning. Computer Vision - ACCV 2018, Pt IV, 2019, 11364: 21-37
- [38] MIVCN: Multimodal interaction video captioning network based on semantic association graph. Applied Intelligence, 2022, 52: 5241-5260
- [40] Hierarchical Memory Modelling for Video Captioning. Proceedings of the 2018 ACM Multimedia Conference (MM'18), 2018: 63-71