50 items in total
- [21] Multi-modal Representation Learning for Video Advertisement Content Structuring. Proceedings of the 29th ACM International Conference on Multimedia (MM 2021), 2021, pp. 4770-4774
- [23] Learning Multi-Modal Word Representation Grounded in Visual Context. Thirty-Second AAAI Conference on Artificial Intelligence / Thirtieth Innovative Applications of Artificial Intelligence Conference / Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, 2018, pp. 5626-5633
- [27] Multi-Modal Representation Learning with Text-Driven Soft Masks. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 2798-2807
- [28] MMEarth: Exploring Multi-modal Pretext Tasks for Geospatial Representation Learning. Computer Vision - ECCV 2024, Part LXIV, 2025, vol. 15122, pp. 164-182