50 entries in total
- [1] Multi-Modal Contrastive Pre-training for Recommendation [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022 : 99 - 108
- [2] CLAP: Contrastive Language-Audio Pre-training Model for Multi-modal Sentiment Analysis [J]. PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023 : 622 - 626
- [3] Multi-modal Masked Autoencoders for Medical Vision-and-Language Pre-training [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 679 - 689
- [4] Multi-Modal Pre-Training for Automated Speech Recognition [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022 : 246 - 250
- [5] TableVLM: Multi-modal Pre-training for Table Structure Recognition [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023 : 2437 - 2449
- [6] Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 378 - 395
- [7] Versatile Multi-Modal Pre-Training for Human-Centric Perception [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 16135 - 16145
- [9] Graph-Text Multi-Modal Pre-training for Medical Representation Learning [J]. CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 261 - 281
- [10] MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021 : 694 - 695