50 items in total
- [21] Multi-modal Masked Autoencoders for Medical Vision-and-Language Pre-training. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435: 679-689
- [25] Multi-modal Pathological Pre-training via Masked Autoencoders for Breast Cancer Diagnosis. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225: 457-466
- [26] A multi-modal pre-training transformer for universal transfer learning in metal–organic frameworks. Nature Machine Intelligence, 2023, 5: 309-318
- [27] StEP: Style-based Encoder Pre-training for Multi-modal Image Synthesis. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 3711-3720
- [29] MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition. INTERSPEECH 2023, 2023: 4943-4947
- [30] Multi-modal Graph Contrastive Learning for Micro-video Recommendation. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022: 1807-1811