共 50 条
- [1] Multi-modal Pathological Pre-training via Masked Autoencoders for Breast Cancer Diagnosis [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 457 - 466
- [2] Multi-modal Adapter for Medical Vision-and-Language Learning [J]. MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 393 - 402
- [4] Cross-modal Semantic Alignment Pre-training for Vision-and-Language Navigation [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4233 - 4241
- [5] Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 378 - 395
- [6] Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark [J]. CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 209, 2023, 209 : 117 - +
- [7] MGeo: Multi-Modal Geographic Language Model Pre-Training [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 185 - 194
- [9] Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5152 - 5161
- [10] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23346 - 23356