共 50 条
- [1] Efficient Large-Scale Multi-Modal Classification [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5198 - 5204
- [2] Multi-Modal Contrastive Pre-training for Recommendation [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 99 - 108
- [3] MULTI-MODAL PRE-TRAINING FOR AUTOMATED SPEECH RECOGNITION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 246 - 250
- [4] MGeo: Multi-Modal Geographic Language Model Pre-Training [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 185 - 194
- [5] TableVLM: Multi-modal Pre-training for Table Structure Recognition [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2437 - 2449
- [6] Pre-training on Large-Scale Heterogeneous Graph [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 756 - 766
- [7] Real-time Emotion Pre-Recognition in Conversations with Contrastive Multi-modal Dialogue Pre-training [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1045 - 1055
- [8] Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 378 - 395
- [9] Versatile Multi-Modal Pre-Training for Human-Centric Perception [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16135 - 16145
- [10] Effective Classification for Multi-modal Behavioral Authentication on Large-Scale Data [J]. 2020 15TH ASIA JOINT CONFERENCE ON INFORMATION SECURITY (ASIAJCIS 2020), 2020, : 101 - 109