共 50 条
- [2] Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11727 - 11736
- [3] Multi-modal Broad Learning System for Medical Image and Text-based Classification [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3439 - 3442
- [4] Modal Complementarity Based on Multimodal Large Language Model for Text-Based Person Retrieval [J]. WEB AND BIG DATA, APWEB-WAIM 2024, PT I, 2024, 14961 : 264 - 279
- [5] SIAMCLIM: TEXT-BASED PEDESTRIAN SEARCH VIA MULTI-MODAL SIAMESE CONTRASTIVE LEARNING [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1800 - 1804
- [6] PaSeMix: A Multi-modal Partitional Semantic Data Augmentation Method for Text-Based Person Search [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 468 - 479
- [7] Text-Video Retrieval via Multi-Modal Hypergraph Networks [J]. PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 369 - 377
- [8] Deep Neural Architecture for Multi-Modal Retrieval based on Joint Embedding Space for Text and Images [J]. WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 28 - 36
- [9] Automatic generation of multi-modal dialogue from text based on discourse structure analysis [J]. ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 27 - +
- [10] MULTI-MODAL LEARNING WITH TEXT MERGING FOR TEXTVQA [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1985 - 1989