共 50 条
- [22] Multi-Modal Dynamic Graph Transformer for Visual Grounding 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15513 - 15522
- [23] Conversational multi-modal browser: An integrated multi-modal browser and dialog manager 2003 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2003, : 348 - 351
- [24] OCR-Aware Scene Graph Generation Via Multi-modal Object Representation Enhancement and Logical Bias Learning PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 201 - 215
- [25] Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2215 - 2224
- [29] Multi-modal Graph and Sequence Fusion Learning for Recommendation PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 357 - 369
- [30] Fast Multi-Modal Unified Sparse Representation Learning PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 448 - 452