共 50 条
- [1] Prompting Visual-Language Models for Efficient Video Understanding COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 105 - 124
- [2] VISA: Reasoning Video Object Segmentation via Large Language Models COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 98 - 115
- [3] Towards Language-Driven Video Inpainting via Multimodal Large Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 12501 - 12511
- [5] VideoAgent: Long-Form Video Understanding with Large Language Model as Agent COMPUTER VISION - ECCV 2024, PT LXXX, 2025, 15138 : 58 - 76
- [6] The Importance of Understanding Language in Large Language Models AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (10): : 6 - 7
- [8] Understanding Telecom Language Through Large Language Models IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6542 - 6547
- [9] Understanding HTML']HTML with Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2803 - 2821