50 items in total
- [1] Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5038 - 5047
- [2] ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22168 - 22178
- [3] VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [4] Leveraging per Image-Token Consistency for Vision-Language Pre-training [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19155 - 19164
- [5] ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15375 - 15385
- [6] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19175 - 19186
- [7] Toward Real-world Panoramic Image Enhancement [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2675 - 2684
- [9] Debiased Subjective Assessment of Real-World Image Enhancement [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 711 - 721
- [10] Image restoration for real-world under-display imaging [J]. OPTICS EXPRESS, 2021, 29 (23) : 37820 - 37834