Adapting Vision-Language Models via Learning to Inject Knowledge

被引:0
|
作者
Xuan, Shiyu [1 ,2 ]
Yang, Ming [3 ]
Zhang, Shiliang [2 ,4 ]
机构
[1] Nanjing University of Science and Technology, School of Computer Science and Engineering, Nanjing,210094, China
[2] Peking University, National Key Laboratory for Multimedia Information Processing, School of Computer Science, Beijing,100871, China
[3] Ant Group, Multi-Modality Cognition Department, Zhejiang, Hangzhou,310023, China
[4] Peng Cheng Laboratory, Shenzhen,518055, China
关键词
Encoding (symbols) - Semantic Segmentation - Semantics - Visual languages;
D O I
10.1109/TIP.2024.3468884
中图分类号
学科分类号
摘要
引用
收藏
页码:5798 / 5809
相关论文
共 50 条
  • [1] Adapting vision-language AI models to cardiology tasks
    Arnaout, Rima
    [J]. NATURE MEDICINE, 2024,
  • [2] Learning to Prompt for Vision-Language Models
    Kaiyang Zhou
    Jingkang Yang
    Chen Change Loy
    Ziwei Liu
    [J]. International Journal of Computer Vision, 2022, 130 : 2337 - 2348
  • [3] Learning to Prompt for Vision-Language Models
    Zhou, Kaiyang
    Yang, Jingkang
    Loy, Chen Change
    Liu, Ziwei
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (09) : 2337 - 2348
  • [4] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
    Ye, Shuquan
    Xie, Yujia
    Chen, Dongdong
    Xu, Yichong
    Yuan, Lu
    Zhu, Chenguang
    Liao, Jing
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2634 - 2645
  • [5] Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models
    Wang, Yubin
    Jiang, Xinyang
    Cheng, De
    Li, Dongsheng
    Zhao, Cairong
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5749 - 5757
  • [6] Exploring Vision-Language Models for Imbalanced Learning
    Wang Y.
    Yu Z.
    Wang J.
    Heng Q.
    Chen H.
    Ye W.
    Xie R.
    Xie X.
    Zhang S.
    [J]. International Journal of Computer Vision, 2024, 132 (1) : 224 - 237
  • [7] Conditional Prompt Learning for Vision-Language Models
    Zhou, Kaiyang
    Yang, Jingkang
    Loy, Chen Change
    Liu, Ziwei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16795 - 16804
  • [8] CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection
    Khan, Sohail Ahmed
    Duc-Tien Dang-Nguyen
    [J]. PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1006 - 1015
  • [9] Learning Domain Invariant Prompt for Vision-Language Models
    Zhao, Cairong
    Wang, Yubin
    Jiang, Xinyang
    Shen, Yifei
    Song, Kaitao
    Li, Dongsheng
    Miao, Duoqian
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1348 - 1360
  • [10] QLDT: adaptive Query Learning for HOI Detection via vision-language knowledge Transfer
    Wang, Xincheng
    Gao, Yongbin
    Yu, Wenjun
    Wu, Chenmou
    Chen, Mingxuan
    Ma, Honglei
    Chen, Zhichao
    [J]. APPLIED INTELLIGENCE, 2024, 54 (19) : 9008 - 9027