Open-World Dynamic Prompt and Continual Visual Representation Learning

被引:0
|
作者
Kim, Youngeun [1 ]
Fang, Jun [2 ]
Zhang, Qin [2 ]
Cai, Zhaowei [3 ]
Shen, Yantao [2 ]
Duggal, Rahul [2 ]
Raychaudhuri, Dripta S. [2 ]
Tut, Zhuowen [2 ]
Xing, Yifan [2 ]
Dabeer, Onkar [2 ]
机构
[1] Yale Univ, New Haven, CT USA
[2] AWS AI Labs, Seattle, WA 98109 USA
[3] Amazon AGI, Seattle, WA USA
来源
关键词
Dynamic Prompt Generation; Continual Learning; Open-World Visual Representation Learning;
D O I
10.1007/978-3-031-72967-6_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The open world is inherently dynamic, characterized by ever-evolving concepts and distributions. Continual learning (CL) in this dynamic open-world environment presents a significant challenge in effectively generalizing to unseen test-time classes. To address this challenge, we introduce a new practical CL setting tailored for open-world visual representation learning. In this setting, subsequent data streams systematically introduce novel classes that are disjoint from those seen in previous training phases, while also remaining distinct from the unseen test classes. In response, we present Dynamic Prompt and Representation Learner (DPaRL), a simple yet effective Prompt-based CL (PCL) method. Our DPaRL learns to generate dynamic prompts for inference, as opposed to relying on a static prompt pool in previous PCL methods. In addition, DPaRL jointly learns dynamic prompt generation and discriminative representation at each training stage whereas prior PCL methods only refine the prompt learning throughout the process. Our experimental results demonstrate the superiority of our approach, surpassing state-of-the-art methods on well-established open-world image retrieval benchmarks by an average of 4.7% improvement in Recall@1 performance.
引用
收藏
页码:357 / 374
页数:18
相关论文
共 50 条
  • [31] Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception
    Hindel, Julia
    Cattaneo, Daniele
    Valada, Abhinav
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (02): : 1904 - 1911
  • [32] Open-World Graph Active Learning for Node Classification
    Xu, Hui
    Xiang, Liyao
    Ou, Junjie
    Weng, Yuting
    Wang, Xinbing
    Zhou, Chenghu
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (02)
  • [33] Contrastive Pseudo Learning for Open-World DeepFake Attribution
    Sun, Zhimin
    Chen, Shen
    Yao, Taiping
    Yin, Bangjie
    Yi, Ran
    Ding, Shouhong
    Ma, Lizhuang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20825 - 20835
  • [34] Modeling of Human Visual Attention in Multiparty Open-World Dialogues
    Stefanov, Kalin
    Salvi, Giampiero
    Kontogiorgos, Dimosthenis
    Kjellstrom, Hedvig
    Beskow, Jonas
    ACM TRANSACTIONS ON HUMAN-ROBOT INTERACTION, 2019, 8 (02)
  • [35] Representation-Based Completion of Knowledge Graph with Open-World Data
    Yue, Kun
    Wang, Jiahui
    Li, Xinbai
    Hu, Kuang
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 1 - 8
  • [36] Open-world Active Learning for Echocardiography View Classification
    Zamzmi, Ghada
    Oguguo, Tochi
    Rajaraman, Sivaramakrishnan
    Antani, Sameer
    MEDICAL IMAGING 2022: COMPUTER-AIDED DIAGNOSIS, 2022, 12033
  • [37] EventBind: Learning a Unified Representation to Bind Them All for Event-Based Open-World Understanding
    Zhou, Jiazhou
    Zheng, Xu
    Lyu, Yuanhuiyi
    Wang, Lin
    COMPUTER VISION - ECCV 2024, PT LXX, 2025, 15128 : 477 - 494
  • [38] NGC: A Unified Framework for Learning with Open-World Noisy Data
    Wu, Zhi-Fan
    Wei, Tong
    Jiang, Jianwen
    Mao, Chaojie
    Tang, Mingqian
    Li, Yu-Feng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 62 - 71
  • [39] Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models
    Tang, Lv
    Jiang, Peng-Tao
    Xiao, Haoke
    Li, Bo
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (01) : 1 - 15
  • [40] Open-World Relationship Prediction
    Wang, Jingchao
    Wang, Xinzhi
    Luo, Xiangfeng
    Qin, Wei
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 323 - 330