RoMA: Robust Model Adaptation for Offline Model-based Optimization

Authors
Yu, Sihyun [1]
Ahn, Sungsoo [2]
Song, Le [2, 3]
Shin, Jinwoo [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] Mohamed bin Zayed Univ Artificial Intelligence MB, Abu Dhabi, U Arab Emirates
[3] BioMap, Bidar, Karnataka, India
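Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract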
We consider the problem of searching for an input that maximizes a black-box objective function, given a static dataset of input-output queries. A popular approach to this problem is to maintain a proxy model, e.g., a deep neural network (DNN), that approximates the true objective function. Here, the main challenge is how to avoid adversarially optimized inputs during the search, i.e., inputs where the DNN highly overestimates the true objective function. To handle this issue, we propose a new framework, coined robust model adaptation (RoMA), based on gradient-based optimization of inputs over the DNN. Specifically, it consists of two steps: (a) a pre-training strategy to robustly train the proxy model and (b) a novel adaptation procedure that makes the proxy model's estimates robust for a specific set of candidate solutions. At a high level, our scheme utilizes a local smoothness prior to overcome the brittleness of the DNN. Experiments on various tasks show the effectiveness of RoMA compared with previous methods, obtaining state-of-the-art results, e.g., RoMA outperforms all previous methods on 4 out of 6 tasks and achieves runner-up results on the remaining tasks.
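The failure mode named in the abstract, where gradient-based search drives the input to a point at which the proxy overestimates the true objective, can be illustrated with a toy sketch. This is not RoMA and not the authors' code: the 1-D objective, the linear least-squares proxy (standing in for the DNN), the dataset, and every function name below are assumptions chosen only for illustration.

```python
def true_objective(x):
    """Hypothetical black-box objective (maximum at x = 2).

    In the offline setting it cannot be queried during the search; it is
    used here only to build the dataset and to evaluate the final input.
    """
    return -(x - 2.0) ** 2


# Static dataset of input-output queries, all to the left of the maximizer.
xs = [-1.0, 0.0, 0.5, 1.0]
ys = [true_objective(x) for x in xs]


def fit_linear_proxy(xs, ys):
    """Ordinary least squares for a 1-D linear proxy f(x) = w * x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    w = cov / var
    b = mean_y - w * mean_x
    return w, b


def gradient_ascent_on_input(w, x0, step_size=0.1, num_steps=50):
    """Naive gradient ascent on the input of the proxy.

    The gradient of w * x + b with respect to x is the constant w, so the
    input keeps moving in the same direction with no safeguard.
    """
    x = x0
    for _ in range(num_steps):
        x += step_size * w
    return x


w, b = fit_linear_proxy(xs, ys)
x_start = xs[ys.index(max(ys))]          # start from the best input in the dataset
x_found = gradient_ascent_on_input(w, x_start)

proxy_value = w * x_found + b            # what the proxy believes
true_value = true_objective(x_found)     # what the black box would return
overestimation = proxy_value - true_value

print(f"x_found={x_found:.3f} proxy={proxy_value:.3f} true={true_value:.3f}")
```

In this toy setting the search leaves the region covered by the dataset: the proxy value keeps rising while the true objective falls below the best value already present in the data. The abstract's local smoothness prior and adaptation step are aimed at this kind of gap; they are not implemented in the sketch.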
Pages: 13
Related papers
50 in total
  • [1] MOPO: Model-based Offline Policy Optimization
    Yu, Tianhe
    Thomas, Garrett
    Yu, Lantao
    Ermon, Stefano
    Zou, James
    Levine, Sergey
    Finn, Chelsea
    Ma, Tengyu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Model-based Policy Optimization with Unsupervised Model Adaptation
    Shen, Jian
    Zhao, Han
    Zhang, Weinan
    Yu, Yong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] Parallel-mentoring for Offline Model-based Optimization
    Chen, Can
    Beckham, Christopher
    Liu, Zixuan
    Liu, Xue
    Pal, Christopher
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] COMBO: Conservative Offline Model-Based Policy Optimization
    Yu, Tianhe
    Kumar, Aviral
    Rafailov, Rafael
    Rajeswaran, Aravind
    Levine, Sergey
    Finn, Chelsea
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Adaptation Augmented Model-based Policy Optimization
    Shen, Jian
    Lai, Hang
    Liu, Minghuan
    Zhao, Han
    Yu, Yong
    Zhang, Weinan
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [6] Model-Based Offline Adaptive Policy Optimization with Episodic Memory
    Cao, Hongye
    Wei, Qianru
    Zheng, Jiangbin
    Shi, Yanqing
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 50 - 62
  • [7] Conservative Objective Models for Effective Offline Model-Based Optimization
    Trabucco, Brandon
    Kumar, Aviral
    Geng, Xinyang
    Levine, Sergey
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7368 - 7378
  • [8] ROMO: Retrieval-enhanced Offline Model-based Optimization
    Chen, Mingcheng
    Zhao, Haoran
    Zhao, Yuxiang
    Fan, Hulei
    Gao, Hongqiao
    Yu, Yong
    Tian, Zheng
    [J]. 2023 5TH INTERNATIONAL CONFERENCE ON DISTRIBUTED ARTIFICIAL INTELLIGENCE, DAI 2023, 2023,
  • [9] Model-Based Offline Policy Optimization with Distribution Correcting Regularization
    Shen, Jian
    Chen, Mingcheng
    Zhang, Zhicheng
    Yang, Zhengyu
    Zhang, Weinan
    Yu, Yong
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 174 - 189
  • [10] Bidirectional Learning for Offline Infinite-width Model-based Optimization
    Chen, Can
    Zhang, Yingxue
    Fu, Jie
    Liu, Xue
    Coates, Mark
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,