RoMA: Robust Model Adaptation for Offline Model-based Optimization

Cited by: 0

Authors:

Yu, Sihyun [1]

Ahn, Sungsoo [2]

Song, Le [2, 3]

Shin, Jinwoo [1]

Affiliations:

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[2] Mohamed bin Zayed Univ Artificial Intelligence MB, Abu Dhabi, U Arab Emirates

[3] BioMap, Bidar, Karnataka, India

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021 / Vol. 34

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We consider the problem of searching for an input that maximizes a black-box objective function, given a static dataset of input-output queries. A popular approach to this problem is to maintain a proxy model, e.g., a deep neural network (DNN), that approximates the true objective function. Here, the main challenge is how to avoid adversarially optimized inputs during the search, i.e., inputs where the DNN highly overestimates the true objective function. To handle this issue, we propose a new framework, coined robust model adaptation (RoMA), based on gradient-based optimization of inputs over the DNN. Specifically, it consists of two steps: (a) a pre-training strategy to robustly train the proxy model and (b) a novel adaptation procedure that makes the proxy model's estimates robust for a specific set of candidate solutions. At a high level, our scheme utilizes a local smoothness prior to overcome the brittleness of the DNN. Experiments on various tasks show the effectiveness of RoMA compared with previous methods, obtaining state-of-the-art results, e.g., RoMA outperforms all previous methods on 4 out of 6 tasks and achieves runner-up results on the remaining tasks.
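
The search step named in the abstract, gradient-based optimization of inputs over a proxy model, can be illustrated with a minimal sketch. This is not the RoMA procedure itself: the robust pre-training and adaptation steps are omitted, and `toy_proxy`, `numerical_grad`, and `gradient_ascent` are hypothetical names introduced only for illustration. The proxy here is a hand-written concave function standing in for a trained DNN, and the gradient is estimated by central differences rather than backpropagation.

```python
from typing import Callable, List

Vector = List[float]


def numerical_grad(f: Callable[[Vector], float], x: Vector,
                   eps: float = 1e-5) -> Vector:
    """Central-difference estimate of the gradient of f at x."""
    grad = []
    for i in range(len(x)):
        hi = x[:i] + [x[i] + eps] + x[i + 1:]
        lo = x[:i] + [x[i] - eps] + x[i + 1:]
        grad.append((f(hi) - f(lo)) / (2 * eps))
    return grad


def gradient_ascent(proxy: Callable[[Vector], float], x0: Vector,
                    lr: float = 0.1, steps: int = 100) -> Vector:
    """Move the input uphill on the proxy model's prediction."""
    x = list(x0)
    for _ in range(steps):
        g = numerical_grad(proxy, x)
        x = [xi + lr * gi for xi, gi in zip(x, g)]
    return x


def toy_proxy(x: Vector) -> float:
    # Hypothetical stand-in for a trained proxy: concave, peak at (2, -1).
    return -((x[0] - 2.0) ** 2) - (x[1] + 1.0) ** 2


# Start the search from a candidate input, e.g. a point in the static dataset.
start = [0.0, 0.0]
best = gradient_ascent(toy_proxy, start)
print(best, toy_proxy(best))
```

With a real DNN proxy, this plain ascent is exactly where the difficulty described in the abstract arises: the search can drift to inputs where the proxy's prediction is far above the true objective, which is the failure mode the two RoMA steps are meant to address.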

Pages: 13

50 items in total

[1] MOPO: Model-based Offline Policy Optimization
Yu, Tianhe
Thomas, Garrett
Yu, Lantao
Ermon, Stefano
Zou, James
Levine, Sergey
Finn, Chelsea
Ma, Tengyu
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[2] Model-based Policy Optimization with Unsupervised Model Adaptation
Shen, Jian
Zhao, Han
Zhang, Weinan
Yu, Yong
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[3] Parallel-mentoring for Offline Model-based Optimization
Chen, Can
Beckham, Christopher
Liu, Zixuan
Liu, Xue
Pal, Christopher
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[4] COMBO: Conservative Offline Model-Based Policy Optimization
Yu, Tianhe
Kumar, Aviral
Rafailov, Rafael
Rajeswaran, Aravind
Levine, Sergey
Finn, Chelsea
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Adaptation Augmented Model-based Policy Optimization
Shen, Jian
Lai, Hang
Liu, Minghuan
Zhao, Han
Yu, Yong
Zhang, Weinan
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[6] Model-Based Offline Adaptive Policy Optimization with Episodic Memory
Cao, Hongye
Wei, Qianru
Zheng, Jiangbin
Shi, Yanqing
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 50 - 62
[7] Conservative Objective Models for Effective Offline Model-Based Optimization
Trabucco, Brandon
Kumar, Aviral
Geng, Xinyang
Levine, Sergey
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7368 - 7378
[8] ROMO: Retrieval-enhanced Offline Model-based Optimization
Chen, Mingcheng
Zhao, Haoran
Zhao, Yuxiang
Fan, Hulei
Gao, Hongqiao
Yu, Yong
Tian, Zheng
[J]. 2023 5TH INTERNATIONAL CONFERENCE ON DISTRIBUTED ARTIFICIAL INTELLIGENCE, DAI 2023, 2023,
[9] Model-Based Offline Policy Optimization with Distribution Correcting Regularization
Shen, Jian
Chen, Mingcheng
Zhang, Zhicheng
Yang, Zhengyu
Zhang, Weinan
Yu, Yong
[J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 174 - 189
[10] Bidirectional Learning for Offline Infinite-width Model-based Optimization
Chen, Can
Zhang, Yingxue
Fu, Jie
Liu, Xue
Coates, Mark
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,