Stochastic Enforced Hill-Climbing

被引：0

作者：

Wu, Jia-Hong ^{[1
]}

Kalyanam, Rajesh ^{[2
]}

Givan, Robert ^{[2
]}

机构：

[1] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan

[2] Purdue Univ, W Lafayette, IN 47907 USA

来源：

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH | 2011年 / 42卷

基金：

美国国家科学基金会;

关键词：

IGNORING DELETE LISTS; FF PLANNING SYSTEM; SEARCH;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Enforced hill-climbing is an effective deterministic hill-climbing technique that deals with local optima using breadth-first search (a process called "basin flooding"). We propose and evaluate a stochastic generalization of enforced hill-climbing for online use in goal-oriented probabilistic planning problems. We assume a provided heuristic function estimating expected cost to the goal with flaws such as local optima and plateaus that thwart straightforward greedy action choice. While breadth-first search is effective in exploring basins around local optima in deterministic problems, for stochastic problems we dynamically build and solve a heuristic-based Markov decision process (MDP) model of the basin in order to find a good escape policy exiting the local optimum. We note that building this model involves integrating the heuristic into the MDP problem because the local goal is to improve the heuristic. We evaluate our proposal in twenty-four recent probabilistic planning-competition benchmark domains and twelve probabilistically interesting problems from recent literature. For evaluation, we show that stochastic enforced hill-climbing (SEH) produces better policies than greedy heuristic following for value/cost functions derived in two very different ways: one type derived by using deterministic heuristics on a deterministic relaxation and a second type derived by automatic learning of Bellman-error features from domain-specific experience. Using the first type of heuristic, SEH is shown to generally outperform all planners from the first three international probabilistic planning competitions.

引用

页码：815 / 850

页数：36

共 50 条

[41] Hill-climbing Strategies on Various Landscapes: An Empirical Comparison
Basseur, Matthieu
Goeffon, Adrien
[J]. GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 479 - 486
[42] STABILITY AND SUBHARMONICS IN A SINUSOIDAL PERTURBATION HILL-CLIMBING SYSTEM
JAMES, DJG
[J]. INTERNATIONAL JOURNAL OF CONTROL, 1971, 13 (01) : 165 - &
[43] On the vulnerability of face verification systems to hill-climbing attacks
Galbally, Javier
McCool, Chris
Fierrez, Julian
Marcel, Sebastien
Ortega-Garcia, Javier
[J]. PATTERN RECOGNITION, 2010, 43 (03) : 1027 - 1038
[44] Late acceptance hill-climbing for high school timetabling
George H. G. Fonseca
Haroldo G. Santos
Eduardo G. Carrano
[J]. Journal of Scheduling, 2016, 19 : 453 - 465
[45] Late acceptance hill-climbing for high school timetabling
Fonseca, George H. G.
Santos, Haroldo G.
Carrano, Eduardo G.
[J]. JOURNAL OF SCHEDULING, 2016, 19 (04) : 453 - 465
[46] RESIDENTIAL SEGREGATION BY HILL-CLIMBING AGENTS ON THE POTENTIAL LANDSCAPE
Shin, Jae Kyun
Fossett, Mark
[J]. ADVANCES IN COMPLEX SYSTEMS, 2008, 11 (06): : 875 - 899
[47] Fast forward planning by guided enforced hill climbing
Akramifar, S. A.
Ghassem-Sani, G.
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (08) : 1327 - 1339
[48] Improvement of Hill-Climbing Method by Constraining Voltage Operating Point
Tan, Chin-Yew
Abd Rahim, Nasrudin
Selvaraj, Jeyraj
[J]. 2013 IEEE CONFERENCE ON CLEAN ENERGY AND TECHNOLOGY (CEAT), 2013, : 108 - 113
[49] A fast hill-climbing algorithm for Bayesian networks structure learning
Gamez, Jose A.
Mateo, Juan L.
Puerta, Jose M.
[J]. SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, PROCEEDINGS, 2007, 4724 : 585 - +
[50] METHOD OF ANALYSIS OF THE ACCURACY OF HILL-CLIMBING CONTROL-SYSTEMS
GNOENSKII, LS
RAFAELYAN, RS
[J]. AUTOMATION AND REMOTE CONTROL, 1980, 41 (07) : 963 - 966

← 1 2 3 4 5 →