Automating the evaluation of planning systems

被引:7
|
作者
Linares Lopez, Carlos [1 ]
Jimenez, Sergio [1 ]
Helmert, Malte [2 ]
机构
[1] Univ Carlos III Madrid, Dept Comp Sci, Madrid, Spain
[2] Univ Basel, Dept Math & Comp Sci, Basel, Switzerland
关键词
Automated planning; evaluation; competition; COMPETITION; ALGORITHMS;
D O I
10.3233/AIC-130572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in automated planning is getting more and more focused on empirical evaluation. Likewise the need for methodologies and benchmarks to build solid evaluations of planners is increasing. In 1998 the planning community made a move to address this need and initiated the International Planning Competition - or IPC for short. This competition has typically been conducted every two years in the context of the International Conference on Automated Planning and Scheduling (ICAPS) and tries to define standard metrics and benchmarks to reliably evaluate planners. In the sixth edition of the competition, IPC 2008, there was an attempt to automate the evaluation of all entries in the competition which was imitated to a large extent and extended in several ways in the seventh edition, IPC 2011. As a result, a software for automatically running planning experiments and inspecting the results is available, encouraging researchers to use it for their own research interests. The software allows researchers to reproduce and inspect the results of IPC 2011, but also to generate and analyze new experiments with private sets of planners and problems. In this paper we provide a gentle introduction to this software and examine the main difficulties, both from a scientific and engineering point of view, in assessing the performance of automated planners.
引用
收藏
页码:331 / 354
页数:24
相关论文
共 50 条
  • [11] Automating deployment planning with an aspect weaver
    White, J.
    Schmidt, D. C.
    IET SOFTWARE, 2009, 3 (03) : 167 - 183
  • [12] Automating usability evaluation
    Smith, GB
    Howes, A
    ENGINEERING PSYCHOLOGY AND COGNITIVE ERGONOMICS VOLUME SIX: INDUSTRIAL ERGONOMICS, HCI, AND APPLIED COGNITIVE PSYCHOLOGY, 2001, : 79 - 86
  • [13] Automating the Evaluation of Trustworthiness
    Sel, Marc
    Mitchell, Chris J.
    TRUST, PRIVACY AND SECURITY IN DIGITAL BUSINESS (TRUSTBUS 2021), 2021, 12927 : 18 - 31
  • [14] ASPECTS OF AUTOMATING PERFORMANCE EVALUATION FOR MAN-MACHINE SYSTEMS.
    Bezbogov, A.A.
    Kibernetika i Vychislitel'naya Tekhnika, 1983, (61): : 105 - 112
  • [15] AUTOMATING SYSTEMS ENGINEERING
    Kuhn, Dorothy A.
    INCOSE International Symposium, 1994, 4 (01) : 364 - 370
  • [16] Systems for automating monitoring
    Naumov, V.L.
    Ryzhov, V.Yu.
    Ol'khovoj, S.L.
    Koks i Khimiya, 2001, (12): : 39 - 42
  • [17] AUTOMATING QUALITY SYSTEMS
    TANNOCK, JDT
    WORT, RG
    SAVAGE, BM
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 1990, 204 (04) : 231 - 236
  • [18] Evaluation of a novel algorithm for automating virtual surgical planning in mandibular reconstruction using fibula flaps
    Modabber, Ali
    Rauen, Alexandra
    Ayoub, Nassim
    Moehlhenrich, Stephan Christian
    Peters, Florian
    Kniha, Kristian
    Hoelzle, Frank
    Raith, Stefan
    JOURNAL OF CRANIO-MAXILLOFACIAL SURGERY, 2019, 47 (09) : 1378 - 1386
  • [19] Planning and Evaluation of Digital Assistance Systems
    Hold, Philipp
    Erol, Selim
    Reisinger, Gehard
    Sihn, Wilfried
    7TH CONFERENCE ON LEARNING FACTORIES (CLF 2017), 2017, 9 : 143 - 150
  • [20] ANALYTICAL EVALUATION OF HIERARCHICAL PLANNING SYSTEMS
    DEMPSTER, MAH
    FISHER, ML
    JANSEN, L
    LAGEWEG, BJ
    LENSTRA, JK
    KAN, AHGR
    OPERATIONS RESEARCH, 1981, 29 (04) : 707 - 716