Automating the evaluation of planning systems

被引:7
|
作者
Linares Lopez, Carlos [1 ]
Jimenez, Sergio [1 ]
Helmert, Malte [2 ]
机构
[1] Univ Carlos III Madrid, Dept Comp Sci, Madrid, Spain
[2] Univ Basel, Dept Math & Comp Sci, Basel, Switzerland
关键词
Automated planning; evaluation; competition; COMPETITION; ALGORITHMS;
D O I
10.3233/AIC-130572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research in automated planning is getting more and more focused on empirical evaluation. Likewise the need for methodologies and benchmarks to build solid evaluations of planners is increasing. In 1998 the planning community made a move to address this need and initiated the International Planning Competition - or IPC for short. This competition has typically been conducted every two years in the context of the International Conference on Automated Planning and Scheduling (ICAPS) and tries to define standard metrics and benchmarks to reliably evaluate planners. In the sixth edition of the competition, IPC 2008, there was an attempt to automate the evaluation of all entries in the competition which was imitated to a large extent and extended in several ways in the seventh edition, IPC 2011. As a result, a software for automatically running planning experiments and inspecting the results is available, encouraging researchers to use it for their own research interests. The software allows researchers to reproduce and inspect the results of IPC 2011, but also to generate and analyze new experiments with private sets of planners and problems. In this paper we provide a gentle introduction to this software and examine the main difficulties, both from a scientific and engineering point of view, in assessing the performance of automated planners.
引用
收藏
页码:331 / 354
页数:24
相关论文
共 50 条
  • [1] Automating Human Evaluation of Dialogue Systems
    Reddy, Sujan
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 229 - 234
  • [2] Automating the Architecture Evaluation of Enterprise Information Systems
    Pinto, Felipe
    Kulesza, Uira
    Guerra, Eduardo
    ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 3, 2013, : 333 - 340
  • [3] HIERARCHICAL COMPUTER CONTROL SYSTEMS: AUTOMATING THE PLANNING PROCESS.
    O'Hara, Daniel J.
    Control engineering, 1984, 31 (09) : 156 - 158
  • [4] Automating the Evaluation of Interoperability Effectiveness in Heterogeneous IoT Systems
    Bouloukakis, Georgios
    Georgantas, Nikolaos
    Kattepur, Ajay
    Hassan, Houssam Hajj
    Issarny, Valerie
    IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE ARCHITECTURE, ICSA 2024, 2024, : 58 - 68
  • [5] Engineering Assistance of Building Automation Systems - Automating Planning, Design and Comissioning
    Ploennigs, Joern
    Ryssel, Uwe
    Dibowski, Henrik
    Lehmann, Matthias
    Kabitzsch, Klaus
    ATP EDITION, 2012, (09): : 28 - 35
  • [6] Automating the communications planning process
    Shirey, CL
    PROCEEDINGS OF THE 1996 TACTICAL COMMUNICATIONS CONFERENCE: ENSURING JOINT FORCE SUPERIORITY IN THE INFORMATION AGE, 1996, : 357 - 364
  • [7] AUTOMATING ACQUISITIONS - THE PLANNING PROCESS
    BRYANT, B
    LIBRARY RESOURCES & TECHNICAL SERVICES, 1984, 28 (04): : 285 - 298
  • [8] AUTOMATING MANUFACTURABILITY EVALUATION IN CAD SYSTEMS THROUGH EXPERT-SYSTEMS APPROACHES
    VENKATACHALAM, AR
    EXPERT SYSTEMS WITH APPLICATIONS, 1994, 7 (04) : 495 - 506
  • [9] Automating QoS and QoE Evaluation of HTTP Adaptive Streaming Systems
    Christian Timmerer
    Anatoliy Zabrovskiy
    ZTECommunications, 2019, 17 (01) : 18 - 24
  • [10] Automating treatment planning system QA
    Kirk, M.
    Gong, X.
    Chu, J.
    MEDICAL PHYSICS, 2007, 34 (06) : 2422 - 2422