Adaptive Sampling using POMDPs with Domain-Specific Considerations

被引:3
|
作者
Salhotra, Gautam [1 ]
Denniston, Christopher E. [1 ]
Caron, David A. [1 ]
Sukhatme, Gaurav S. [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
D O I
10.1109/ICRA48506.2021.9561319
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate improving Monte Carlo Tree Search based solvers for Partially Observable Markov Decision Processes (POMDPs), when applied to adaptive sampling problems. We propose improvements in rollout allocation, the action exploration algorithm, and plan commitment. The first allocates a different number of rollouts depending on how many actions the agent has taken in an episode. We find that rollouts are more valuable after some initial information is gained about the environment. Thus, a linear increase in the number of rollouts, i.e. allocating a fixed number at each step, is not appropriate for adaptive sampling tasks. The second alters which actions the agent chooses to explore when building the planning tree. We find that by using knowledge of the number of rollouts allocated, the agent can more effectively choose actions to explore. The third improvement is in determining how many actions the agent should take from one plan. Typically, an agent will plan to take the first action from the planning tree and then call the planner again from the new state. Using statistical techniques, we show that it is possible to greatly reduce the number of rollouts by increasing the number of actions taken from a single planning tree without affecting the agent's final reward. Finally, we demonstrate experimentally, on simulated and real aquatic data from an underwater robot, that these improvements can be combined, leading to better adaptive sampling. The code for this work is available at https://github com/uscresl/AdaptiveSamplingPOMCP.
引用
收藏
页码:2385 / 2391
页数:7
相关论文
共 50 条
  • [21] Adaptive Sampling and Actuation for POMDPs: Application to Precision Agriculture
    Antunes, D. J.
    Beumer, R. M.
    De Molengraft, M. J. G. Van
    Heemels, W. P. M. H.
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 2399 - 2404
  • [22] Domain-Specific Profiling
    Bergel, Alexandre
    Nierstrasz, Oscar
    Renggli, Lukas
    Ressia, Jorge
    OBJECTS, MODELS, COMPONENTS, PATTERNS, TOOLS 2011, 2011, 6705 : 68 - 82
  • [23] Domain-Specific Greed
    Weiss, Martin
    Schulze, Julian
    Krumm, Stefan
    Goeritz, Anja S. S.
    Hewig, Johannes
    Mussel, Patrick
    PERSONALITY AND SOCIAL PSYCHOLOGY BULLETIN, 2024, 50 (06) : 889 - 905
  • [24] Untangling Crosscutting Concerns in Domain-specific Languages with Domain-specific Join Points
    Dinkelaker, Tom
    Monperrus, Martin
    Mezini, Mira
    DSAL09: DOMAIN-SPECIFIC ASPECT LANGUAGES, 2009, : 1 - 5
  • [25] A Domain-Specific Compiler for a Parallel Multiresolution Adaptive Numerical Simulation Environment
    Rajbhandari, Samyam
    Kim, Jinsung
    Krishnamoorthy, Sriram
    Pouchet, Louis-Noel
    Rastello, Fabrice
    Harrison, Robert J.
    Sadayappan, P.
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 468 - 479
  • [26] An adaptive middleware design to support the dynamic interpretation of domain-specific models
    Morris, Karl A.
    Allison, Mark
    Costa, Fabio M.
    Wei, Jinpeng
    Clarke, Peter J.
    INFORMATION AND SOFTWARE TECHNOLOGY, 2015, 62 : 21 - 41
  • [27] Unsupervised Domain Adaptation for Event Detection using Domain-specific Adapters
    Nghia Ngo Trung
    Duy Phung
    Thien Huu Nguyen
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4015 - 4025
  • [28] Domain-Specific Languages and Code Synthesis Using Haskell
    Gill, Andy
    COMMUNICATIONS OF THE ACM, 2014, 57 (06) : 42 - 49
  • [29] Identification of domain-specific euphemistic tweets using clustering
    Devi M.D.
    Saharia N.
    International Journal of Information Technology, 2024, 16 (1) : 21 - 31
  • [30] Using Domain-Specific Languages For Analytic Graph Databases
    Sevenich, Martin
    Hong, Sungpack
    van Rest, Oskar
    Wu, Zhe
    Banerjee, Jayanta
    Chafi, Hassan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1257 - 1268