Monte-Carlo-based partially observable Markov decision process approximations for adaptive sensing

被引:18
|
作者
Chong, Edwin K. P. [1 ,3 ]
Kreucher, Christopher M. [2 ]
Hero, Alfred O., III [3 ]
机构
[1] Colorado State Univ, Ft Collins, CO 80523 USA
[2] Integr Applicat Incorp, Ann Arbor, MI USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
D O I
10.1109/WODES.2008.4605941
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive sensing involves actively managing sensor resources to achieve a sensing task, such as object detection, classification, and tracking, and represents a promising direction for new applications of discrete event system methods. We describe an approach to adaptive sensing based on approximately solving a partially observable Markov decision process (POMDP) formulation of the problem. Such approximations are necessary because of the very large state space involved in practical adaptive sensing problems, precluding exact computation of optimal solutions. We review the theory of POMDPs and show how the theory applies to adaptive sensing problems. We then describe Monte-Carlo-based approximation methods, with an example to illustrate their application in adaptive sensing. The example also demonstrates the gains that are possible from nonmyopic methods relative to myopic methods.
引用
收藏
页码:173 / +
页数:2
相关论文
共 50 条
  • [1] Partially Observable Markov Decision Process Approximations for Adaptive Sensing
    Edwin K. P. Chong
    Christopher M. Kreucher
    Alfred O. Hero
    Discrete Event Dynamic Systems, 2009, 19 : 377 - 422
  • [2] Partially Observable Markov Decision Process Approximations for Adaptive Sensing
    Chong, Edwin K. P.
    Kreucher, Christopher M.
    Hero, Alfred O., III
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2009, 19 (03): : 377 - 422
  • [3] Robust partially observable Markov decision process
    Osogami, Takayuki
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 106 - 115
  • [4] Parameter decision in adaptive partially observable Markov decision process with finite planning horizon
    Li, J.H.
    Han, Z.Z.
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2000, 34 (12): : 1653 - 1657
  • [5] ON THE ADAPTIVE-CONTROL OF A PARTIALLY OBSERVABLE BINARY MARKOV DECISION-PROCESS
    FERNANDEZGAUCHERAND, E
    ARAPOSTATHIS, A
    MARCUS, SI
    ADVANCES IN COMPUTING AND CONTROL, 1989, 130 : 217 - 229
  • [6] ON THE ADAPTIVE-CONTROL OF A PARTIALLY OBSERVABLE BINARY MARKOV DECISION-PROCESS
    FERNANDEZGAUCHERAND, E
    ARAPOSTATHIS, A
    MARCUS, SI
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1989, 130 : 217 - 229
  • [7] Autonomous Thermalling as a Partially Observable Markov Decision Process
    Guilliard, Iain
    Rogahn, Richard J.
    Piavis, Jim
    Kolobov, Andrey
    ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [8] Value-Function Approximations for Partially Observable Markov Decision Processes
    Hauskrecht, Milos
    Journal of Artificial Intelligence Research, 2001, 13 (00): : 33 - 94
  • [9] Value-function approximations for partially observable Markov decision processes
    Hauskrecht, M
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 : 33 - 94
  • [10] A Partially Observable Markov-Decision-Process-Based Blackboard Architecture for Cognitive Agents in Partially Observable Environments
    Itoh, Hideaki
    Nakano, Hidehiko
    Tokushima, Ryota
    Fukumoto, Hisao
    Wakuya, Hiroshi
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (01) : 189 - 204