Empirical analysis of an on-line adaptive system using a mixture of Bayesian networks

Cited by: 9
Authors
Kitakoshi, Daisuke [1 ]
Shioya, Hiroyuki [2 ]
Nakano, Ryohei [1 ]
Affiliations
[1] Nagoya Inst Technol, Grad Sch Engn, Showa Ku, Nagoya, Aichi 4668555, Japan
[2] Muroran Inst Technol, Mizumoto, Muroran 0508585, Japan
Funding
Japan Society for the Promotion of Science;
Keywords
Adaptation to dynamic environments; Mixture of Bayesian networks; Reinforcement learning; Profit sharing; MODEL; PERFORMANCE;
DOI
10.1016/j.ins.2010.04.001
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
An on-line reinforcement learning system that adapts to environmental changes using a mixture of Bayesian networks is described. Building intelligent systems able to adapt to dynamic environments is important for deploying real-world applications. Machine learning approaches, such as those using reinforcement learning methods and stochastic models, have been used to acquire behavior appropriate to environments characterized by uncertainty. However, efficient hybrid architectures based on these approaches have not yet been developed. The results of several experiments demonstrated that an agent using the proposed system can flexibly adapt to various kinds of environmental changes. (C) 2010 Elsevier Inc. All rights reserved.
Pages: 2856-2874
Page count: 19
Related papers
50 in total
  • [41] Adaptive back-propagation in on-line learning of multilayer networks
    West, AHL
    Saad, D
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 323 - 329
  • [42] Building adaptive tests using Bayesian networks
    Vomlel, J
    [J]. KYBERNETIKA, 2004, 40 (03) : 333 - 348
  • [43] Using Bayesian belief networks in adaptive management
    Nyberg, J. Brian
    Marcot, Bruce G.
    Sulyma, Randy
    [J]. CANADIAN JOURNAL OF FOREST RESEARCH-REVUE CANADIENNE DE RECHERCHE FORESTIERE, 2006, 36 (12): : 3104 - 3116
  • [44] Assessing Production Line Risk using Bayesian Belief Networks and System Dynamics
    Punyamurthula, Sudhir
    Badurdeen, Fazleena
    [J]. 46TH SME NORTH AMERICAN MANUFACTURING RESEARCH CONFERENCE, NAMRC 46, 2018, 26 : 76 - 86
  • [45] Non-linear system on-line identification using dynamic neural networks
    Pozniak, AS
    Sanchez, EN
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 1999, 5 (03): : 201 - 209
  • [46] On-line adaptive planning system for prostate IMRT treatment
    Thengphiew, D.
    Wu, Q.
    Wang, Z.
    Yoo, S.
    Lee, W. R.
    Vujaskovic, Z.
    Yin, F.
    [J]. INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2007, 69 (03): : S20 - S21
  • [47] On-line adaptive fuzzy diagnosis system: Fusion and supervision
    Boudaoud, AN
    Masson, MH
    [J]. (SAFEPROCESS'97): FAULT DETECTION, SUPERVISION AND SAFETY FOR TECHNICAL PROCESSES 1997, VOLS 1-3, 1998, : 1195 - 1200
  • [48] A Customized Plan Evaluation System for On-Line Adaptive Radiotherapy
    Jiang, L.
    Arhjoul, L.
    Anderson, J.
    Nedzi, L.
    Solberg, T.
    Mao, W.
    [J]. MEDICAL PHYSICS, 2012, 39 (06) : 3832 - 3832
  • [49] A simple and adaptive on-line path planning system for a UAV
    Ducard, G.
    Kulling, K. C.
    Geering, H. P.
    [J]. 2007 MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-4, 2007, : 1854 - +
  • [50] On-line adaptive controller system used on small UAV
    Gao Tongyue
    Tang Rui
    Rao Junjin
    Luo Jun
    Gong Zhenbang
    [J]. 2012 INTERNATIONAL SYMPOSIUM ON SAFETY SCIENCE AND TECHNOLOGY, 2012, 45 : 980 - 985