Collaborative Learning of Human and Computer: Supervised Actor-Critic based Collaboration Scheme

被引:0
|
作者
Devanga, Ashwin [1 ]
Yamauchi, Koichiro [2 ]
机构
[1] Indian Inst Technol Guwahati, Gauhati, India
[2] Chubu Univ, Ctr Engn, Kasugai, Aichi, Japan
关键词
Actor-Critic Model; Kernel Machine; Learning on a Budget; Super Neural Network; Colbagging; Supervised Learning; Reinforcement Learning; Collaborative Learning Scheme between Human and Learning Machine;
D O I
10.5220/0007568407940801
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent large-scale neural networks show a high performance to complex recognition tasks but to get such ability, it needs a huge number of learning samples and iterations to optimize it's internal parameters. However, under unknown environments, learning samples do not exist. In this paper, we aim to overcome this problem and help improve the learning capability of the system by sharing data between multiple systems. To accelerate the optimization speed, the novel system forms a collaboration with human and reinforcement learning neural network and for data sharing between systems to develop a super neural network.
引用
收藏
页码:794 / 801
页数:8
相关论文
共 50 条
  • [31] Sequence Alignment with Q-Learning Based on the Actor-Critic Model
    Li, Yarong
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)
  • [32] An actor-critic learning framework based on Lyapunov stability for automatic assembly
    Li, Xinwang
    Xiao, Juliang
    Cheng, Yu
    Liu, Haitao
    [J]. APPLIED INTELLIGENCE, 2023, 53 (04) : 4801 - 4812
  • [33] Variational value learning in advantage actor-critic reinforcement learning
    Zhang, Yaozhong
    Han, Jiaqi
    Hu, Xiaofang
    Dan, Shihao
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1955 - 1960
  • [34] An actor-critic learning framework based on Lyapunov stability for automatic assembly
    Xinwang Li
    Juliang Xiao
    Yu Cheng
    Haitao Liu
    [J]. Applied Intelligence, 2023, 53 : 4801 - 4812
  • [35] TRANSFER LEARNING BASED ON FORBIDDEN RULE SET IN ACTOR-CRITIC METHOD
    Takano, Toshiaki
    Takase, Haruhiko
    Kawanaka, Hiroharu
    Kita, Hidehiko
    Hayashi, Terumine
    Tsuruoka, Shinji
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2011, 7 (5B): : 2907 - 2917
  • [36] DAC: The Double Actor-Critic Architecture for Learning Options
    Zhang, Shangtong
    Whiteson, Shimon
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [37] Efficient Model Learning Methods for Actor-Critic Control
    Grondman, Ivo
    Vaandrager, Maarten
    Busoniu, Lucian
    Babuska, Robert
    Schuitema, Erik
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
  • [38] Merging with Extraction Method for Transfer Learning in Actor-Critic
    Takano, Toshiaki
    Takase, Haruhiko
    Kawanaka, Hiroharu
    Tsuruoka, Shinji
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (07) : 814 - 821
  • [39] Latent Context Based Soft Actor-Critic
    Pu, Yuan
    Wang, Shaochen
    Yao, Xin
    Li, Bin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [40] Reinforcement learning with actor-critic for knowledge graph reasoning
    Linli Zhang
    Dewei Li
    Yugeng Xi
    Shuai Jia
    [J]. Science China Information Sciences, 2020, 63