Collaborative Learning of Human and Computer: Supervised Actor-Critic based Collaboration Scheme

被引:0
|
作者
Devanga, Ashwin [1 ]
Yamauchi, Koichiro [2 ]
机构
[1] Indian Inst Technol Guwahati, Gauhati, India
[2] Chubu Univ, Ctr Engn, Kasugai, Aichi, Japan
关键词
Actor-Critic Model; Kernel Machine; Learning on a Budget; Super Neural Network; Colbagging; Supervised Learning; Reinforcement Learning; Collaborative Learning Scheme between Human and Learning Machine;
D O I
10.5220/0007568407940801
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent large-scale neural networks show a high performance to complex recognition tasks but to get such ability, it needs a huge number of learning samples and iterations to optimize it's internal parameters. However, under unknown environments, learning samples do not exist. In this paper, we aim to overcome this problem and help improve the learning capability of the system by sharing data between multiple systems. To accelerate the optimization speed, the novel system forms a collaboration with human and reinforcement learning neural network and for data sharing between systems to develop a super neural network.
引用
收藏
页码:794 / 801
页数:8
相关论文
共 50 条
  • [1] Actor-Critic based Improper Reinforcement Learning
    Zaki, Mohammadi
    Mohan, Avinash
    Gopalan, Aditya
    Mannor, Shie
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [2] An actor-critic strategy for a safe and efficient human robot collaboration
    Gabrielli, Guglielmo
    Secchi, Cristian
    [J]. 2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 919 - 926
  • [3] Dynamic Charging Scheme Problem With Actor-Critic Reinforcement Learning
    Yang, Meiyi
    Liu, Nianbo
    Zuo, Lin
    Feng, Yong
    Liu, Minghui
    Gong, Haigang
    Liu, Ming
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (01): : 370 - 380
  • [4] Design and Implementation of an Adaptive Cruise Control System Based on Supervised Actor-Critic Learning
    Wang, Bin
    Zhao, Dongbin
    Li, Chengdong
    Dai, Yujie
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2015, : 243 - 248
  • [5] Supervised Advantage Actor-Critic for Recommender Systems
    Xin, Xin
    Karatzoglou, Alexandros
    Arapakis, Ioannis
    Jose, Joemon M.
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1186 - 1196
  • [6] Supervised actor-critic reinforcement learning with action feedback for algorithmic trading
    Qizhou Sun
    Yain-Whar Si
    [J]. Applied Intelligence, 2023, 53 : 16875 - 16892
  • [7] Supervised actor-critic reinforcement learning with action feedback for algorithmic trading
    Sun, Qizhou
    Si, Yain-Whar
    [J]. APPLIED INTELLIGENCE, 2023, 53 (13) : 16875 - 16892
  • [8] Actor-Critic Learning Based on Adaptive Importance Sampling
    Cheng Yuhu
    Feng Huanting
    Wang Xuesong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (04) : 583 - 588
  • [9] Actor-critic learning based on fuzzy inference system
    Jouffe, L
    [J]. INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 339 - 344
  • [10] Granular computing in actor-critic learning
    Peters, James F.
    [J]. 2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 59 - 64