Collaborative Learning of Human and Computer: Supervised Actor-Critic based Collaboration Scheme

Cited by: 0

Authors:

Devanga, Ashwin [1]

Yamauchi, Koichiro [2]

Affiliations:

[1] Indian Inst Technol Guwahati, Gauhati, India

[2] Chubu Univ, Ctr Engn, Kasugai, Aichi, Japan

Source:

ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS | 2019

Keywords:

Actor-Critic Model; Kernel Machine; Learning on a Budget; Super Neural Network; Colbagging; Supervised Learning; Reinforcement Learning; Collaborative Learning Scheme between Human and Learning Machine;

DOI:

10.5220/0007568407940801

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent large-scale neural networks show high performance on complex recognition tasks, but to achieve this they need a huge number of learning samples and iterations to optimize their internal parameters. In unknown environments, however, learning samples do not exist. In this paper, we aim to overcome this problem and improve the learning capability of the system by sharing data between multiple systems. To accelerate optimization, the proposed system forms a collaboration between a human and a reinforcement learning neural network, and shares data between systems to develop a super neural network.


Pages: 794-801

Page count: 8

