Developing Real-Time Scheduling Policy by Deep Reinforcement Learning
被引:6
|
作者:
Bo, Zitong
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Univ Chinese Acad Sci, Beijing, Peoples R ChinaChinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Bo, Zitong
[1
,2
]
Qiao, Ying
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R ChinaChinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Qiao, Ying
[1
]
Leng, Chang
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R ChinaChinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Leng, Chang
[1
]
Wang, Hongan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R ChinaChinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Wang, Hongan
[1
]
Guo, Chaoping
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R ChinaChinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Guo, Chaoping
[1
]
Zhang, Shaohui
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Natl Speed Skating Oval Operat Co Ltd, Beijing, Peoples R ChinaChinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
Zhang, Shaohui
[3
]
机构:
[1] Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Beijing Natl Speed Skating Oval Operat Co Ltd, Beijing, Peoples R China
real-time scheduling;
reinforcement learning;
multiprocessor system;
deep neural network;
D O I:
10.1109/RTAS52030.2021.00019
中图分类号:
TP3 [计算技术、计算机技术];
学科分类号:
0812 ;
摘要:
Designing scheduling policies for multiprocessor real-time systems is challenging since the multiprocessor scheduling problem is NP-complete. The existing heuristics are customized policies that may achieve poor performance under some specific task loads. Thus, a new design pattern is needed to make the multiprocessor scheduling policies perform well under various task loads. In this paper, we investigate a new real-time scheduling policy based on reinforcement learning. For any given real-time task set, our policy can automatically derive a high performance by online learning. Specifically, we model the real-time scheduling process as a multi-agent cooperative game and propose multi-agent self-cooperative learning that overcomes the curse of dimensionality and credit assignment problems. Simulation results show that our approach can learn high-performance policies for various task/system models.
机构:
Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R ChinaTsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
Liu, Shaohuai
Liu, Jinbo
论文数: 0引用数: 0
h-index: 0
机构:
State Grid Corp China, Natl Power Dispatching & Control Ctr, Beijing 100031, Peoples R China
Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R ChinaTsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
Liu, Jinbo
Yang, Nan
论文数: 0引用数: 0
h-index: 0
机构:
China Elect Power Res Inst, Beijing 100192, Peoples R ChinaTsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
Yang, Nan
Huang, Yupeng
论文数: 0引用数: 0
h-index: 0
机构:
China Elect Power Res Inst, Beijing 100192, Peoples R ChinaTsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
Huang, Yupeng
Jiang, Qirong
论文数: 0引用数: 0
h-index: 0
机构:
Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R ChinaTsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
Jiang, Qirong
Gao, Yang
论文数: 0引用数: 0
h-index: 0
机构:
Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R ChinaTsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
机构:
Toronto Metropolitan Univ, Dept Mech & Ind Engn, Maintenance Res Lab, Reliability Risk & Maintenance Res Lab RRMR Lab, Toronto, ON, CanadaToronto Metropolitan Univ, Dept Mech & Ind Engn, Maintenance Res Lab, Reliability Risk & Maintenance Res Lab RRMR Lab, Toronto, ON, Canada
Namoura, Hamed A.
Sharifi, Mani
论文数: 0引用数: 0
h-index: 0
机构:
Miami Univ, Farmer Sch Business, Dept Informat Syst & Analyt, Oxford, OH USAToronto Metropolitan Univ, Dept Mech & Ind Engn, Maintenance Res Lab, Reliability Risk & Maintenance Res Lab RRMR Lab, Toronto, ON, Canada
Sharifi, Mani
Ghaleb, Mageed
论文数: 0引用数: 0
h-index: 0
机构:
Toronto Metropolitan Univ, Dept Mech & Ind Engn, Maintenance Res Lab, Reliability Risk & Maintenance Res Lab RRMR Lab, Toronto, ON, CanadaToronto Metropolitan Univ, Dept Mech & Ind Engn, Maintenance Res Lab, Reliability Risk & Maintenance Res Lab RRMR Lab, Toronto, ON, Canada
机构:
Harbin Inst Technol, Sch Environm, Harbin 150090, Peoples R ChinaHarbin Inst Technol, Sch Environm, Harbin 150090, Peoples R China
Hu, Shiyuan
Gao, Jinliang
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Sch Environm, Harbin 150090, Peoples R ChinaHarbin Inst Technol, Sch Environm, Harbin 150090, Peoples R China
Gao, Jinliang
Zhong, Dan
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Sch Environm, Harbin 150090, Peoples R ChinaHarbin Inst Technol, Sch Environm, Harbin 150090, Peoples R China
Zhong, Dan
Wu, Rui
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Yuehai Water Investment Co Ltd, Shenzhen 518021, Peoples R ChinaHarbin Inst Technol, Sch Environm, Harbin 150090, Peoples R China
Wu, Rui
Liu, Luming
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Natl Engn Res Ctr Urban Water Resources Co Ltd, Harbin 150090, Peoples R ChinaHarbin Inst Technol, Sch Environm, Harbin 150090, Peoples R China
机构:
Nanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R ChinaNanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R China
Chen, Jian
Zhang, Hanlei
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R ChinaNanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R China
Zhang, Hanlei
Ma, Wenjing
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R ChinaNanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R China
Ma, Wenjing
Xu, Gangyan
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Fac Engn, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R ChinaNanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing, Peoples R China