Dynamic single machine scheduling using Q-learning agent

被引：0

作者：

Kong, LF ^{[1
]}

Wu, J ^{[1
]}

机构：

[1] S China Univ Technol, Coll Elect Power Engn, Guangzhou 510640, Peoples R China

来源：

Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9 | 2005年

关键词：

Q-learning; single machine scheduling; intelligent Agent; dispatching rule; simulated annealing;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Single machine scheduling methods have attracted a lot of attentions in recent years. Most dynamic single machine scheduling problems in practice have been addressed using dispatching rules. However, no single dispatching rule has been found to perform well for all important criteria, and no rule takes into account the status or the other resources of system's environment. In this research, an intelligent Agent-based single machine scheduling system is proposed, where the Agent is trained by a new improved Q-learning algorithm. In such scheduling system, Agent selects one of appropriate dispatching rules for machine based on available information. The Agent was trained by a new simulated annealing-based Q-learning algorithm. The simulation results show that the simulated annealing-based Q-learning Agent is able to learn to select the best dispatching rule for different system objectives. The results also indicate that simulated annealing-based Q-learning Agent could perform well for all criteria, which is impossible when using only one dispatching rule independently.

引用

页码：3237 / 3241

页数：5

共 50 条

[1] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
Zhicong Zhang
Li Zheng
Michael X. Weng
The International Journal of Advanced Manufacturing Technology, 2007, 34 : 968 - 980
[2] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
Zhang, Zhicong
Zheng, Li
Weng, Michael X.
International Journal of Advanced Manufacturing Technology, 2007, 34 (9-10): : 968 - 980
[3] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
Zhang, Zhicong
Zheng, Li
Weng, Michael X.
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10): : 968 - 980
[4] Dynamic Parallel Machine Scheduling Using the Learning Agent
Yuan, Biao
Wang, Lei
Jiang, Zhibin
2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013, : 1565 - 1569
[5] Dynamic scheduling with fuzzy clustering based Q-learning
Wang, Guo-Lei
Lin, Lin
Zhong, Shi-Sheng
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2009, 15 (04): : 751 - 757
[6] Dynamic parallel machine scheduling with random breakdowns using the learning agent
Yuan B.
Jiang Z.
Wang L.
Jiang, Zhibin (zbjiang@sjtu.edu.cn), 2016, Inderscience Enterprises Ltd. (08) : 94 - 103
[7] Pricing in agent economies using multi-agent Q-learning
Tesauro, G
Kephart, JO
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2002, 5 (03) : 289 - 304
[8] Pricing in Agent Economies Using Multi-Agent Q-Learning
Gerald Tesauro
Jeffrey O. Kephart
Autonomous Agents and Multi-Agent Systems, 2002, 5 : 289 - 304
[9] Clustering state membership-based Q-learning for dynamic scheduling
Wang, Guolei
Zhong, Shisheng
Lin, Lin
Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (04): : 428 - 433
[10] Mounting of auction agent under dynamic environment by Q-learning and SARSA learning
Katou, T
Nagasaka, K
7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 472 - 475

← 1 2 3 4 5 →