Dynamic single machine scheduling using Q-learning agent

Cited by: 0
Authors
Kong, LF [1]
Wu, J [1]
Affiliations
[1] S China Univ Technol, Coll Elect Power Engn, Guangzhou 510640, Peoples R China
Keywords
Q-learning; single machine scheduling; intelligent agent; dispatching rule; simulated annealing
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Single machine scheduling methods have attracted considerable attention in recent years. In practice, most dynamic single machine scheduling problems have been addressed using dispatching rules. However, no single dispatching rule has been found to perform well on all important criteria, and no rule takes into account the status of the system's environment or its other resources. This research proposes an intelligent agent-based single machine scheduling system in which the agent is trained by a new simulated annealing-based Q-learning algorithm. In this system, the agent selects an appropriate dispatching rule for the machine based on the available information. Simulation results show that the simulated annealing-based Q-learning agent learns to select the best dispatching rule for different system objectives. The results also indicate that the agent can perform well on all criteria, which is impossible when a single dispatching rule is used on its own.
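
The abstract does not give the algorithm's details, so the following is only a minimal sketch of how an agent might learn to choose among dispatching rules with a simulated annealing-style exploration step. Everything concrete here is an assumption for illustration: the three rules (FIFO, SPT, EDD), the queue-length state, the negative-tardiness reward, the Metropolis acceptance test, and all names and parameter values (`sa_select`, `run_episode`, `train`, `t0`, `cooling`). It is not the authors' implementation.

```python
import math
import random

# Hypothetical dispatching rules. A job is (arrival_time, processing_time, due_date);
# each rule returns the index of the job to process next.
RULES = {
    "FIFO": lambda q: min(range(len(q)), key=lambda i: q[i][0]),
    "SPT": lambda q: min(range(len(q)), key=lambda i: q[i][1]),
    "EDD": lambda q: min(range(len(q)), key=lambda i: q[i][2]),
}
ACTIONS = list(RULES)


def state_of(queue):
    # Assumed state: queue length, capped at 5 (states 0..5).
    return min(len(queue), 5)


def sa_select(Q, s, temperature, rng):
    # Metropolis-style choice: accept a random rule over the greedy one
    # with probability exp((Q_random - Q_greedy) / temperature).
    greedy = max(ACTIONS, key=lambda a: Q[(s, a)])
    candidate = rng.choice(ACTIONS)
    delta = Q[(s, candidate)] - Q[(s, greedy)]  # always <= 0
    if rng.random() < math.exp(delta / temperature):
        return candidate
    return greedy


def run_episode(Q, temperature, rng, n_jobs=50, alpha=0.1, gamma=0.9):
    # Generate a random stream of jobs (illustrative distributions).
    t, jobs = 0.0, []
    for _ in range(n_jobs):
        t += rng.expovariate(1 / 4.0)
        p = rng.uniform(1, 6)
        jobs.append((t, p, t + 2.0 * p + rng.uniform(0, 10)))

    clock, queue, idx, total_tardiness = 0.0, [], 0, 0.0
    while idx < n_jobs or queue:
        while idx < n_jobs and jobs[idx][0] <= clock:
            queue.append(jobs[idx])
            idx += 1
        if not queue:
            clock = jobs[idx][0]  # machine idles until the next arrival
            continue
        s = state_of(queue)
        a = sa_select(Q, s, temperature, rng)
        job = queue.pop(RULES[a](queue))
        clock += job[1]
        tardiness = max(0.0, clock - job[2])
        total_tardiness += tardiness
        # Admit jobs that arrived during processing before observing the next state.
        while idx < n_jobs and jobs[idx][0] <= clock:
            queue.append(jobs[idx])
            idx += 1
        s_next = state_of(queue)
        best_next = max(Q[(s_next, b)] for b in ACTIONS)
        # Standard one-step Q-learning update with reward = -tardiness.
        Q[(s, a)] += alpha * (-tardiness + gamma * best_next - Q[(s, a)])
    return total_tardiness / n_jobs


def train(episodes=200, t0=10.0, cooling=0.95, seed=0):
    rng = random.Random(seed)
    Q = {(s, a): 0.0 for s in range(6) for a in ACTIONS}
    temperature, history = t0, []
    for _ in range(episodes):
        history.append(run_episode(Q, temperature, rng))
        temperature *= cooling  # geometric cooling: less exploration over time
    return Q, history
```

Calling `Q, history = train()` returns the learned table and the mean tardiness per episode. Changing the reward (for example to negative flow time) is how this sketch would target a different system objective.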
Pages: 3237-3241
Page count: 5
Related papers
50 in total
  • [1] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
    Zhang, Zhicong
    Zheng, Li
    Weng, Michael X.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10): 968-980
  • [2] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
    Zhicong Zhang
    Li Zheng
    Michael X. Weng
    The International Journal of Advanced Manufacturing Technology, 2007, 34: 968-980
  • [3] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
    Zhang, Zhicong
    Zheng, Li
    Weng, Michael X.
    International Journal of Advanced Manufacturing Technology, 2007, 34 (9-10): 968-980
  • [4] Dynamic Parallel Machine Scheduling Using the Learning Agent
    Yuan, Biao
    Wang, Lei
    Jiang, Zhibin
    2013 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM 2013), 2013: 1565-1569
  • [5] Dynamic scheduling with fuzzy clustering based Q-learning
    Wang, Guo-Lei
    Lin, Lin
    Zhong, Shi-Sheng
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2009, 15 (04): 751-757
  • [6] Dynamic parallel machine scheduling with random breakdowns using the learning agent
    Yuan B.
    Jiang Z.
    Wang L.
    Jiang, Zhibin (zbjiang@sjtu.edu.cn), 2016, Inderscience Enterprises Ltd. (08): 94-103
  • [7] Pricing in agent economies using multi-agent Q-learning
    Tesauro, G
    Kephart, JO
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2002, 5 (03): 289-304
  • [8] Pricing in Agent Economies Using Multi-Agent Q-Learning
    Gerald Tesauro
    Jeffrey O. Kephart
    Autonomous Agents and Multi-Agent Systems, 2002, 5: 289-304
  • [9] Mounting of auction agent under dynamic environment by Q-learning and SARSA learning
    Katou, T
    Nagasaka, K
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003: 472-475
  • [10] Clustering state membership-based Q-learning for dynamic scheduling
    Wang, Guolei
    Zhong, Shisheng
    Lin, Lin
    Gaojishu Tongxin/Chinese High Technology Letters, 2009, 19 (04): 428-433