Dynamic pricing of differentiated products with incomplete information based on reinforcement learning

被引:1
|
作者
Wang, Cheng [1 ,2 ]
Cui, Senbing [1 ]
Wu, Runhua [1 ]
Wang, Ziteng [3 ]
机构
[1] Zhejiang Univ Technol, Coll Mech Engn, Hangzhou 310023, Peoples R China
[2] Zhejiang Univ Technol, Taizhou Res Inst, Taizhou, Peoples R China
[3] Northern Illinois Univ, Dept Ind & Syst Engn, De Kalb, IL 60115 USA
基金
中国国家自然科学基金;
关键词
customer preference; differentiated product; dynamic pricing; incomplete information; reinforcement learning; STRATEGY;
D O I
10.1049/cim2.12050
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the rapid development of the social economy, consumer demand is evolving towards diversification. To satisfy market demand, enterprises tend to improve competitiveness by providing differentiated products. How to price differentiated products becomes a hot topic. Traditionally, customers' preferences are assumed to be independent and identically distributed. With a known distribution, companies can easily make pricing decisions for differentiated products. However, such an assumption may be invalid in practice, especially for rapidly updating products. In this paper, a dynamic pricing policy for differentiated products with incomplete information is developed. An adaptive multi-armed bandit algorithm based on reinforcement learning is proposed to balance exploration and exploitation. Numerical examples show that the frequency of price adjustment affects the total profit significantly. Specifically, the more chances to adjust the price, the higher the total profit. Furthermore, experiments show that the dynamic pricing policy proposed in this paper outperforms other algorithms, such as Softmax and UCB1.
引用
收藏
页码:123 / 138
页数:16
相关论文
共 50 条
  • [1] Dynamic pricing of information products based on reinforcement learning: A yield-management approach
    Schwind, M
    Wendt, O
    [J]. KI2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2479 : 51 - 66
  • [2] Pricing and Positioning of Horizontally Differentiated Products with Incomplete Demand Information
    den Boer, Arnoud V.
    Chen, Boxiao
    Wang, Yining
    [J]. OPERATIONS RESEARCH, 2024, 72 (06) : 2446 - 2466
  • [3] Deep Reinforcement Learning for Dynamic Pricing of Perishable Products
    Burman, Vibhati
    Vashishtha, Rajesh Kumar
    Kumar, Rajan
    Ramanan, Sharadha
    [J]. OPTIMIZATION AND LEARNING, OLA 2021, 2021, 1443 : 132 - 143
  • [4] Dynamic Pricing for Differentiated PEV Charging Services Using Deep Reinforcement Learning
    Abdalrahman, Ahmed
    Zhuang, Weihua
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 1415 - 1427
  • [5] Dynamic pricing and reinforcement learning
    Carvalho, AX
    Puterman, ML
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2916 - 2921
  • [6] Dynamic pricing based on asymmetric multiagent reinforcement learning
    Könönen, V
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (01) : 73 - 98
  • [7] Dynamic Pricing by Multiagent Reinforcement Learning
    Han, Wei
    Liu, Lingbo
    Zheng, Huaili
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 226 - 229
  • [8] Reinforcement Learning for Fair Dynamic Pricing
    Maestre, Roberto
    Duque, Juan
    Rubio, Alberto
    Arevalo, Juan
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 : 120 - 135
  • [10] Dynamic pricing policies for interdependent perishable products or services using reinforcement learning
    Rana, Rupal
    Oliveira, Fernando S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 426 - 436