Intelligent Systems Utilization in Recommender Systems: A Reinforcement Learning Approach

被引:0
|
作者
Yazici, Ibrahim [1 ]
Ari, Emre [1 ]
机构
[1] Istanbul Tech Univ, Istanbul, Turkey
关键词
Reinforcement learning (RL); Recommender systems (RS); Upper Confidence Bound (UCB);
D O I
10.1007/978-3-031-09176-6_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommender systems (RS) have been gaining momentum with the advent of digitalization of our daily lives, accordingly, companies seek to attract most customers in this environment. One way of attracting more customers by advertisements is through online ads that make use of click-through rates (CTR) for the ads to build efficient RSS. For the RSS, frequently utilized methods are collaborative filtering (CF), content-based filtering (CBF) along with one of the traditional reinforcement learning approaches. The objective of this paper is to determine the best online ad among multiple advertisements to show the customers by reinforcement learning (RL). By treating the problem in multi-armed bandits, we modeled the problem with Bernoulli distribution by means of obtained CTRs. The best ad was tried to be chosen by the Bernoulli bandit with three settings; A/B/n testing, epsilon greedy, and Upper Confidence Bound (UCB) methods. The results show the explorations' contribution (with UCB and epsilon greedy) to the performance of the methods. Each method chose the same ad to show for online ads. UCB found the most preferable ad with a CTR rate of around 27.01%. It was followed by the epsilon greedy strategy with a CTR of around 25%. All the methods used determined the same ad alternative as the best according to the results obtained.
引用
收藏
页码:124 / 130
页数:7
相关论文
共 50 条
  • [21] Analysis of Autonomous Penetration Testing Through Reinforcement Learning and Recommender Systems
    Moreno, Ariadna Claudia
    Hernandez-Suarez, Aldo
    Sanchez-Perez, Gabriel
    Toscano-Medina, Linda Karina
    Perez-Meana, Hector
    Portillo-Portillo, Jose
    Olivares-Mercado, Jesus
    Garcia Villalba, Luis Javier
    SENSORS, 2025, 25 (01)
  • [22] DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems
    Zhao, Xiangyu
    Gu, Changsheng
    Zhang, Haoshenglun
    Yang, Xiwang
    Liu, Xiaobing
    Tang, Jiliang
    Liu, Hui
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 750 - 758
  • [23] Adversarial Robustness of Deep Reinforcement Learning Based Dynamic Recommender Systems
    Wang, Siyu
    Cao, Yuanjiang
    Chen, Xiaocong
    Yao, Lina
    Wang, Xianzhi
    Sheng, Quan Z.
    FRONTIERS IN BIG DATA, 2022, 5
  • [24] REVEAL 2022: Reinforcement Learning-Based Recommender Systems at Scale
    Li, Ying
    Basilico, Justin
    Raimond, Yves
    Dimakopoulou, Maria
    Liaw, Richard
    Bailey, Paige
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 684 - 685
  • [25] Intelligent flying-beamformer for hybrid mmWave systems: A deep reinforcement learning approach
    Wang, Yang
    Chen, Yawen
    Lu, Zhaoming
    Wen, Xiangming
    COMPUTER NETWORKS, 2023, 231
  • [26] Intelligent Sharing for LTE and WiFi Systems in Unlicensed Bands: A Deep Reinforcement Learning Approach
    Tan, Junjie
    Zhang, Lin
    Liang, Ying-Chang
    Niyato, Dusit
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (05) : 2793 - 2808
  • [27] Intelligent tourism recommender systems: A survey
    Borras, Joan
    Moreno, Antonio
    Valls, Aida
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (16) : 7370 - 7389
  • [28] A reinforcement learning approach to personalized learning recommendation systems
    Tang, Xueying
    Chen, Yunxiao
    Li, Xiaoou
    Liu, Jingchen
    Ying, Zhiliang
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2019, 72 (01): : 108 - 135
  • [29] A Reinforcement Learning Approach for Smart Irrigation Systems
    Campoverde, Luis Miguel Samaniego
    Palmieria, Nunzia
    AUTONOMOUS AIR AND GROUND SENSING SYSTEMS FOR AGRICULTURAL OPTIMIZATION AND PHENOTYPING VII, 2022, 12114
  • [30] A HEURISTIC APPROACH TO REINFORCEMENT LEARNING CONTROL SYSTEMS
    WALTZ, MD
    FU, KS
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1965, AC10 (04) : 390 - &