adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential Decision Making Human-in-the-Loop Systems

被引:6
|
作者
Taherisadr, Mojtaba [1 ]
Stavroulakis, Stelios Andrew [1 ]
Elmalaki, Salma [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92697 USA
关键词
Privacy; Reinforcement Learning; Human-in-the-Loop; IoT;
D O I
10.1145/3576842.3582325
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) presents numerous benefits compared to rule-based approaches in various applications. Privacy concerns have grown with the widespread use of RL trained with privacy-sensitive data in IoT devices, especially for human-in-the-loop systems. On the one hand, RL methods enhance the user experience by trying to adapt to the highly dynamic nature of humans. Onthe other hand, trained policies can leak the user's private information. Recent attention has been drawn to designing privacy-aware RL algorithms while maintaining an acceptable system utility. A central challenge in designing privacy-aware RL, especially for human-in-the-loop systems, is that humans have intrinsic variability, and their preferences and behavior evolve. The effect of one privacy leak mitigation can differ for the same human or across different humans over time. Hence, we can not design one fixed model for privacy-aware RL that fits all. To that end, we propose adaPARL, an adaptive approach for privacy-aware RL, especially for human-in-the-loop IoT systems. adaPARL provides a personalized privacy-utility trade-off depending on human behavior and preference. We validate the proposed adaPARL on two IoT applications, namely (i) Human-in-the-Loop Smart Home and (ii) Human-in-the-Loop Virtual Reality (VR) Smart Classroom. Results obtained on these two applications validate the generality of adaPARL and its ability to provide a personalized privacy-utility trade-off. On average, adaPARL improves the utility by 57% while reducing the privacy leak by 23% on average.
引用
收藏
页码:262 / 274
页数:13
相关论文
共 50 条
  • [21] Privacy-Aware Task Offloading via Two-Timescale Reinforcement Learning
    Dong, Jiyu
    Geng, Dongqing
    He, Xiaofan
    [J]. 2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 220 - 225
  • [22] Sequential Decision Making with "Sequential Information" in Deep Reinforcement Learning
    Xu, Aimin
    Yuan, Linghui
    Liu, Yunlong
    [J]. PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 173 - 184
  • [23] Pragmatic Image Compression for Human-in-the-Loop Decision-Making
    Reddy, Siddharth
    Dragan, Anca D.
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [24] ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning
    Chen, Sean
    Gao, Jensen
    Reddy, Siddharth
    Berseth, Glen
    Dragan, Anca D.
    Levine, Sergey
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7505 - 7512
  • [25] Traded Control of Human-Machine Systems for Sequential Decision-Making Based on Reinforcement Learning
    Zhang, Qianqian
    Kang, Yu
    Zhao, Yun-Bo
    Li, Pengfei
    You, Shiyi
    [J]. IEEE Transactions on Artificial Intelligence, 2022, 3 (04): : 553 - 566
  • [26] Human-in-the-Loop Reinforcement Learning in Continuous-Action Space
    Luo, Biao
    Wu, Zhengke
    Zhou, Fei
    Wang, Bing-Chuan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 10
  • [27] History-Aware Explanations: Towards Enabling Human-In-The-Loop in Self-Adaptive Systems
    Parra-Ullauri, Juan
    Garcia-Dominguez, Antonio
    Bencomo, Nelly
    Garcia-Paucar, Luis
    [J]. ACM/IEEE 25TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS, MODELS 2022 COMPANION, 2022, : 286 - 295
  • [28] Deep-Reinforcement-Learning-Based User Profile Perturbation for Privacy-Aware Recommendation
    Xiao, Yilin
    Xiao, Liang
    Lu, Xiaozhen
    Zhang, Hailu
    Yu, Shui
    Poor, H. Vincent
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (06): : 4560 - 4568
  • [29] Reinforcement learning for decision making in sequential visual attention
    Paletta, Lucas
    Fritz, Gerald
    [J]. ATTENTION IN COGNITIVE SYSTEMS: THEORIES AND SYSTEMS FROM AN INTERDISCIPLINARY VIEWPOINT, 2007, 4840 : 293 - 306
  • [30] ADAS-RL: Adaptive Vector Scaling Reinforcement Learning For Human-in-the-Loop Lane Departure Warning
    Ahadi-Sarkani, Armand
    Elmalaki, Salma
    [J]. CPHS'21: PROCEEDINGS OF THE 2021 THE FIRST ACM INTERNATIONAL WORKSHOP ON CYBER-PHYSICAL-HUMAN SYSTEM DESIGN AND IMPLEMENTATION, 2021, : 7 - 12