MMDP: A Mobile-IoT Based Multi-Modal Reinforcement Learning Service Framework

被引:10
|
作者
Wang, Puming [1 ]
Yang, Laurence T. [1 ,3 ]
Li, Jintao [2 ]
Li, Xue [4 ]
Zhou, Xiaokang [5 ,6 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Meituan Dianping Co, Beijing 100010, Peoples R China
[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 2W5, Canada
[4] Henan Inst Technol, Sch Elect Informat Engn, Xinxiang 453003, Henan, Peoples R China
[5] Shiga Univ, Fac Data Sci, Hikone, Shiga 5228522, Japan
[6] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
关键词
Tensors; Internet of Things; Markov processes; Reinforcement learning; Public transportation; Sensor systems; Multi-modal reinforcement learning; mobile Internet of Things; service framework; social sensors; multi-modal Markov decision process; action-aware high-order transition tensor; tensor policy iteration algorithm; optimal tensor policy; BIG DATA; SYSTEMS;
D O I
10.1109/TSC.2020.2964663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of GPS technology, a new Mobile Internet of Things (M-IoT) is emerging, which perceives the city's rhythm and pulse day and night to collect a large scale of city data. It is urgent to innovate M-IoT service system for these large-scale and heterogeneous data. To cope with the problem, this article proposes a Mobile-IoT based multi-modal reinforcement learning service framework from data perspective, which has three highlights, i) Developing Action-aware High-order Transition Tensor (AHTT) to fuse the heterogeneous data from M-IoTs in a unified form. ii) Developing Multi-modal Markov Decision Process (MMDP) to model the multi-modal reinforcement learning for M-IoT service framework. iii) Developing Tensor Policy Iteration algorithm (TPIA) to solve the optimal tensor policy. Due to using tensor keeps the multi-modal relations of the context information in the process of solving the optimal policy. The proposed M-IoT service system provides more personalized service for taxi drivers. The experiment results shows that most taxi drivers earn more revenue according to the tensor policy.
引用
收藏
页码:675 / 684
页数:10
相关论文
共 50 条
  • [1] Service restoration in multi-modal optical transport networks with reinforcement learning
    Zhao, Zipiao
    Zhao, Yongli
    Li, Yajie
    Wang, Feng
    Li, Xinghua
    Han, Dahai
    Zhang, Jie
    [J]. OPTICS EXPRESS, 2021, 29 (03): : 3825 - 3840
  • [2] Multi-Modal Legged Locomotion Framework With Automated Residual Reinforcement Learning
    Yu, Chen
    Rosendo, Andre
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04): : 10312 - 10319
  • [3] A Multi-Modal Person Perception Framework for Socially Interactive Mobile Service Robots
    Mueller, Steffen
    Wengefeld, Tim
    Trinh, Thanh Quang
    Aganian, Dustin
    Eisenbach, Markus
    Gross, Horst-Michael
    [J]. SENSORS, 2020, 20 (03)
  • [4] MMRL: A Multi-Modal Reinforcement Learning Technique for Energy-efficient Medical IoT Systems
    Abo-eleneen, Amr
    Mohamed, Amr
    [J]. IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 2026 - 2031
  • [5] A unified framework for multi-modal federated learning
    Xiong, Baochen
    Yang, Xiaoshan
    Qi, Fan
    Xu, Changsheng
    [J]. NEUROCOMPUTING, 2022, 480 : 110 - 118
  • [6] A Framework of Multi-modal Corpus for Mandarin Learning
    Liu, Yang
    Yang, Chunting
    [J]. 2009 IITA INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS ENGINEERING, PROCEEDINGS, 2009, : 476 - 479
  • [7] Multi-modal Learning for Video Recommendation based on Mobile Application Usage
    Jia, Xiaowei
    Wang, Aosen
    Li, Xiaoyi
    Xun, Guangxu
    Xu, Wenyao
    Zhang, Aidong
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 837 - 842
  • [8] A Multi-Modal Vertical Federated Learning Framework Based on Homomorphic Encryption
    Gong, Maoguo
    Zhang, Yuanqiao
    Gao, Yuan
    Qin, A. K.
    Wu, Yue
    Wang, Shanfeng
    Zhang, Yihong
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 1826 - 1839
  • [9] Excavation of Fragmented Rocks with Multi-modal Model-based Reinforcement Learning
    Zhu, Yifan
    Wang, Liyang
    Zhang, Liangjun
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 6523 - 6530
  • [10] Mobile language learning with multimedia and multi-modal interfaces
    Joseph, Sam
    Uther, Maria
    [J]. FOURTH IEEE INTERNATIONAL WORKSHOP ON WIRELESS, MOBILE AND UBIQUITOUS TECHNOLOGY IN EDUCATION, PROCEEDINGS, 2006, : 124 - +