Federated learning enabled hotel customer classification towards imbalanced data

Cited by: 0
Authors
Liu, Tao [1, 2]
Chen, Shouqiang [3]
Wu, Meng [1]
Yu, Miao [1]
Affiliations
[1] China Univ Polit Sci & Law, Business Sch, Beijing, Peoples R China
[2] Minist Educ, Ctr Sci Res & Dev Higher Educ Inst, PRChina CSRD, Beijing, Peoples R China
[3] MCC Real Estate Grp Co Ltd, Beijing, Peoples R China
Keywords
Federated learning; Customer classification; Imbalanced data
DOI
10.1016/j.asoc.2024.112028
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Hotel customer classification is the basis of customer profiling, which can significantly benefit a hotel by enabling more appropriate services for targeted customers. However, the imbalanced data distribution at an individual hotel cannot support a reliable classification result, and sharing personal information among hotels is not allowed. In this paper, we propose a privacy-preserving hotel customer classification model built with federated learning. A significant challenge is that hotels with different star ratings or located in different city regions usually serve specific customer groups, resulting in imbalanced data that degrades classification accuracy. We introduce an attention mechanism and design a client selection strategy to balance global and local performance on imbalanced data. Because of privacy constraints, we evaluate our solution's communication cost and accuracy on public imbalanced datasets and demonstrate real-world customer classification results. Extensive experiments show that our solution performs better than the state-of-the-art (SOTA) method.
Pages: 9