Federated Latent Dirichlet Allocation for User Preference Mining

Cited by: 0
Authors
Wu, Xing [1]
Fan, Yushun [1]
Zhang, Jia [2]
Gao, Zhenfeng [3]
Affiliations
[1] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRist, Dept Automat, Beijing, Peoples R China
[2] Southern Methodist Univ, Dept Comp Sci, Dallas, TX USA
[3] Sangfor Technol Inc, Shenzhen, Peoples R China
Source
JOURNAL OF WEB ENGINEERING | 2023, Vol. 22, No. 04
Keywords
Web service composition; user preference mining; federated learning; LDA; homomorphic encryption; blockchain; EFFICIENT; BLOCKCHAIN;
DOI
10.13052/jwe1540-9589.2244
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
In the field of Web services computing, a recent trend is to mine user preferences from user requirements when creating Web service compositions, in order to meet comprehensive and ever-evolving user needs. Machine learning methods such as latent Dirichlet allocation (LDA) have been applied to user preference mining. However, training a high-quality LDA model typically requires large amounts of data. With the prevalence of government regulations and laws and growing public awareness of privacy protection, the traditional approach of collecting user data on a central server is no longer applicable. Therefore, it is necessary to design a privacy-preserving method that trains an LDA model without collecting data at scale or leaking it. In this paper, we present novel federated LDA techniques to learn user preferences in the Web service ecosystem. On the basis of a user-level distributed LDA algorithm, we establish two federated LDA models for two-layer training scenarios: a centralized synchronous federated LDA (CSFed-LDA) for synchronous scenarios and a decentralized asynchronous federated LDA (DAFed-LDA) for asynchronous ones. In the CSFed-LDA model, an importance-based partially homomorphic encryption (IPHE) technique is developed to protect privacy efficiently. In the DAFed-LDA model, blockchain technology is incorporated and a multi-channel-based authority control scheme (MCACS) is designed to enhance data security. Extensive experiments on a real-world dataset from ProgrammableWeb.com demonstrate the model performance, security assurance, and training speed of our approach.
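As a rough illustration of the synchronous setting mentioned in the abstract, the sketch below shows one generic way a server could combine per-client topic-word counts into global topic-word distributions. It is not the paper's CSFed-LDA or IPHE scheme: the counts are summed in plaintext with no encryption, and the function names, the toy count matrices, and the smoothing value `beta` are all assumptions made for this example.

```python
from typing import List

Matrix = List[List[int]]  # rows = topics, columns = vocabulary words


def aggregate_counts(client_counts: List[Matrix]) -> Matrix:
    """Element-wise sum of per-client topic-word count matrices."""
    n_topics = len(client_counts[0])
    n_words = len(client_counts[0][0])
    total = [[0] * n_words for _ in range(n_topics)]
    for counts in client_counts:
        for k in range(n_topics):
            for w in range(n_words):
                total[k][w] += counts[k][w]
    return total


def topic_word_distributions(total: Matrix, beta: float = 0.01) -> List[List[float]]:
    """Smoothed estimate phi[k][w] = (n_kw + beta) / (n_k + V * beta)."""
    phi = []
    for row in total:
        denom = sum(row) + len(row) * beta
        phi.append([(count + beta) / denom for count in row])
    return phi


# Hypothetical toy data: two clients, two topics, three vocabulary words.
client_a = [[3, 0, 1], [0, 2, 2]]
client_b = [[1, 1, 0], [0, 4, 1]]

global_counts = aggregate_counts([client_a, client_b])
phi = topic_word_distributions(global_counts)
print(global_counts)
```

In a privacy-preserving variant, the element-wise sum is the step that an additively homomorphic scheme could perform on ciphertexts, so that the server never sees an individual client's counts; how the paper selects which entries to encrypt is not described in this record.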
Pages: 639-678
Number of pages: 40