A distributed real-time recommender system for big data streams

被引:3
|
作者
Hazem, Heidy [1 ,5 ]
Awad, Ahmed [2 ,3 ,4 ]
Yousef, Ahmed Hassan [1 ,5 ]
机构
[1] Nile Univ, Giza, Egypt
[2] Tartu Univ, Tartu, Estonia
[3] Cairo Univ, Giza, Egypt
[4] Narva Rd 18 Tartu City, Tartu Cty, EE-51009 Tartu, Estonia
[5] Juhayna Sq,26th July Corridor, Giza, Egypt
关键词
Streaming; Big data; Online Recommender Systems; MATRIX FACTORIZATION;
D O I
10.1016/j.asej.2022.102026
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recommender Systems (RS) play a crucial role in our lives. As users become continuously connected to the internet, they are less tolerant of obsolete recommendations made by an RS. Online RS has to address three requirements: continuous training and recommendation, handling concept drifts, and the ability to scale. Streaming RS proposed in the literature address the first two requirements only. That is because they run the training process on a single machine. To tackle the third challenge, we propose a Splitting and Replication mechanism for distributed streaming RS. Our mechanism is inspired by the shared-nothing architecture that underpins contemporary big data processing systems. We have applied our mechanism to two well-known approaches for online RS, namely, matrix factorization and item-based collaborative filtering. We conducted experiments comparing the performance with the baseline (single machine). Evaluating different data sets, experiments show online recall improvement by 40% with more than 50% less memory consumption. (c) 2022 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Distributed Real-Time Sentiment Analysis for Big Data Social Streams
    Rahnama, Amir Hossein Akhavan
    2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 789 - 794
  • [2] On the locality of Java 8 streams in real-time big data applications
    Chan, Yu
    Gray, Ian
    Wellings, Andy
    Audsley, Neil
    ACM International Conference Proceeding Series, 2014, 2014-October (October): : 20 - 28
  • [3] Survey on Real-time Anomaly Detection Technology for Big Data Streams
    Luo, Yuanvan
    Du, Xuehui
    Sun, Yi
    PROCEEDINGS OF 2018 12TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2018, : 26 - 30
  • [4] Research on real-time outlier detection over big data streams
    Chen, Liangchen
    Gao, Shu
    Cao, Xiufeng
    International Journal of Computers and Applications, 2020, 42 (01): : 93 - 101
  • [5] Distributed real-time ETL architecture for unstructured big data
    Mehmood, Erum
    Anees, Tayyaba
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (12) : 3419 - 3445
  • [6] Distributed real-time ETL architecture for unstructured big data
    Erum Mehmood
    Tayyaba Anees
    Knowledge and Information Systems, 2022, 64 : 3419 - 3445
  • [7] Distributed data real-time application system
    He, Xiang
    Ren, Kaiyin
    Zhang, Mingming
    Huang, Gaopan
    2015 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING IN INFORMATION COMMUNICATION TECHNOLOGY (SCICT 2015), 2015, : 39 - 43
  • [8] Real-Time News Recommender System
    Fortuna, Blaz
    Fortuna, Carolina
    Mladenic, Dunja
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2010, 6323 : 583 - 586
  • [9] Digital Data Streams: CREATING VALUE FROM THE REAL-TIME FLOW OF BIG DATA
    Pigni, Federico
    Piccoli, Gabriele
    Watson, Richard
    CALIFORNIA MANAGEMENT REVIEW, 2016, 58 (03) : 5 - 25
  • [10] Real-Time Machine Learning Competition on Data Streams at the IEEE Big Data 2019
    Boulegane, Dihia
    Radulovic, Nedeljko
    Bifet, Albert
    Fievet, Ghislain
    Sohn, Jimin
    Nam, Yeonwoo
    Yu, Seojeong
    Choi, Dong-Wan
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3493 - 3497