A distributed real-time recommender system for big data streams

被引:3
|
作者
Hazem, Heidy [1 ,5 ]
Awad, Ahmed [2 ,3 ,4 ]
Yousef, Ahmed Hassan [1 ,5 ]
机构
[1] Nile Univ, Giza, Egypt
[2] Tartu Univ, Tartu, Estonia
[3] Cairo Univ, Giza, Egypt
[4] Narva Rd 18 Tartu City, Tartu Cty, EE-51009 Tartu, Estonia
[5] Juhayna Sq,26th July Corridor, Giza, Egypt
关键词
Streaming; Big data; Online Recommender Systems; MATRIX FACTORIZATION;
D O I
10.1016/j.asej.2022.102026
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recommender Systems (RS) play a crucial role in our lives. As users become continuously connected to the internet, they are less tolerant of obsolete recommendations made by an RS. Online RS has to address three requirements: continuous training and recommendation, handling concept drifts, and the ability to scale. Streaming RS proposed in the literature address the first two requirements only. That is because they run the training process on a single machine. To tackle the third challenge, we propose a Splitting and Replication mechanism for distributed streaming RS. Our mechanism is inspired by the shared-nothing architecture that underpins contemporary big data processing systems. We have applied our mechanism to two well-known approaches for online RS, namely, matrix factorization and item-based collaborative filtering. We conducted experiments comparing the performance with the baseline (single machine). Evaluating different data sets, experiments show online recall improvement by 40% with more than 50% less memory consumption. (c) 2022 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/
引用
收藏
页数:16
相关论文
共 50 条
  • [21] HPC2-ARS: an Architecture for Real-time Analytic of Big Data Streams
    Cheng, Yingchao
    Cai, Ruichu
    Wen, Wen
    Hao, Zhifeng
    2018 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2018), 2018, : 319 - 322
  • [22] Real-Time Tweet Analytics Using Hybrid Hashtags on Twitter Big Data Streams
    Gupta, Vibhuti
    Hewett, Rattikorn
    INFORMATION, 2020, 11 (07)
  • [23] Real Time Recommender System for Music Data
    Athani, Manjula
    Pathak, Neelam
    Khan, Asif Ullah
    Gour, Bhupesh
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (08): : 88 - 91
  • [24] An ML Based Anomaly Detection System in real-time data streams
    Diaz Rivera, Javier Jose
    Khan, Talha Ahmed
    Akbar, Waleed
    Afaq, Muhammad
    Song, Wang-Cheol
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 1329 - 1334
  • [25] Lunatory: A Real-Time Distributed Trajectory Clustering Framework for Web Big Data
    Wu, Yang
    Pan, Zhicheng
    Chao, Pingfu
    Fang, Junhua
    Chen, Wei
    Zhao, Lei
    WEB ENGINEERING (ICWE 2022), 2022, 13362 : 219 - 234
  • [26] Adaptable parsing of real-time data streams
    Campanile, Ferdinando
    Cilardo, Alessandro
    Coppolino, Luigi
    Romano, Luigi
    15TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2007, : 412 - +
  • [27] A Real-Time Annotation of Motion Data Streams
    Elias, Petr
    Sedmidubsky, Jan
    Zezula, Pavel
    2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 154 - 161
  • [28] Real-Time Skyline Computation on Data Streams
    Rudenko, Lena
    Endres, Markus
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2018, 2018, 909 : 20 - 28
  • [29] Real-Time Data Mining for Event Streams
    Roudjane, Massiva
    Rebaine, Djamal
    Khoury, Raphael
    Halle, Sylvain
    2018 IEEE 22ND INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE (EDOC 2018), 2018, : 123 - 134
  • [30] A REAL-TIME MONITOR FOR A DISTRIBUTED REAL-TIME OPERATING SYSTEM
    TOKUDA, H
    KOTERA, M
    MERCER, CW
    SIGPLAN NOTICES, 1989, 24 (01): : 68 - 77