Real-time incremental recommendation for streaming data based on apache flink

被引:2
|
作者
Tang, Zhuo [1 ]
Liu, Zeyu [1 ]
Li, Kenli [1 ]
Li, Keqin [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Collaboratie filtering; online learning; incremental learning; recommendation system; low-rank matrix factorization; Apache Flink;
D O I
10.3233/IDA-184330
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collaborative filtering (CF), one of the most famous methods for building recommendation systems, recommends relevant items to users or predicting ratings of users' unknown items. Matrix factorization (MF) models are well-known model to deal with predicting the rating problem. However, the recommendation system based on matrix factorization is hard to keep up with the rapidly changing real-world data. When ratings on new users or new items come, the static model can not fit well on new data. As a consequence, if the current thing does not apply, the prediction accuracy will lose. In addition, it is a significant computation cost to rebuild the model on the whole data. To capture these changes, in this paper, we construct an online-and-offline Collaborative Filtering with a multi-method model to improve the traditional CF method, called Online SGD with Offline Knowledge (OSGDO for short). Besides, we propose a real-time incremental recommendation framework on Apache Flink, which is a scalable stream and batch data processing platform. Meanwhile, we implement our proposed method on our proposed framework. Our method proves to be good at online training when new observations arrive. And the results of experiments show that the dynamic training process we proposed is more efficient than rebuild the model on all the data. At the same time, our algorithm performs well in practice and can achieve impressive accuracy quickly when it is tested with the well-known data sets of MoviesLens and Netflix.
引用
收藏
页码:1421 / 1437
页数:17
相关论文
共 50 条
  • [1] Video2Flink: real-time video partitioning in Apache Flink and the cloud
    Dimitrios Kastrinakis
    Euripides G. M. Petrakis
    Machine Vision and Applications, 2023, 34
  • [2] Video2Flink: real-time video partitioning in Apache Flink and the cloud
    Kastrinakis, Dimitrios
    Petrakis, Euripides G. M.
    MACHINE VISION AND APPLICATIONS, 2023, 34 (03)
  • [3] An efficient architecture for processing real-time traffic data streams using apache flink
    Deepthi, B. Gnana
    Rani, K. Sandhya
    Krishna, P. Venkata
    Saritha, V.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 37369 - 37385
  • [4] An efficient architecture for processing real-time traffic data streams using apache flink
    B. Gnana Deepthi
    K. Sandhya Rani
    P. Venkata Krishna
    V. Saritha
    Multimedia Tools and Applications, 2024, 83 : 37369 - 37385
  • [5] Real-Time Deep Learning-Based Anomaly Detection Approach for Multivariate Data Streams with Apache Flink
    Ha, Tae Wook
    Kang, Jung Mo
    Kim, Myoung Ho
    ICWE 2021 WORKSHOPS, ICWE 2021 INTERNATIONAL WORKSHOPS, 2022, 1508 : 39 - 49
  • [6] Apache Storm Based on Topology for Real-Time Processing of Streaming Data from Social Networks
    Batyuk, Anatoliy
    Voityshyn, Volodymyr
    PROCEEDINGS OF THE 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2016, : 345 - 349
  • [7] Real-time user clickstream behavior analysis based on apache storm streaming
    Gautam Pal
    Katie Atkinson
    Gangmin Li
    Electronic Commerce Research, 2023, 23 : 1829 - 1859
  • [8] Real-time user clickstream behavior analysis based on apache storm streaming
    Pal, Gautam
    Atkinson, Katie
    Li, Gangmin
    ELECTRONIC COMMERCE RESEARCH, 2023, 23 (03) : 1829 - 1859
  • [9] Real-time Data Streaming using Apache Spark on Fully Configured Hadoop Cluster
    Prasad, Kashi Sai
    Pasupathy, S.
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2018, 13 (05): : 164 - 176
  • [10] An incremental payment method for Internet based streaming real-time media
    Fuchsberger, A
    INFORMATION SECURITY, 1998, 1396 : 321 - 327