Scalable Data Analytics Market Basket Model for Transactional Data Streams

被引:0
|
作者
Izang, Aaron A. [1 ]
Goga, Nicolae [1 ]
Kuyoro, Shade O. [1 ]
Alao, Olujimi D. [1 ]
Omotunde, Ayokunle A. [1 ]
Adio, Adesina K. [2 ]
机构
[1] Babcock Univ, Sch Comp & Engn Sci, Dept Comp Sci, Ilishan Remo, Ogun State, Nigeria
[2] Babcock Univ, Sch Sci & Technol, Dept Basic Sci, Ilishan Remo, Ogun State, Nigeria
关键词
Association rule mining; big data analytics; concept drift; market basket analysis; transactional data streams;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Transactional data streams (TDS) are incremental in nature thus, the process of mining is complicated. Such complications arise from challenges such as infinite length, feature evolution, concept evolution and concept drift. Tracking concept drift challenge is very difficult, thus very important for Market Basket Analysis (MBA) applications. Hence, the need for a strategy to accurately determine the suitability of item pairs within the available billions of pairs to solve concept drift chalenge of TDS in MBA. In this work, a Scalable Data Analytics Market Basket Model (SDAMBM) that handles concept drift issues in MBA was developed. Transactional data of 1,112,000 were extracted from a grocery store using Extraction, Transformation and Loading approach and 556,000 instances of the data were simulated from a cloud database. Calibev function was used to caliberate the data nodes. Lugui 7.2.9 and Comprehensive R Archive Network were used for table pivoting between the simulated data and the data collected. The SDAMBM was developed using a combination of components from elixir big data architecture, the research conceptual model and consumer behavior theories. Toad Modeler was then used to assemble the model. The SDAMBM was implemented using Monarch and Tableau to generate insights and data visualization of the transactions. Intelligent interpreters for auto decision grid, selectivity mechanism and customer insights were used as metrics to evaluate the model. The result showed that 79% of the customers from the customers' consumption pattern of the SDAMBM preferred buying snacks and drink as shown in the visualization report through the SDAMBM visualization dashboard. Finally, this study provided a data analytics approach for managing concept drift challenge in customers' buying pattern. Furthermore, a distinctive model for managing concept drift was also achieved. It is therefore recommended that the SDAMBM should be adopted for the enhancement of customers buying and consumption pattern by business ventures, organizations and retailers.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 50 条
  • [1] Scalable data analytics using crowdsourced repositories and streams
    Veloso, Bruno
    Leal, Fatima
    Gonzalez-Velez, Horacio
    Malheiro, Benedita
    Carlos Burguillo, Juan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 122 : 1 - 10
  • [2] AGAMI: Scalable Visual Analytics over Multidimensional Data Streams
    Lu, Mingxin
    Wong, Edmund
    Barajas, Daniel
    Li, Xiaochen
    Ogundipe, Mosopefoluwa
    Wilson, Nate
    Garg, Pragya
    Joshi, Alark
    Malensek, Matthew
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT 2020), 2020, : 57 - 66
  • [3] Clustering transactional data streams
    Li, Yanrong
    Gopalan, Raj P.
    AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 1069 - +
  • [4] Retail business analytics: Customer visit segmentation using market basket data
    Griva, Anastasia
    Bardaki, Cleopatra
    Pramatari, Katerina
    Papakiriakopoulos, Dimitris
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 100 : 1 - 16
  • [5] Scalable Analytics on Fast Data
    Kipf, Andreas
    Pandey, Varun
    Boettcher, Jan
    Braun, Lucas
    Neumann, Thomas
    Kemper, Alfons
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2019, 44 (01):
  • [6] Hancock: A language for analyzing transactional data streams
    Cortes, C
    Fisher, K
    Pregibon, D
    Rogers, A
    Smith, F
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2004, 26 (02): : 301 - 338
  • [7] Efficient reservoir sampling for transactional data streams
    Dash, Manoranjan
    Ng, Willie
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 662 - +
  • [8] Volatility Drift Prediction for Transactional Data Streams
    Koh, Yun Sing
    Huang, David Tse Jung
    Pearce, Chris
    Dobbie, Gillian
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1091 - 1096
  • [9] Basket2vec: Learning Retail Basket Representations From Transactional Data
    Piguave, Bryan V.
    Abad, Andres G.
    IEEE Access, 2024, 12 : 162392 - 162398
  • [10] Scalable Data-Intensive Analytics
    Hsu, Meichun
    Chen, Qiming
    BUSINESS INTELLIGENCE FOR THE REAL-TIME ENTERPRISE, 2009, 27 : 97 - +