Scalable Data Analytics Market Basket Model for Transactional Data Streams

被引:0
|
作者
Izang, Aaron A. [1 ]
Goga, Nicolae [1 ]
Kuyoro, Shade O. [1 ]
Alao, Olujimi D. [1 ]
Omotunde, Ayokunle A. [1 ]
Adio, Adesina K. [2 ]
机构
[1] Babcock Univ, Sch Comp & Engn Sci, Dept Comp Sci, Ilishan Remo, Ogun State, Nigeria
[2] Babcock Univ, Sch Sci & Technol, Dept Basic Sci, Ilishan Remo, Ogun State, Nigeria
关键词
Association rule mining; big data analytics; concept drift; market basket analysis; transactional data streams;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Transactional data streams (TDS) are incremental in nature thus, the process of mining is complicated. Such complications arise from challenges such as infinite length, feature evolution, concept evolution and concept drift. Tracking concept drift challenge is very difficult, thus very important for Market Basket Analysis (MBA) applications. Hence, the need for a strategy to accurately determine the suitability of item pairs within the available billions of pairs to solve concept drift chalenge of TDS in MBA. In this work, a Scalable Data Analytics Market Basket Model (SDAMBM) that handles concept drift issues in MBA was developed. Transactional data of 1,112,000 were extracted from a grocery store using Extraction, Transformation and Loading approach and 556,000 instances of the data were simulated from a cloud database. Calibev function was used to caliberate the data nodes. Lugui 7.2.9 and Comprehensive R Archive Network were used for table pivoting between the simulated data and the data collected. The SDAMBM was developed using a combination of components from elixir big data architecture, the research conceptual model and consumer behavior theories. Toad Modeler was then used to assemble the model. The SDAMBM was implemented using Monarch and Tableau to generate insights and data visualization of the transactions. Intelligent interpreters for auto decision grid, selectivity mechanism and customer insights were used as metrics to evaluate the model. The result showed that 79% of the customers from the customers' consumption pattern of the SDAMBM preferred buying snacks and drink as shown in the visualization report through the SDAMBM visualization dashboard. Finally, this study provided a data analytics approach for managing concept drift challenge in customers' buying pattern. Furthermore, a distinctive model for managing concept drift was also achieved. It is therefore recommended that the SDAMBM should be adopted for the enhancement of customers buying and consumption pattern by business ventures, organizations and retailers.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 50 条
  • [21] SCALE: a scalable framework for efficiently clustering transactional data
    Yan, Hua
    Chen, Keke
    Liu, Ling
    Yi, Zhang
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (01) : 1 - 27
  • [22] A New Measure of Complementarity in Market Basket Data
    Puka, Radoslaw
    Jedrusik, Stanislaw
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2021, 16 (04): : 670 - 681
  • [23] Improving the probabilistic modeling of market basket data
    Buchta, Christian
    ADVANCES IN DATA ANALYSIS, 2007, : 417 - 424
  • [24] Incremental and Parallel Analytics on Astrophysical Data Streams
    Mishin, Dmitryz
    Budavari, Tamas
    Szalay, Alexander
    Ahmad, Yanif
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1078 - 1086
  • [25] Privacy preserving market basket data analysis
    Guo, Ling
    Guo, Songtao
    Wu, Xintao
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 103 - +
  • [26] Finding localized associations in market basket data
    Aggarwal, CC
    Procopiuc, C
    Yu, PS
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 51 - 62
  • [27] Predictive Analytics for Complex IoT Data Streams
    Akbar, Adnan
    Khan, Abdullah
    Carrez, Francois
    Moessner, Klaus
    IEEE INTERNET OF THINGS JOURNAL, 2017, 4 (05): : 1571 - 1582
  • [28] Efficient similarity search for market basket data
    Alexandros Nanopoulos
    Yannis Manolopoulos
    The VLDB Journal, 2002, 11 : 138 - 152
  • [29] Efficient similarity search for market basket data
    Nanopoulos, A
    Manolopoulos, Y
    VLDB JOURNAL, 2002, 11 (02): : 138 - 152
  • [30] GEOSPATIAL INTERPOLATION ANALYTICS FOR DATA STREAMS IN EVENTSHOP
    Tang, Mengfan
    Agrawal, Pranav
    Pongpaichet, Siripen
    Jain, Ramesh
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,