Scalable Data Analytics Market Basket Model for Transactional Data Streams

被引:0
|
作者
Izang, Aaron A. [1 ]
Goga, Nicolae [1 ]
Kuyoro, Shade O. [1 ]
Alao, Olujimi D. [1 ]
Omotunde, Ayokunle A. [1 ]
Adio, Adesina K. [2 ]
机构
[1] Babcock Univ, Sch Comp & Engn Sci, Dept Comp Sci, Ilishan Remo, Ogun State, Nigeria
[2] Babcock Univ, Sch Sci & Technol, Dept Basic Sci, Ilishan Remo, Ogun State, Nigeria
关键词
Association rule mining; big data analytics; concept drift; market basket analysis; transactional data streams;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Transactional data streams (TDS) are incremental in nature thus, the process of mining is complicated. Such complications arise from challenges such as infinite length, feature evolution, concept evolution and concept drift. Tracking concept drift challenge is very difficult, thus very important for Market Basket Analysis (MBA) applications. Hence, the need for a strategy to accurately determine the suitability of item pairs within the available billions of pairs to solve concept drift chalenge of TDS in MBA. In this work, a Scalable Data Analytics Market Basket Model (SDAMBM) that handles concept drift issues in MBA was developed. Transactional data of 1,112,000 were extracted from a grocery store using Extraction, Transformation and Loading approach and 556,000 instances of the data were simulated from a cloud database. Calibev function was used to caliberate the data nodes. Lugui 7.2.9 and Comprehensive R Archive Network were used for table pivoting between the simulated data and the data collected. The SDAMBM was developed using a combination of components from elixir big data architecture, the research conceptual model and consumer behavior theories. Toad Modeler was then used to assemble the model. The SDAMBM was implemented using Monarch and Tableau to generate insights and data visualization of the transactions. Intelligent interpreters for auto decision grid, selectivity mechanism and customer insights were used as metrics to evaluate the model. The result showed that 79% of the customers from the customers' consumption pattern of the SDAMBM preferred buying snacks and drink as shown in the visualization report through the SDAMBM visualization dashboard. Finally, this study provided a data analytics approach for managing concept drift challenge in customers' buying pattern. Furthermore, a distinctive model for managing concept drift was also achieved. It is therefore recommended that the SDAMBM should be adopted for the enhancement of customers buying and consumption pattern by business ventures, organizations and retailers.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 50 条
  • [31] Videolytics: System for Data Analytics of Video Streams
    Skopal, Tomas
    Duriskova, Dominika
    Pechman, Petr
    Dobransky, Marek
    Khachaturian, Vladislav
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4794 - 4798
  • [32] Scalable Progressive Analytics on Big Data in the Cloud
    Chandramouli, Badrish
    Goldstein, Jonathan
    Quamar, Abdul
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14): : 1726 - 1737
  • [33] Scalable genomic data exchange and analytics with sBeacon
    Anuradha Wickramarachchi
    Brendan Hosking
    Yatish Jain
    John Grimes
    Mitchell J. O’Brien
    Tracey Wright
    Mark A. Burgess
    Victor San Kho Lin
    Florian Reisinger
    Oliver Hofmann
    Michael Lawley
    Laurence O. W. Wilson
    Natalie A. Twine
    Denis C. Bauer
    Nature Biotechnology, 2023, 41 : 1510 - 1512
  • [34] Scalable genomic data exchange and analytics with sBeacon
    Wickramarachchi, Anuradha
    Hosking, Brendan
    Jain, Yatish
    Grimes, John
    O'Brien, Mitchell J.
    Wright, Tracey
    Burgess, Mark A.
    Lin, Victor San Kho
    Reisinger, Florian
    Hofmann, Oliver
    Lawley, Michael
    Wilson, Laurence O. W.
    Twine, Natalie A.
    Bauer, Denis C.
    NATURE BIOTECHNOLOGY, 2023, 41 (11) : 1510 - 1512
  • [35] A scalable algorithm for the market basket analysis
    Cavique, Luis
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2007, 14 (06) : 400 - 407
  • [36] In-Memory Computing for Scalable Data Analytics
    Li, Jun
    2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2015), 2015, : 93 - 94
  • [37] Scalable and Efficient Data Analytics and Mining with Lemonade
    dos Santos, Walter
    Avelar, Gustavo P.
    Ribeiro, Manoel Horta
    Guedes, Dorgival
    Meira Jr, Wagner
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 2070 - 2073
  • [38] Unsupervised Classification of Data Streams based on Typicality and Eccentricity Data Analytics
    Jales Costa, Bruno Sielly
    Bezerra, Clauber Gomes
    Guedes, Luiz Affonso
    Parvanov Angelov, Plamen
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 58 - 63
  • [39] Scalable Preference Learning from Data Streams
    Dzogang, Fabon
    Lansdall-Welfare, Thomas
    Sudhahar, Saatviga
    Cristianini, Nello
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 885 - 890
  • [40] Scalable keyword search on large data streams
    Lu Qin
    Jeffrey Xu Yu
    Lijun Chang
    The VLDB Journal, 2011, 20 : 35 - 57