Big data analytics for retail industry using MapReduce-Apriori framework

被引:27
|
作者
Verma, Neha [1 ]
Malhotra, Dheeraj [1 ]
Singh, Jatinder [2 ]
机构
[1] GGSIPU, Vivekananda Inst Profess Studies, Delhi, India
[2] St Baba Bhaag Singh Univ, Jalandhar, Punjab, India
关键词
Big data; retail analytics; MR-Apriori algorithm; map-reduce; market basket analysis; association mining; IRM tool; ALGORITHM; MODEL;
D O I
10.1080/23270012.2020.1728403
中图分类号
F [经济];
学科分类号
02 ;
摘要
Presently, retailing has changed its face from unordered stacked traditional stores to beautifully decorated and appropriately managed merchandise stores or shopping malls with excellent ambiance and comfort. Therefore, these stores try to accommodate all needed items for daily use or rarely required items under the same roof. However, the primary challenge for today's retailer is that the modern customer is quality and brands conscious as well as compare for services provided to them by different outlets at the comfort of home with a single click. Therefore, customers prefer to purchase from E-Commerce websites instead of physically visiting a retail store, which leads to the downfall in the sales of retailers which become a serious threat to them. Therefore, retailers are required to work sincerely towards their customer expectations by providing all their needed goods under the same roof. Therefore, the objective of this paper is to assist retail business owners to recognize the purchasing needs of their customers and hence to entice customers to physical retail stores away from competitor E-Commerce websites. This paper employs a systematic research methodology based on association rule mining deployed over Map-Reduce based Apriori association mining and Hadoop based intelligent cloud architecture to determine useful buying patterns from purchase history of previous customers, in order to assist retail business owners. The finding acknowledges that the traditional mining algorithms have not progressed to support big data analysis as required by current retail businesses owners. The job of finding unknown association rules from big data requires a lot of resources such as memory and processing engines. Moreover, traditional mining systems are inadequate to provide support for partial failure support, extensibility, scalability etc. Therefore, this study aims to implement and develop MapReduce based Apriori (MR-Apriori) algorithm in the form of Intelligent Retail Mining Tool i.e. IRM Tool to recognize all these concerns in an efficient manner. The proposed system adequately satisfy all significant requisites anticipated from modern Big Data processing systems such as scalability, fault tolerance, partial failure support etc. Finally, this study experimentally verifies the effectiveness of the proposed algorithm i.e. MR-Apriori by speed-up, size-up, and scale-up evaluation parameters.
引用
收藏
页码:424 / 442
页数:19
相关论文
共 50 条
  • [31] Dache: A data aware caching for big-data applications using the MapReduce framework
    [J]. Zhao, Y. (yaxiongzhao@google.com), 1600, Tsinghua University (19):
  • [32] Dache: A Data Aware Caching for Big-Data Applications Using the MapReduce Framework
    Zhao, Yaxiong
    Wu, Jie
    Liu, Cong
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2014, 19 (01) : 39 - 50
  • [33] Big data for business management in the retail industry
    Santoro, Gabriele
    Fiano, Fabio
    Bertoldi, Bernardo
    Ciampi, Francesco
    [J]. MANAGEMENT DECISION, 2019, 57 (08) : 1980 - 1992
  • [34] Big data classification with optimization driven MapReduce framework
    Mohammed, Mujeeb Shaik
    Rachapudy, Praveen Sam
    Kasa, Madhavi
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2021, 25 (02) : 173 - 183
  • [35] A Hadoop/MapReduce based platform for supporting health big data analytics
    Kuo, Alex
    Chrimes, Dillon
    Qin, Pinle
    Zamani, Hamid
    [J]. Studies in Health Technology and Informatics, 2019, 257 : 229 - 235
  • [36] AMPO: Algorithm for MapReduce Performance Optimization for Enhancing Big Data Analytics
    Yambem, Nandita
    Nandakumar, A. N.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 717 - 723
  • [37] An Enhanced Memetic Algorithm for Feature Selection in Big Data Analytics with MapReduce
    Ramakrishnan, Umanesan
    Nachimuthu, Nandhagopal
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (03): : 1547 - 1559
  • [38] Big Data Analytics and Business Intelligence in Industry
    Huang, Shih-Chia
    McIntosh, Suzanne
    Sobolevsky, Stanislav
    Hung, Patrick C. K.
    [J]. INFORMATION SYSTEMS FRONTIERS, 2017, 19 (06) : 1229 - 1232
  • [39] A big data analytics framework for scientific data management
    Fiore, Sandro
    Palazzo, Cosimo
    D'Anca, Alessandro
    Foster, Ian
    Williams, Dean N.
    Aloisio, Giovanni
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [40] Big data analytics in Industry 4.0 ecosystems
    Aujla, Gagangeet Singh
    Prodan, Radu
    Rawat, Danda B.
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (03): : 639 - 641