Comparative analysis of model performance for predicting the customer of cafeteria using unstructured data

被引:0
|
作者
Kim, Seungsik [1 ]
Gu, Nami [2 ]
Moon, Jeongin [3 ]
Kim, Keunwook [4 ]
Hwang, Yeongeun [5 ]
Lee, Kyeongjun [6 ]
机构
[1] Kyungpook Natl Univ, Dept Stat, Daegu, South Korea
[2] Pusan Natl Univ, Dept Stat, Busan, South Korea
[3] Yeungnam Univ, Dept Stat, Gyongsan, South Korea
[4] Daegu Digital Innovat Agcy, Big Data Utilizat Ctr, Daegu, South Korea
[5] Korea Ind Complex Corp, Ind Complex Promot Dept, Daegu, South Korea
[6] Kumoh Natl Inst Technol, Dept Math & Big Data Sci, 61 Daehak Ro, Gumi 39177, Gyeongbuk, South Korea
关键词
cafeteria; ensemble model; ESG; food waste; machine learning; menu features; performance improvement; prediction; word embedding;
D O I
10.29220/CSAM.2023.30.5.485
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This study aimed to predict the number of meals served in a group cafeteria using machine learning methodology. Features of the menu were created through the Word2Vec methodology and clustering, and a stacking ensemble model was constructed using Random Forest, Gradient Boosting, and CatBoost as sub-models. Results showed that CatBoost had the best performance with the ensemble model showing an 8% improvement in performance. The study also found that the date variable had the greatest influence on the number of diners in a cafeteria, followed by menu characteristics and other variables. The implications of the study include the potential for machine learning methodology to improve predictive performance and reduce food waste, as well as the removal of subjective elements in menu classification. Limitations of the research include limited data cases and a weak model structure when new menus or foreign words are not included in the learning data. Future studies should aim to address these limitations.
引用
收藏
页码:485 / 499
页数:15
相关论文
共 50 条
  • [1] Predicting Customer Behavior with Combination of Structured and Unstructured Data
    Afolabi, Ibukun T.
    Worlu, Rowland E.
    Adebayo, O. P.
    Jonathan, Oluranti
    3RD INTERNATIONAL CONFERENCE ON SCIENCE AND SUSTAINABLE DEVELOPMENT (ICSSD 2019): SCIENCE, TECHNOLOGY AND RESEARCH: KEYS TO SUSTAINABLE DEVELOPMENT, 2019, 1299
  • [2] Analysis of Hadoop Performance And Unstructured Data Using Zeppelin
    Harleen
    Garg, Naveen
    2016 INTERNATIONAL CONFERENCE ON RESEARCH ADVANCES IN INTEGRATED NAVIGATION SYSTEMS (RAINS), 2016,
  • [3] Analysis Implementation of the Ensemble Algorithm in Predicting Customer Churn in Telco Data: A Comparative Study
    Sari R.P.
    Febriyanto F.
    Adi A.C.
    Informatica (Slovenia), 2023, 47 (07): : 63 - 70
  • [4] Modeling customer satisfaction from unstructured data using a Bayesian approach
    Farhadloo, Mohsen
    Patterson, Raymond A.
    Rolland, Erik
    DECISION SUPPORT SYSTEMS, 2016, 90 : 1 - 11
  • [5] Sentiment Analysis of Unstructured Data Using Spark for Predicting Stock Market Price Movement
    Darji, Miss Dhara N.
    Parikh, Satyen M.
    Patel, Hiral R.
    INVENTIVE COMPUTATION AND INFORMATION TECHNOLOGIES, ICICIT 2021, 2022, 336 : 521 - 530
  • [6] Predicting Publication of Clinical Trials Using Structured and Unstructured Data: Model Development and Validation Study
    Wang, Siyang
    Suster, Simon
    Baldwin, Timothy
    Verspoor, Karin
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (12)
  • [7] The comparative analysis and study of Mobile-based customer data churn prediction model
    Lei Jin-hui
    He Jian-jun
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 4, PROCEEDINGS, 2009, : 524 - 528
  • [8] Comparative Analysis of Customer Sentiments on Competing Brands using Hybrid Model Approach
    Athindran, Srivats N.
    Manikandaraj, S.
    Kamaleshwar, R.
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018, : 348 - 353
  • [9] Predicting Telecommunication Customer Churn Using Data Mining Techniques
    AlOmari, Diana
    Hassan, Mohammad Mehedi
    INTERNET AND DISTRIBUTED COMPUTING SYSTEMS, IDCS 2016, 2016, 9864 : 167 - 178
  • [10] Data Mining for Predicting Customer Satisfaction Using Clustering Techniques
    Purwandari, Kartika
    Sigalingging, Join W. C.
    Fhadli, Muhammad
    Arizky, Shinta Nur
    Pardamean, Bens
    PROCEEDINGS OF 2020 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND TECHNOLOGY (ICIMTECH), 2020, : 223 - 227