A Large-Scale Ensemble Learning Framework for Demand Forecasting

被引:1
|
作者
Park, Young-Jin [1 ]
Kim, Donghyun [2 ]
Odermatt, Frederic [3 ]
Lee, Juho [4 ]
Kim, Kyung-Min [5 ,6 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Seoul Natl Univ, Seoul, South Korea
[3] ETH Z urich, Zurich, Switzerland
[4] Superpetual Inc, Seoul, South Korea
[5] NAVER CLOVA, Seongnam, South Korea
[6] NAVER Lab, Seongnam, South Korea
关键词
D O I
10.1109/ICDM54844.2022.00048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Demand forecasting is a crucial component of supply chain management for revenue optimization and inventory planning. Traditional time series forecasting methods, however, have resulted in small models with limited expressive power because they have difficulty in scaling their model size up while maintaining high accuracy. In this paper, we propose Forecasting orchestra (Forchestra), a simple but powerful ensemble framework capable of accurately predicting future demand for a diverse range of items. Forchestra consists of two parts: 1) base predictors and 2) a neural conductor. For a given time series, each base predictor outputs its respective forecast based on historical observations. On top of the base predictors, the neural conductor adaptively assigns the importance weight for each predictor by looking at the representation vector provided by a representation module. Finally, Forchestra aggregates the predictions by the weights and constructs a final prediction. In contrast to previous ensemble approaches, the neural conductor and all base predictors of Forchestra are trained in an endto-end manner; this allows each base predictor to modify its reaction to different inputs, while supporting other predictors and constructing a final prediction jointly. We empirically show that the model size is scalable to up to 0.8 billion parameters (approximate to 400-layer LSTM). The proposed method is evaluated on our proprietary E-Commerce (100K) and the public M5 (30K) datasets, and it outperforms existing forecasting models with a significant margin. In addition, we observe that our framework generalizes well to unseen data points when evaluated in a zeroshot fashion on downstream datasets. Last but not least, we present extensive qualitative and quantitative studies to analyze how the proposed model outperforms baseline models and differs from conventional ensemble approaches. The code is available at https://github.com/young-j-park/22-ICDM-Forchestra.
引用
收藏
页码:378 / 387
页数:10
相关论文
共 50 条
  • [1] Demand Forecasting of Online Car-Hailing With Stacking Ensemble Learning Approach and Large-Scale Datasets
    Jin, Yuming
    Ye, Xiaofei
    Ye, Qiming
    Wang, Tao
    Cheng, Jun
    Yan, Xingchen
    IEEE ACCESS, 2020, 8 : 199513 - 199522
  • [2] Ensemble Learning Models for Large-Scale Time Series Forecasting in Supply Chain
    Zhang, Minjuan
    Wu, Chase Q.
    Hou, Aiqin
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2286 - 2294
  • [3] Ensemble Learning for Large-Scale Workload Prediction
    Singh, Nidhi
    Rao, Shrisha
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (02) : 149 - 165
  • [4] Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms
    Tyralis, Hristos
    Papacharalampous, Georgia
    Langousis, Andreas
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (08): : 3053 - 3068
  • [5] Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms
    Hristos Tyralis
    Georgia Papacharalampous
    Andreas Langousis
    Neural Computing and Applications, 2021, 33 : 3053 - 3068
  • [6] A large-scale evaluation of machine learning algorithms in mid-term water demand forecasting
    Michalopoulos, Christos
    Dimas, Panagiotis
    Kossieris, Panagiotis
    Pelekanos, Nikos
    Makropoulos, Christos
    WATER PRACTICE AND TECHNOLOGY, 2024, 19 (07) : 2693 - 2711
  • [7] Large-scale Short-term Urban Taxi Demand Forecasting Using Deep Learning
    Liao, Siyu
    Zhou, Liutong
    Di, Xuan
    Yuan, Bo
    Xiong, Jinjun
    2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 428 - 433
  • [8] Aggregation models in ensemble learning: A large-scale comparison
    Campagner, Andrea
    Ciucci, Davide
    Cabitza, Federico
    INFORMATION FUSION, 2023, 90 : 241 - 252
  • [9] Ensemble learning for large-scale crowd flow prediction
    Karbovskii, Vladislav
    Lees, Michael
    Presbitero, Alva
    Kurilkin, Alexey
    Voloshin, Daniil
    Derevitskii, Ivan
    Karsakov, Andrey
    Sloot, Peter M. A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [10] Machine learning for large-scale crop yield forecasting
    Paudel, Dilli
    Boogaard, Hendrik
    de Wit, Allard
    Janssen, Sander
    Osinga, Sjoukje
    Pylianidis, Christos
    Athanasiadis, Ioannis N.
    AGRICULTURAL SYSTEMS, 2021, 187