A Large-Scale Ensemble Learning Framework for Demand Forecasting

被引：1

作者：

Park, Young-Jin ^{[1
]}

Kim, Donghyun ^{[2
]}

Odermatt, Frederic ^{[3
]}

Lee, Juho ^{[4
]}

Kim, Kyung-Min ^{[5
,6
]}

机构：

[1] MIT, Cambridge, MA 02139 USA

[2] Seoul Natl Univ, Seoul, South Korea

[3] ETH Z urich, Zurich, Switzerland

[4] Superpetual Inc, Seoul, South Korea

[5] NAVER CLOVA, Seongnam, South Korea

[6] NAVER Lab, Seongnam, South Korea

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2022年

关键词：

D O I：

10.1109/ICDM54844.2022.00048

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Demand forecasting is a crucial component of supply chain management for revenue optimization and inventory planning. Traditional time series forecasting methods, however, have resulted in small models with limited expressive power because they have difficulty in scaling their model size up while maintaining high accuracy. In this paper, we propose Forecasting orchestra (Forchestra), a simple but powerful ensemble framework capable of accurately predicting future demand for a diverse range of items. Forchestra consists of two parts: 1) base predictors and 2) a neural conductor. For a given time series, each base predictor outputs its respective forecast based on historical observations. On top of the base predictors, the neural conductor adaptively assigns the importance weight for each predictor by looking at the representation vector provided by a representation module. Finally, Forchestra aggregates the predictions by the weights and constructs a final prediction. In contrast to previous ensemble approaches, the neural conductor and all base predictors of Forchestra are trained in an endto-end manner; this allows each base predictor to modify its reaction to different inputs, while supporting other predictors and constructing a final prediction jointly. We empirically show that the model size is scalable to up to 0.8 billion parameters (approximate to 400-layer LSTM). The proposed method is evaluated on our proprietary E-Commerce (100K) and the public M5 (30K) datasets, and it outperforms existing forecasting models with a significant margin. In addition, we observe that our framework generalizes well to unseen data points when evaluated in a zeroshot fashion on downstream datasets. Last but not least, we present extensive qualitative and quantitative studies to analyze how the proposed model outperforms baseline models and differs from conventional ensemble approaches. The code is available at https://github.com/young-j-park/22-ICDM-Forchestra.

引用

页码：378 / 387

页数：10

共 50 条

[1] Demand Forecasting of Online Car-Hailing With Stacking Ensemble Learning Approach and Large-Scale Datasets
Jin, Yuming
Ye, Xiaofei
Ye, Qiming
Wang, Tao
Cheng, Jun
Yan, Xingchen
IEEE ACCESS, 2020, 8 : 199513 - 199522
[2] Ensemble Learning Models for Large-Scale Time Series Forecasting in Supply Chain
Zhang, Minjuan
Wu, Chase Q.
Hou, Aiqin
2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2286 - 2294
[3] Ensemble Learning for Large-Scale Workload Prediction
Singh, Nidhi
Rao, Shrisha
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (02) : 149 - 165
[4] Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms
Tyralis, Hristos
Papacharalampous, Georgia
Langousis, Andreas
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (08): : 3053 - 3068
[5] Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms
Hristos Tyralis
Georgia Papacharalampous
Andreas Langousis
Neural Computing and Applications, 2021, 33 : 3053 - 3068
[6] A large-scale evaluation of machine learning algorithms in mid-term water demand forecasting
Michalopoulos, Christos
Dimas, Panagiotis
Kossieris, Panagiotis
Pelekanos, Nikos
Makropoulos, Christos
WATER PRACTICE AND TECHNOLOGY, 2024, 19 (07) : 2693 - 2711
[7] Large-scale Short-term Urban Taxi Demand Forecasting Using Deep Learning
Liao, Siyu
Zhou, Liutong
Di, Xuan
Yuan, Bo
Xiong, Jinjun
2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 428 - 433
[8] Aggregation models in ensemble learning: A large-scale comparison
Campagner, Andrea
Ciucci, Davide
Cabitza, Federico
INFORMATION FUSION, 2023, 90 : 241 - 252
[9] Ensemble learning for large-scale crowd flow prediction
Karbovskii, Vladislav
Lees, Michael
Presbitero, Alva
Kurilkin, Alexey
Voloshin, Daniil
Derevitskii, Ivan
Karsakov, Andrey
Sloot, Peter M. A.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
[10] Machine learning for large-scale crop yield forecasting
Paudel, Dilli
Boogaard, Hendrik
de Wit, Allard
Janssen, Sander
Osinga, Sjoukje
Pylianidis, Christos
Athanasiadis, Ioannis N.
AGRICULTURAL SYSTEMS, 2021, 187

← 1 2 3 4 5 →