Building Forecasting Solutions Using Open-Source and Azure Machine Learning

Authors
Hu, Chenhui [1]
Paunic, Vanja [2]
Affiliations
[1] Microsoft, Boston, MA 02115 USA
[2] Microsoft, Redmond, WA USA
Keywords
Time series forecasting; Demand forecasting; Azure Machine Learning
DOI
10.1145/3394486.3406699
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Time series forecasting is one of the most important topics in data science. Almost every business needs to predict the future in order to make better decisions and allocate resources more effectively. Examples of time series forecasting use cases include financial forecasting, demand forecasting in logistics for operational planning of assets, demand forecasting for Azure resources, and energy demand forecasting for campus buildings and data centers. The goal of this tutorial is to demonstrate state-of-the-art forecasting approaches to problems in retail and to introduce a new repository focusing on best practices in the forecasting domain, along with a library of forecasting utilities [1]. The tutorial will start with a quick overview of time series forecasting and traditional time series models to give the audience a clear background on the kind of problems that we aim to solve. We will also briefly explore the dataset to be used in all exercises. Next, we will run through several exercises to solve a forecasting problem in retail. We will start with a traditional statistical approach, e.g. ARIMA, using an auto-ARIMA function in Python [2]. Next, we will cover machine-learning-based approaches to forecasting, including various ways to featurize the time series dataset, and then train a LightGBM model [6]. Finally, we will describe a deep-neural-network-based approach, namely Dilated CNN, and train a Dilated CNN model on our data [7-8]. Using LightGBM and Dilated CNN, two efficient state-of-the-art models, we can train the models quickly and achieve very high forecasting accuracy. In the last part of the tutorial, we will cover an example of hyperparameter tuning in forecasting, using HyperDrive in the Azure Machine Learning service [3-5]. As part of this exercise, we will also demonstrate how to deploy the trained model to Azure Container Instances (ACI) and test the deployed service.
The repository also contains best-practice implementations in R. Time permitting, we will cover common approaches to solving forecasting problems in R, ranging from simple regression models to more complex ones, such as those provided by the Prophet package.
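
The abstract mentions featurizing a time series before training a tree-based model such as LightGBM, without giving details. As a minimal sketch of one common approach (lag and rolling-mean features), and not necessarily the method used in the tutorial repository, the function below turns a univariate series into supervised-learning rows. The function name `make_lag_features`, its parameters, and the sample data are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence


def make_lag_features(
    series: Sequence[float], lags: Sequence[int], window: int
) -> List[Dict[str, float]]:
    """Build one feature row per time step using only past values.

    Hypothetical helper for illustration; not taken from the tutorial.
    """
    rows: List[Dict[str, float]] = []
    # Skip the first steps, where some lag or the rolling window is undefined.
    start = max(max(lags), window)
    for t in range(start, len(series)):
        # Lag features: the value observed k steps before time t.
        row = {f"lag_{k}": series[t - k] for k in lags}
        # Rolling mean over the `window` values strictly before time t,
        # so the target itself never leaks into the features.
        past = series[t - window:t]
        row[f"rolling_mean_{window}"] = sum(past) / window
        row["target"] = series[t]
        rows.append(row)
    return rows


# Made-up weekly sales figures, for illustration only.
weekly_sales = [10.0, 12.0, 11.0, 13.0, 15.0, 14.0]
rows = make_lag_features(weekly_sales, lags=[1, 2], window=3)
for r in rows:
    print(r)
```

Each resulting row can be passed to any tabular regressor as features plus a target. Computing every feature from values strictly before time t is the key design choice, since it keeps the training rows consistent with what is known at forecast time.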
Pages: 3497-3498
Number of pages: 2