Multimodal Temporal Fusion Transformers are Good Product Demand Forecasters

Authors
Sukel, Maarten [1]
Rudinac, Stevan [1]
Worring, Marcel [1]
Affiliations
[1] Univ Amsterdam, NL-1089 XH Amsterdam, Netherlands
Keywords
Demand forecasting; Task analysis; Transformers; Feature extraction; Visualization; Logic gates; Data mining; Multimodal sensors
DOI
10.1109/MMUL.2024.3373827
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Multimodal demand forecasting aims at predicting product demand utilizing visual, textual, and contextual information. This article proposes a method for such forecasting using an integrated architecture composed of convolutional, graph-based, and transformer-based networks. Since traditional forecasting methods depend on historical demand and factors like manually generated categorical information, they face challenges such as the cold start problem and handling of category dynamics. To address these challenges, our architecture allows for incorporating multimodal information, such as geographical information, product images, and textual descriptions. Experiments with the multimodal approach are performed on a real-world dataset of more than 50 million data points of article demand. The pipeline presented in this work enhances the reliability of the predictions, demonstrating the potential of leveraging multimodal information in product demand forecasting.
Pages: 48-60
Page count: 13