Multimodal Temporal Fusion Transformers are Good Product Demand Forecasters

Authors
Sukel, Maarten [1]
Rudinac, Stevan [1]
Worring, Marcel [1]
Affiliations
[1] Univ Amsterdam, NL-1089 XH Amsterdam, Netherlands
Keywords
Demand forecasting; Task analysis; Transformers; Feature extraction; Visualization; Logic gates; Data mining; Multimodal sensors
DOI
10.1109/MMUL.2024.3373827
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Multimodal demand forecasting aims at predicting product demand utilizing visual, textual, and contextual information. This article proposes a method for such forecasting using an integrated architecture composed of convolutional, graph-based, and transformer-based networks. Since traditional forecasting methods depend on historical demand and factors like manually generated categorical information, they face challenges such as the cold start problem and handling of category dynamics. To address these challenges, our architecture allows for incorporating multimodal information, such as geographical information, product images, and textual descriptions. Experiments with the multimodal approach are performed on a real-world dataset of more than 50 million data points of article demand. The pipeline presented in this work enhances the reliability of the predictions, demonstrating the potential of leveraging multimodal information in product demand forecasting.
Pages: 48-60
Page count: 13