MDWConv: CNN based on multi-scale atrous pyramid and depthwise separable convolution for long time series forecasting

Cited by: 0
Authors
Tian, Guangpo [1]
Xu, Yunyang [1]
Ma, Xiang [1]
Li, Xuemei [1]
Zhang, Caiming [1, 2]
Affiliations
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Prov Lab Future Intelligence & Financial, Yantai 264005, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Long time series forecasting; Multi-scale atrous pyramid; Depthwise separable convolution; Segmented polynomial activation function
DOI
10.1016/j.neunet.2025.107139
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Long time series forecasting has extensive applications in various fields such as power dispatching, traffic control, and weather forecasting. Recently, models based on the Transformer architecture have dominated the field of time series forecasting. However, these methods lack the ability to handle the correlation of multi-scale information and the interaction of information between variables in model design. This paper proposes a convolutional neural network, MDWConv, based on multi-scale dilated pyramid and depthwise separable convolution. In terms of understanding and integrating multi-scale information, the multi-scale dilated pyramid structure is constructed to capture multi-scale features, and convolution operations are employed to achieve cross-scale information integration, thereby improving the understanding and processing capability of the sequence's rich scale-specific information. A depthwise separable convolution network is constructed, which adopts a grouping strategy: using depthwise convolution to extract long-term dependencies and pointwise convolution for inter-variable information interaction and hidden information extraction. This reduces computational complexity while improving the model's predictive accuracy through enhanced feature representation. We also propose a novel segmented polynomial activation function (TCP), which approximates the GELU function with piecewise cubic Hermite functions in different domains, significantly reducing computational complexity and achieving a faster loss reduction rate. Experiments on various real-world datasets demonstrate that MDWConv outperforms other methods. Despite relying solely on convolutional neural networks, MDWConv still exhibits strong competitiveness.
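The abstract's idea of approximating GELU with piecewise cubic Hermite functions can be illustrated with a minimal sketch. This is not the authors' TCP function: the knot positions in `KNOTS`, the function name `hermite_gelu`, and the choice to return 0 to the left of the first knot and x to the right of the last are all assumptions made for illustration, since the record does not give the domains or coefficients used in the paper.

```python
import math


def gelu(x):
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_grad(x):
    """Derivative of GELU: Phi(x) + x * phi(x), phi being the normal PDF."""
    cdf = 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    pdf = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
    return cdf + x * pdf


# Hypothetical knots (an assumption, not taken from the paper).
KNOTS = [-4.0, -2.0, -1.0, 0.0, 1.0, 2.0, 4.0]
VALUES = [gelu(k) for k in KNOTS]       # function values at the knots
SLOPES = [gelu_grad(k) for k in KNOTS]  # first derivatives at the knots


def hermite_gelu(x):
    """Piecewise cubic Hermite approximation of GELU on the knots above."""
    # Outside the knot range, fall back to the asymptotes of GELU.
    if x <= KNOTS[0]:
        return 0.0
    if x >= KNOTS[-1]:
        return x
    # Locate the segment [KNOTS[i], KNOTS[i + 1]] that contains x.
    i = 0
    while x > KNOTS[i + 1]:
        i += 1
    h = KNOTS[i + 1] - KNOTS[i]
    t = (x - KNOTS[i]) / h
    # Standard cubic Hermite basis polynomials on t in [0, 1].
    h00 = 2 * t**3 - 3 * t**2 + 1
    h10 = t**3 - 2 * t**2 + t
    h01 = -2 * t**3 + 3 * t**2
    h11 = t**3 - t**2
    return (h00 * VALUES[i] + h10 * h * SLOPES[i]
            + h01 * VALUES[i + 1] + h11 * h * SLOPES[i + 1])


def max_abs_error(lo=-6.0, hi=6.0, steps=1200):
    """Largest absolute gap between the approximation and exact GELU."""
    xs = [lo + (hi - lo) * k / steps for k in range(steps + 1)]
    return max(abs(hermite_gelu(x) - gelu(x)) for x in xs)


if __name__ == "__main__":
    print("max abs error on [-6, 6]:", max_abs_error())
```

Once the knot values and slopes are precomputed, each evaluation needs only a few multiplications and additions and no call to `erf`, which is the kind of saving the abstract attributes to the segmented polynomial form. Denser knots would tighten the fit at the cost of more segments.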
Pages: 13
Related papers
50 in total
  • [31] MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
    Zhao, Changchen
    Wang, Hongsheng
    Feng, Yuanjing
    Virtual Reality and Intelligent Hardware, 2023, 5 (02) : 124 - 141
  • [32] TFMSNet: A time series forecasting framework with time–frequency analysis and multi-scale processing
    Song, Xin
    Zhang, Xianglong
    Tian, Wang
    Zhu, Qiqi
    Computers and Electrical Engineering, 2025, 123
  • [33] MSSTNet: Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
    Changchen ZHAO
    Hongsheng WANG
    Yuanjing FENG
    Virtual Reality & Intelligent Hardware (Chinese and English), 2023, 5 (02) : 124 - 141
  • [34] A Multi-Task Learning Based Runoff Forecasting Model for Multi-Scale Chaotic Hydrological Time Series
    Hui Zuo
    Gaowei Yan
    Ruochen Lu
    Rong Li
    Shuyi Xiao
    Yusong Pang
    Water Resources Management, 2024, 38 : 481 - 503
  • [35] A Multi-Task Learning Based Runoff Forecasting Model for Multi-Scale Chaotic Hydrological Time Series
    Zuo, Hui
    Yan, Gaowei
    Lu, Ruochen
    Li, Rong
    Xiao, Shuyi
    Pang, Yusong
    WATER RESOURCES MANAGEMENT, 2024, 38 (01) : 235 - 250
  • [36] Learning the Evolutionary and Multi-scale Graph Structure for Multivariate Time Series Forecasting
    Ye, Junchen
    Liu, Zihan
    Du, Bowen
    Sun, Leilei
    Li, Weimiao
    Fu, Yanjie
    Xiong, Hui
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2296 - 2306
  • [37] Multi-Scale Adaptive Graph Neural Network for Multivariate Time Series Forecasting
    Chen L.
    Chen D.
    Shang Z.
    Wu B.
    Zheng C.
    Wen B.
    Zhang W.
    IEEE Transactions on Knowledge and Data Engineering, 2023, 35 (10) : 10748 - 10761
  • [38] MSPatch: A multi-scale patch mixing framework for multivariate time series forecasting
    Cao, Yizhi
    Tian, Zijian
    Guo, Wenjie
    Liu, Xinggao
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [39] MSGNet: Learning Multi-Scale Inter-Series Correlations for Multivariate Time Series Forecasting
    Cai, Wanlin
    Liang, Yuxuan
    Liu, Xianggen
    Feng, Jianshuai
    Wu, Yuankai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11141 - 11149
  • [40] Lite-YOLOv3: a real-time object detector based on multi-scale slice depthwise convolution and lightweight attention mechanism
    Zhou, Yipeng
    Qian, Huaming
    Ding, Peng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (06)