Mutual adaptation: Learning from prototype for time-series prediction

被引:0
|
作者
Chen J. [1 ]
Shi X. [1 ,2 ]
Zhang H. [3 ]
Li W. [1 ]
Li P. [1 ]
Yao Y. [1 ]
Song X. [4 ]
Shibasaki R. [1 ]
机构
[1] Center for Spatial Information Science, University of Tokyo, Chiba
[2] School of Business, Society and Technology, Mälardalens University, Västeras
[3] LocationMind Inc., Chiyoda-ku
[4] SUSTech-UTokyo Joint Research Center on Super SmartCity, Department OfComputer Science AndEngineering, Southern University of Science and Technology, Shenzhen
来源
关键词
Knowledge based systems; Machine learning; Swarm systems; Temporal learning;
D O I
10.1109/TAI.2023.3282201
中图分类号
学科分类号
摘要
Time-series prediction is a current research hotspot in deep learning. However, due to the complex nature of time-series data, the modeling in this task is often highly nonconvex, which canmake the final convergence unstable. To address this challenge, recentworks have proposed deep mutual learning frameworks that allow models to learn from both ground truth and knowledge of othermodels in order to locate a better convergence point.However, a key disadvantage of deep mutual learning is that models that converge to poor local optima may still share their knowledge, limiting the overall performance. To overcome this limitation, in this article, we propose a new learning framework called mutual adaptation, which selects a prototypemodel that has the least error among all the models in the framework as the common teacher model. In addition, we incorporate a strategy of learning from each individual model's best local optimum in the history of training. Our experimental results show that, on average across multiple datasets, our method improves the performance of both Informer and long short-Term memory (LSTM) models compared to deep mutual learning by 4.73% in mean absolute error (MAE) and 6.99% in mean squared error (MSE) for Informer, and 11.54% in MAE and 18.15% in MSE for LSTM. We also demonstrate the importance of memory of individual best local optima and provide sensitivity analysis and visualization of error and the loss descending process. Our method represents a new state-of-The-Art in group learning for time-series prediction. Impact Statement-Deep mutual learning locates a more robust and better modelling of deep learning by merging different models for interactive learning. Every model learns fromboth groundtruth and knowledge of other models. However, its limitation is that models, which find bad local optima, would also share knowledge and probably lead others to worse performance. In this paper, we propose a framework named mutual adaptation. It selects a prototype model that has the least error from groundtruth among the models as the common teacher model of others. In addition, we recommend each model learn from individual's best output in the history of learning. In the experiment, among all of datasets, the performance of different models is improved by mutual adaptation and shows better performance than deep mutual learning. We believe our method could represents the advances in group deep learning. © 2023 IEEE.
引用
收藏
页码:1247 / 1262
页数:15
相关论文
共 50 条
  • [31] ENHANCED MACHINE LEARNING MODEL FOR METEO-OCEANOGRAPHIC TIME-SERIES PREDICTION
    Chirosca, Gianina
    Mihailov, Maria Emanuela
    Tomescu-Chivu, Maria Ionela
    Chirosca, Alecsandru Vladimir
    ROMANIAN JOURNAL OF PHYSICS, 2022, 67 (9-10):
  • [32] High-Order Temporal Correlation Model Learning for Time-Series Prediction
    Jing, Peiguang
    Su, Yuting
    Jin, Xiao
    Zhang, Chengqian
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (06) : 2385 - 2397
  • [33] Time-series Machine Learning Approach to Sepsis Prediction in the Intensive Care Unit
    Sears, I.
    Levy, M. M.
    Ventetuolo, C. E.
    Eickhoff, C.
    Abbasi, A.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2023, 207
  • [34] Employing local modeling in machine learning based methods for time-series prediction
    Wu, Shin-Fu
    Lee, Shie-Jue
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 341 - 354
  • [35] Composite FORCE Learning of Chaotic Echo State Networks for Time-Series Prediction
    Li, Yansong
    Hu, Kai
    Nakajima, Kohei
    Pan, Yongping
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7355 - 7360
  • [36] Learning Evolvable Time-series Shapelets
    Yamaguchi, Akihiro
    Ueo, Ken
    Kashima, Hisashi
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 793 - 805
  • [37] An Effective Patient Representation Learning for Time-series Prediction Tasks Based on EHRs
    Lei, Liqi
    Zhou, Yangming
    Zhai, Jie
    Zhang, Le
    Fang, Zhijia
    He, Ping
    Gao, Ju
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 885 - 892
  • [38] Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models
    Karlsson, Rickard K. A.
    Willbo, Martin
    Hussain, Zeshan
    Krishnan, Rahul G.
    Sontag, David
    Johansson, Fredrik D.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [39] Percolative Learning: Time-Series Predictions from Future Tendencies
    Takaishi, Kazuki
    Kobayashi, Masayuki
    Yanagimoto, Miku
    Nagao, Tomoharu
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 1647 - 1652
  • [40] Learning recurrent behaviors from heterogeneous multivariate time-series
    Duchene, Florence
    Garbay, Catherine
    Rialle, Vincent
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 39 (01) : 25 - 47