Learning manifolds from non-stationary streams

被引:0
|
作者
Mahapatra, Suchismit [1 ]
Chandola, Varun [1 ]
机构
[1] SUNY Buffalo, Dept Comp Sci, Buffalo, NY 14261 USA
关键词
Manifold learning; Dimension reduction; Streaming data; Isomap; Gaussian process; Primary; NONLINEAR DIMENSIONALITY REDUCTION; EIGENMAPS;
D O I
10.1186/s40537-023-00872-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Streaming adaptations of manifold learning based dimensionality reduction methods, such as Isomap, are based on the assumption that a small initial batch of observations is enough for exact learning of the manifold, while remaining streaming data instances can be cheaply mapped to this manifold. However, there are no theoretical results to show that this core assumption is valid. Moreover, such methods typically assume that the underlying data distribution is stationary and are not equipped to detect, or handle, sudden changes or gradual drifts in the distribution that may occur when the data is streaming. We present theoretical results to show that the quality of a manifold asymptotically converges as the size of data increases. We then show that a Gaussian Process Regression (GPR) model, that uses a manifold-specific kernel function and is trained on an initial batch of sufficient size, can closely approximate the state-of-art streaming Isomap algorithms, and the predictive variance obtained from the GPR prediction can be employed as an effective detector of changes in the underlying data distribution. Results on several synthetic and real data sets show that the resulting algorithm can effectively learn lower dimensional representation of high dimensional data in a streaming setting, while identifying shifts in the generative distribution. For instance, key findings on a Gas sensor array data set show that our method can detect changes in the underlying data stream, triggered due to real-world factors, such as introduction of a new gas in the system, while efficiently mapping data on a low-dimensional manifold.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Social Learning in non-stationary environments
    Boursier, Etienne
    Perchet, Vianney
    Scarsini, Marco
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [22] The complexity of non-stationary reinforcement learning
    Peng, Binghui
    Papadimitriou, Christos
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
  • [23] Learning dynamic causal mechanisms from non-stationary data
    Ruichu Cai
    Liting Huang
    Wei Chen
    Jie Qiao
    Zhifeng Hao
    Applied Intelligence, 2023, 53 : 5437 - 5448
  • [24] Preserving Differential Privacy and Utility of Non-Stationary Data Streams
    Khavkin, Michael
    Last, Mark
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 29 - 34
  • [25] An efficient method for anomaly detection in non-stationary data streams
    Chenaghlou, Milad
    Moshtaghi, Masud
    Leckie, Christopher
    Salehi, Mahsa
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [26] Learning dynamic causal mechanisms from non-stationary data
    Cai, Ruichu
    Huang, Liting
    Chen, Wei
    Qiao, Jie
    Hao, Zhifeng
    APPLIED INTELLIGENCE, 2023, 53 (05) : 5437 - 5448
  • [27] Dirichlet process mixture models for non-stationary data streams
    Casado, Ioar
    Perez, Aritz
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 873 - 878
  • [28] Chaotic Security Based On Non-stationary Dynamics and Random Manifolds
    Thang Manh Hoang
    Tien Dzung Nguyen
    Kyamakya, Kyandoghere
    PROCEEDINGS OF INDS '09: SECOND INTERNATIONAL WORKSHOP ON NONLINEAR DYNAMICS AND SYNCHRONIZATION 2009, 2009, 4 : 65 - +
  • [29] Online Machine Learning from Non-stationary Data Streams in the Presence of Concept Drift and Class Imbalance: A Systematic Review
    Palli, Abdul Sattar
    Jaafar, Jafreezal
    Gilal, Abdul Rehman
    Alsughayyir, Aeshah
    Gomes, Heitor Murilo
    Alshanqiti, Abdullah
    Omar, Mazni
    JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2024, 23 (01): : 105 - 139
  • [30] Stationary vs. Non-stationary Mobile Learning in MOOCs
    Zhao, Yue
    Robal, Tarmo
    Lofi, Christoph
    Hauff, Claudia
    UMAP'18: ADJUNCT PUBLICATION OF THE 26TH CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, 2018, : 299 - 303