SEAnet: A Deep Learning Architecture for Data Series Similarity Search

被引:0
|
作者
Wang, Qitong [1 ]
Palpanas, Themis [1 ]
机构
[1] Univ Paris Cite, LIPADE, F-75006 Paris, France
关键词
Data series; similarity search; neural networks; sampling; LERNAEAN HYDRA; COCONUT;
D O I
10.1109/TKDE.2023.3270264
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key operation for massive data series collection analysis is similarity search. According to recent studies, SAX-based indexes offer state-of-the-art performance for similarity search tasks. However, their performance lags under high-frequency, weakly correlated, excessively noisy, or other dataset-specific properties. In this work, we propose Deep Embedding Approximation (DEA), a novel family of data series summarization techniques based on deep neural networks. Moreover, we describe SEAnet, a novel architecture especially designed for learning DEA, that introduces the Sum of Squares preservation property into the deep network design. We further enhance SEAnet with SEAtrans encoder. Finally, we propose novel sampling strategies, SEAsam and SEAsamE, that allow SEAnet to effectively train on massive datasets. Comprehensive experiments on 7 diverse synthetic and real datasets verify the advantages of DEA learned using SEAnet in providing high-quality data series summarizations and similarity search results.
引用
收藏
页码:12972 / 12986
页数:15
相关论文
共 50 条
  • [1] Deep Learning Embeddings for Data Series Similarity Search
    Wang, Qitong
    Palpanas, Themis
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1708 - 1716
  • [2] Hercules Against Data Series Similarity Search
    Echihabi, Karima
    Fatourou, Panagiota
    Zoumpatianos, Kostas
    Palpanas, Themis
    Benbrahim, Houda
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (10): : 2005 - 2018
  • [3] Time Series Similarity Search Methods for Sensor Data
    Jawale, Anupama
    Magar, Ganesh
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (02) : 120 - 129
  • [4] Time Series Similarity Search Methods for Sensor Data
    Automatic Control and Computer Sciences, 2022, 56 : 120 - 129
  • [5] Unaligned Sequence Similarity Search Using Deep Learning
    Senter, James K.
    Royalty, Taylor M.
    Steen, Andrew D.
    Sadovnik, Amir
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1892 - 1899
  • [6] Deep Active Learning with a Neural Architecture Search
    Geifman, Yonatan
    El-Yaniv, Ran
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Data Series Progressive Similarity Search with Probabilistic Quality Guarantees
    Gogolou, Anna
    Tsandilas, Theophanis
    Echihabi, Karima
    Bezerianos, Anastasia
    Palpanas, Themis
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 1857 - 1873
  • [8] Odyssey: A Journey in the Land of Distributed Data Series Similarity Search
    Chatzakis, Manos
    Fatourou, Panagiota
    Kosmas, Eleftherios
    Palpanas, Themis
    Peng, Botao
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (05): : 1140 - 1153
  • [9] Automated Deep Learning: Neural Architecture Search Is Not the End
    Dong, Xuanyi
    Kedziora, David Jacob
    Musial, Katarzyna
    Gabrys, Bogdan
    FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2024, 17 (05): : 767 - 920
  • [10] Learning deep morphological networks with neural architecture search
    Hu, Yufei
    Belkhir, Nacim
    Angulo, Jesus
    Yao, Angela
    Franchi, Gianni
    PATTERN RECOGNITION, 2022, 131