Adaptive similarity search for the retrieval of rare events from large time series databases

被引:0
|
作者
Schlegl, Thomas [1 ,3 ]
Schlegl, Stefan [4 ]
Tomaselli, Domenico [5 ]
West, Nikolai [1 ]
Deuse, Jochen [1 ,2 ]
机构
[1] Institute of Production Systems, TU Dortmund University, Leonhard-Euler-Str. 5, Dortmund,44227, Germany
[2] Centre for Advanced Manufacturing, University of Technology Sydney, 11 Broadway, Ultimo NSW,2007, Australia
[3] BMW Group, Petuelring 130, München,80788, Germany
[4] BotCraft GmbH, Lichtenbergstraße 8, Garching,85748, Germany
[5] Technical University of Munich, Arcisstraße 21, Munich,80333, Germany
关键词
Iterative methods - Manufacture - Learning algorithms - Benchmarking - Information retrieval systems - Search engines - Database systems - Information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
Improving the recall of information retrieval systems for similarity search in time series databases is of great practical importance. In the manufacturing domain, these systems are used to query large databases of manufacturing process data that contain terabytes of time series data from millions of parts. This allows domain experts to identify parts that exhibit specific process faults. In practice, the search often amounts to an iterative query–response cycle in which users define new queries (time series patterns) based on results of previous queries. This is a well-documented phenomenon in information retrieval and not unique to the manufacturing domain. Indexing manufacturing databases to speed up the exploratory search is often not feasible as it may result in an unacceptable reduction in recall. In this paper, we present a novel adaptive search algorithm that refines the query based on relevance feedback provided by the user. Additionally, we propose a mechanism that allows the algorithm to self-adapt to new patterns without requiring any user input. As the search progresses, the algorithm constructs a library of time series patterns that are used to accurately find objects of the target class. Experimental validation of the algorithm on real-world manufacturing data shows, that the recall for the retrieval of fault patterns is considerably higher than that of other state-of-the-art adaptive search algorithms. Additionally, its application to publicly available benchmark data sets shows, that these results are transferable to other domains. © 2022
引用
收藏
相关论文
共 50 条
  • [1] Adaptive similarity search for the retrieval of rare events from large time series databases
    Schlegl, Thomas
    Schlegl, Stefan
    Tomaselli, Domenico
    West, Nikolai
    Deuse, Jochen
    [J]. ADVANCED ENGINEERING INFORMATICS, 2022, 52
  • [2] Parallelization of similarity search in large time series databases
    Qiao, Jonathan
    Ye, Yang
    Zhang, Chaoyang
    [J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 355 - +
  • [3] Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases
    Eamonn Keogh
    Kaushik Chakrabarti
    Michael Pazzani
    Sharad Mehrotra
    [J]. Knowledge and Information Systems, 2001, 3 (3) : 263 - 286
  • [4] Similarity search in time series databases using moments
    Toshniwal, D
    Joshi, RC
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA'04), 2004, : 164 - 171
  • [5] A simple dimensionality reduction technique for fast similarity search in large time series databases
    Keogh, EJ
    Pazzani, MJ
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 122 - 133
  • [6] An adaptive index structure for similarity search in large image databases
    Wu, P
    Manjunath, BS
    [J]. INTERNET MULTIMEDIA MANAGEMENT SYSTEMS II, 2001, 4519 : 32 - 41
  • [7] Interval-focused similarity search in time series databases
    Assfalg, Johannes
    Kriegel, Hans-Peter
    Kroeger, Peer
    Kunath, Peter
    Pryakhin, Alexey
    Renz, Matthias
    [J]. ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 586 - +
  • [8] Anticipatory DTW for Efficient Similarity Search in Time Series Databases
    Assent, Ira
    Wichterich, Marc
    Krieger, Ralph
    Kremer, Hardy
    Seidl, Thomas
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01):
  • [9] Similarity search using the polar wavelet in time series databases
    Kang, Seonggu
    Kim, Jaehwan
    Chae, Jinseok
    Choi, Wonik
    Lee, Sangjun
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2007, 4681 : 1347 - +
  • [10] Fast similarity search in the presence of longitudinal scaling in time series databases
    Keogh, E
    [J]. NINTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1997, : 578 - 584