Effective Detection of Rare Anomalies from Massive Waveform Data Using Heterogeneous Clustering

被引:1
|
作者
Goto, Masaharu [1 ]
Chikamatsu, Kiyoshi [1 ]
Kobayashi, Naoki [1 ]
Ren, Gang [2 ]
Ogihara, Mitsunori [2 ]
机构
[1] Ctr Excellence Keysight Technol Int Japan, Elect Ind Solut Grp, Hachioji, Tokyo, Japan
[2] Univ Miami, Inst Data Sci & Comp, Dept Comp Sci, Coral Gables, FL 33124 USA
关键词
Clustering; Massive waveform data; Waveform analysis; Real-time signal processing; Long-duration waveform; Measurement instruments;
D O I
10.1109/BigData50022.2020.9377945
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Today's measurement instruments are capable of capturing and processing massive amount of waveform data. High sampling rate Analog to Digital Converters (ADCs) and low-cost storages make it relatively easy to collect "big measurement data" at massive scale. More and more measurement instrument users acquire tera-byte-scale waveform data which are essential for hard-to-find failure detection and prediction. However, conventional analysis techniques focus on small fragments of signals and largely lag behind today's test and measurement data assets' processing demands. Most of these techniques are inadequate for coping with the massive data volume and the complexities of the analysis tasks. A previous report by the authors introduced a heterogeneous waveform clustering framework to break the technical barriers. The present paper demonstrates the effectiveness of the proposed framework with real-world application examples at tera-byte data scale. The framework consists of the real-time tagging for pre-sorting incoming data, quick clustering for summarizing data overviews from long-duration recording, and detail clustering for deeper analyses. The tagging process is the critical performance link for satisfying the processing time and hardware constrains. We share theoretical analysis on the degree of freedom involved in the waveform and the tagging results. The data is pre-sorted into tag database with highly efficient retrieval characteristics, allowing the system to provide results quickly and flexibly. Three real-world waveform analysis examples are demonstrated, namely power line voltage, mechanical relay stick error, and Bluetooth device current consumption. Our framework allows efficient and robust exploration of complex signal signatures for detecting extremely rare anomalies. The detected anomaly patterns not only show straightforward engineering usages, but also demonstrate a predictive analysis power of related signal events.
引用
收藏
页码:1513 / 1522
页数:10
相关论文
共 50 条
  • [41] Separating Sensor Anomalies From Process Anomalies in Data-Driven Anomaly Detection
    LaRosa, Nicholas
    Farber, Jacob
    Venkitasubramaniam, Parv
    Blum, Rick
    Al Rashdan, Ahmad
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1704 - 1708
  • [42] Automatic geobody detection from seismic data using minimum message length clustering
    Xu, Y
    Caers, J
    Arroyo-Garcia, C
    COMPUTERS & GEOSCIENCES, 2004, 30 (07) : 741 - 751
  • [43] Extracting Anomalies from Time Sequences Derived from Nuclear Power Plant Data by Using Fixed Width Clustering Algorithm
    Gupta, Aditya
    Toshniwal, Durga
    Gupta, Pramod K.
    Khurana, Vikas
    Upadhyay, Pushp
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1587 - 1592
  • [44] MCNC: Multi-Channel Nonparametric Clustering from Heterogeneous Data
    Thanh-Binh Nguyen
    Vu Nguyen
    Venkatesh, Svetha
    Dinh Phung
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3633 - 3638
  • [45] Decentralized multiple hypothesis testing in Cognitive IOT using massive heterogeneous data
    Jha, Vidyapati
    Tripathi, Priyanka
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (05): : 6889 - 6929
  • [46] An Integrated Framework for Managing Massive and Heterogeneous Sensor Data Using Cloud Computing
    Song, Xin
    Wang, Cuirong
    Chen, Yanjun
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 461 - 464
  • [47] Mining Effective Temporal Specifications from Heterogeneous API Data
    Qian Wu
    Guang-Tai Liang
    Qian-Xiang Wang
    Hong Mei
    Journal of Computer Science and Technology, 2011, 26 : 1061 - 1075
  • [48] Mining Effective Temporal Specifications from Heterogeneous API Data
    吴倩
    梁广泰
    王千祥
    梅宏
    JournalofComputerScience&Technology, 2011, 26 (06) : 1061 - 1075
  • [49] An Effective way of Mining Knowledge from Heterogeneous Data Sources
    Molli, Venkateswara Rao
    Veeramanickam, M. R. M.
    2014 INTERNATIONAL CONFERENCE FOR CONVERGENCE OF TECHNOLOGY (I2CT), 2014,
  • [50] Mining Effective Temporal Specifications from Heterogeneous API Data
    Wu, Qian
    Liang, Guang-Tai
    Wang, Qian-Xiang
    Mei, Hong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2011, 26 (06) : 1061 - 1075