LCSS-Based Algorithm for Computing Multivariate Data Set Similarity: A Case Study of Real-Time WSN Data

被引:7
|
作者
Khan, Rahim [1 ]
Ali, Ihsan [2 ]
Altowaijri, Saleh M. [3 ]
Zakarya, Muhammad [1 ]
Rahman, Atiq Ur [3 ]
Ahmedy, Ismail [2 ]
Khan, Anwar [4 ]
Gani, Abdullah [5 ]
机构
[1] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, Pakistan
[2] Univ Malaya, Fac Comp Sci & IT, Dept Comp Syst & Technol, Kuala Lumpur 50603, Malaysia
[3] Northern Border Univ, Fac Comp & Informat Technol, Rafha 91911, Saudi Arabia
[4] Univ Peshawar, Dept Elect, Peshawar 25000, Pakistan
[5] Taylors Univ, Sch Comp & Informat Technol, Subang Jaya 47500, Malaysia
关键词
multivariate data set; longest common subsequence; dynamic programming; WSN data; LONGEST COMMON SUBSEQUENCE; SERIES;
D O I
10.3390/s19010166
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multivariate data sets are common in various application areas, such as wireless sensor networks (WSNs) and DNA analysis. A robust mechanism is required to compute their similarity indexes regardless of the environment and problem domain. This study describes the usefulness of a non-metric-based approach (i.e., longest common subsequence) in computing similarity indexes. Several non-metric-based algorithms are available in the literature, the most robust and reliable one is the dynamic programming-based technique. However, dynamic programming-based techniques are considered inefficient, particularly in the context of multivariate data sets. Furthermore, the classical approaches are not powerful enough in scenarios with multivariate data sets, sensor data or when the similarity indexes are extremely high or low. To address this issue, we propose an efficient algorithm to measure the similarity indexes of multivariate data sets using a non-metric-based methodology. The proposed algorithm performs exceptionally well on numerous multivariate data sets compared with the classical dynamic programming-based algorithms. The performance of the algorithms is evaluated on the basis of several benchmark data sets and a dynamic multivariate data set, which is obtained from a WSN deployed in the Ghulam Ishaq Khan (GIK) Institute of Engineering Sciences and Technology. Our evaluation suggests that the proposed algorithm can be approximately 39.9% more efficient than its counterparts for various data sets in terms of computational time.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Similarity query processing algorithm over data stream based on LCSS
    Wang, Shaopeng
    Wen, Yingyou
    Zhao, Hong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (09): : 1976 - 1991
  • [2] A real-time grid-based clustering algorithm for large data set
    Yu, Zhiwen
    Wong, Hau-San
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 740 - +
  • [3] Algorithm of WSN data collection based on similarity prediction
    Li, Ping, 1600, Chinese Academy of Sciences (25):
  • [4] Study on the Similarity Query Based on LCSS over Data Stream Window
    Wang, Shaopeng
    Wen, Yingyou
    Zhao, Hong
    2015 IEEE 12TH INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE), 2015, : 68 - 73
  • [5] REAL-TIME COMPUTING OF NEUROPHYSIOLOGICAL DATA
    ARBUTHNOTT, GW
    DONNELLY, S
    WHALE, D
    JOURNAL OF PHYSIOLOGY-LONDON, 1984, 346 (JAN): : P20 - P20
  • [6] A Real-Time Data Set for Switzerland
    Indergand R.
    Leist S.
    Swiss Journal of Economics and Statistics, 2014, 150 (4) : 331 - 352
  • [7] A real-time data set for macroeconomists
    Croushore, D
    Stark, T
    JOURNAL OF ECONOMETRICS, 2001, 105 (01) : 111 - 130
  • [8] A Real-Time AIS Data Cleaning and Indicator Analysis Algorithm Based on Stream Computing
    Lv T.
    Tang P.
    Zhang J.
    Scientific Programming, 2023, 2023
  • [9] RDCM: An Efficient Real-Time Data Collection Model for IoT/WSN Edge With Multivariate Sensors
    Alduais, Nayef Abdulwahab Mohammed
    Abdullah, Jiwa
    Jamil, Ansar
    IEEE ACCESS, 2019, 7 : 89063 - 89082
  • [10] Real-time squared: A real-time data set for real-time GDP forecasting
    Golinelli, Roberto
    Parigi, Giuseppe
    INTERNATIONAL JOURNAL OF FORECASTING, 2008, 24 (03) : 368 - 385