Ensemble clustering and feature weighting in time series data

被引:0
|
作者
Bahramlou, Ainaz [1 ]
Hashemi, Massoud Reza [1 ]
Zali, Zeinab [1 ]
机构
[1] Isfahan Univ Technol, Dept Elect & Comp Engn, Esfahan 8415683111, Iran
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 15期
关键词
Time series; Ensemble clustering; Feature weighting; Relief algorithm; MapReduce; SELECTION; DIVERSITY;
D O I
10.1007/s11227-023-05290-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ensemble clustering is an important approach in machine learning, which combines multiple hypotheses to minimize the risk of selecting a wrong hypothesis or local minimum. In this study, two parallel and distributed frameworks for ensemble clustering and time series data prediction are presented. The second framework is presented with a higher distribution level than the first framework. Both frameworks were implemented using the MapReduce programming model, with the Relief algorithm's various versions used for feature weighting, including Multisurf*, ReliefF, Simba-Sc, and I-Relief. Additionally, a new version of Relief called M-Relief is introduced and compared to other versions. To analyze the proposed frameworks' performance, Irish weather data, energy consumption data from PJM, and Spanish weather data from the Kaggle dataset were selected. The study's results demonstrated higher clustering accuracy and diversity compared to the basic Relief, with performance measured in terms of memory, runtime, and error metrics.
引用
收藏
页码:16442 / 16478
页数:37
相关论文
共 50 条
  • [1] Ensemble clustering and feature weighting in time series data
    Ainaz Bahramlou
    Massoud Reza Hashemi
    Zeinab Zali
    [J]. The Journal of Supercomputing, 2023, 79 : 16442 - 16478
  • [2] On multivariate binary data clustering and feature weighting
    Bouguila, Nizar
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (01) : 120 - 134
  • [3] ENSEMBLE-BASED TIME SERIES DATA CLUSTERING FOR HIGH DIMENSIONAL DATA
    Saravanan, Sampasetty
    Nawaz, Gulam Mohideen Kadhar
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (04): : 1457 - 1470
  • [4] Ensemble Clustering for Unsupervised Learning of Time Series Data using FPGAs
    Porcello, John C.
    [J]. 2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,
  • [5] Soft Subspace Based Ensemble Clustering for Multivariate Time Series Data
    He, Guoliang
    Jiang, Wenjun
    Peng, Rong
    Yin, Ming
    Han, Min
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7761 - 7774
  • [6] Feature-Based Clustering for Electricity Use Time Series Data
    Rasanen, Teemu
    Kolehmainen, Mikko
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2009, 5495 : 401 - 412
  • [7] An ensemble solution for multivariate time series clustering
    Vazquez, Iago
    Villar, Jose R.
    Sedano, Javier
    de la Cal, Enrique
    Simic, Svetlana
    [J]. NEUROCOMPUTING, 2021, 457 : 182 - 192
  • [8] Ensemble Clustering with Novel Weighting Strategy
    Sun, Yao
    Jia, Hong
    Huang, Jiwu
    [J]. 2018 14TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2018, : 155 - 159
  • [9] Feature Weighting and Feature Selection in Fuzzy Clustering
    Borgelt, Christian
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 838 - 844
  • [10] Trend Feature-based Clustering for Research Funding Time Series Data
    Ma Yixuan
    Gao Xuedong
    Pan Baoxiang
    [J]. 2015 INTERNATIONAL CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCES (LISS), 2015,