A Scalable Multi-Data Sources Based Recursive Approximation Approach for Fast Error Recovery in Big Sensing Data on Cloud

被引:4
|
作者
Yang, Chi [1 ]
Xu, Xianghua [2 ]
Ramamohanarao, Kotagiri [3 ]
Chen, Jinjun [4 ]
机构
[1] Univ Wollongong, SCIT, Wollongong, NSW 2522, Australia
[2] Hangzhou Dianzi Univ, Hangzhou 310005, Zhejiang, Peoples R China
[3] Univ Melbourne, Melbourne, Vic 3010, Australia
[4] Swinburne Univ Technol, Melbourne, Vic 3122, Australia
基金
澳大利亚研究理事会;
关键词
Sensors; Big Data; Cloud computing; Reliability; Complex networks; Time series analysis; big sensing data; cloud; euclidean distance; error recovery;
D O I
10.1109/TKDE.2019.2895612
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big sensing data is commonly encountered from various surveillance or sensing systems. Sampling and transferring errors are commonly encountered during each stage of sensing data processing. How to recover from these errors with accuracy and efficiency is quite challenging because of high sensing data volume and unrepeatable wireless communication environment. While Cloud provides a promising platform for processing big sensing data, however scalable and accurate error recovery solutions are still need. In this paper, we propose a novel approach to achieve fast error recovery in a scalable manner on cloud. This approach is based on the prediction of a recovery replacement data by making multiple data sources based approximation. The approximation process will use coverage information carried by data units to limit the algorithm in a small cluster of sensing data instead of a whole data spectrum. Specifically, in each sensing data cluster, a Euclidean distance based approximation is proposed to calculate a time series prediction. With the calculated time series, a detected error can be recovered with a predicted data value. Through the experiment with real world meteorological data sets on cloud, we demonstrate that the proposed error recovery approach can achieve high accuracy in data approximation to replace the original data error. At the same time, with MapReduce based implementation for scalability, the experimental results also show significant efficiency on time saving.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 50 条
  • [1] A Scalable Data Chunk Similarity Based Compression Approach for Efficient Big Sensing Data Processing on Cloud
    Yang, Chi
    Chen, Jinjun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (06) : 1144 - 1157
  • [2] A MapReduce Based Approach of Scalable Multidimensional Anonymization for Big Data Privacy Preservation on Cloud
    Zhang, Xuyun
    Yang, Chi
    Nepal, Surya
    Liu, Chang
    Dou, Wanchun
    Chen, Jinjun
    2013 IEEE THIRD INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING (CGC 2013), 2013, : 105 - 112
  • [3] IoE based private multi-data center cloud architecture framework
    Dhaya, R.
    Kanthavel, R.
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [4] Fault Diagnosis System of Power Grid Based on Multi-Data Sources
    Ji, Jinjie
    Chen, Qing
    Jin, Lei
    Zhou, Xiaotong
    Ding, Wei
    APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [5] Probability based Data Mining Approach with Big Data in Cloud Infrastructure
    Kittappa, Thiagarajan
    Vasudevan, Rajeswari
    Karuppusamy, Saranya
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE, MULTIMEDIA AND COMMUNICATION ENGINEERING (SMCE 2015), 2015, : 277 - 281
  • [6] GeoRocket: A scalable and cloud -based data store for big geospatial files
    Kraemer, Michel
    SOFTWAREX, 2020, 11
  • [7] Fast-Sec: an approach to secure Big Data processing in the cloud
    dos Anjos, Julio C. S.
    Galibus, Tatiana
    Geyer, Claudio F. R.
    Fedak, Gilles
    Costa, Joao Paulo C. L.
    Pereira, Rubem
    de Freitas, Edison Pignaton
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (03) : 272 - 287
  • [8] Dependable Data Outsourcing Scheme Based on Cloud-of-Clouds Approach with Fast Recovery
    Fan, Chun-, I
    Huang, Jheng-Jia
    Tseng, Shang-Wei
    Chen, I-Te
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2021, 9 (02) : 546 - 561
  • [9] A Scalable Adaptive Sampling Based Approach for Big Data Classification
    Djouzi, Kheyreddine
    Beghdad-Bey, Kadda
    Amamra, Abdenour
    ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2022, 513 : 73 - 83
  • [10] MBMD-LoRa Scalable LoRaWAN for Internet of Things: A Multi-band Multi-data Rate Approach
    Almuhaya, Mkrm
    Al-Hadhrami, Tawfik
    Kaiwartya, Omparakash
    Brown, David. J.
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PT II, 2024, 2166 : 54 - 67