A Scalable Multi-Data Sources Based Recursive Approximation Approach for Fast Error Recovery in Big Sensing Data on Cloud

被引:4
|
作者
Yang, Chi [1 ]
Xu, Xianghua [2 ]
Ramamohanarao, Kotagiri [3 ]
Chen, Jinjun [4 ]
机构
[1] Univ Wollongong, SCIT, Wollongong, NSW 2522, Australia
[2] Hangzhou Dianzi Univ, Hangzhou 310005, Zhejiang, Peoples R China
[3] Univ Melbourne, Melbourne, Vic 3010, Australia
[4] Swinburne Univ Technol, Melbourne, Vic 3122, Australia
基金
澳大利亚研究理事会;
关键词
Sensors; Big Data; Cloud computing; Reliability; Complex networks; Time series analysis; big sensing data; cloud; euclidean distance; error recovery;
D O I
10.1109/TKDE.2019.2895612
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big sensing data is commonly encountered from various surveillance or sensing systems. Sampling and transferring errors are commonly encountered during each stage of sensing data processing. How to recover from these errors with accuracy and efficiency is quite challenging because of high sensing data volume and unrepeatable wireless communication environment. While Cloud provides a promising platform for processing big sensing data, however scalable and accurate error recovery solutions are still need. In this paper, we propose a novel approach to achieve fast error recovery in a scalable manner on cloud. This approach is based on the prediction of a recovery replacement data by making multiple data sources based approximation. The approximation process will use coverage information carried by data units to limit the algorithm in a small cluster of sensing data instead of a whole data spectrum. Specifically, in each sensing data cluster, a Euclidean distance based approximation is proposed to calculate a time series prediction. With the calculated time series, a detected error can be recovered with a predicted data value. Through the experiment with real world meteorological data sets on cloud, we demonstrate that the proposed error recovery approach can achieve high accuracy in data approximation to replace the original data error. At the same time, with MapReduce based implementation for scalability, the experimental results also show significant efficiency on time saving.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 50 条
  • [31] An Ensemble-Based Scalable Approach for Intrusion Detection Using Big Data Framework
    Sahu, Santosh Kumar
    Mohapatra, Durga Prasad
    Rout, Jitendra Kumar
    Sahoo, Kshira Sagar
    Luhach, Ashish Kr
    BIG DATA, 2021, 9 (04) : 303 - 321
  • [32] Distributed and scalable Sybil identification based on nearest neighbour approximation using big data analysis techniques
    Valliyammai, Chinnaiah
    Devakunchari, Ramalingam
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 14461 - 14476
  • [33] A hierarchical multi-objective task scheduling approach for fast big data processing
    Jalalian, Zahra
    Sharifi, Mohsen
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 2307 - 2336
  • [34] A hierarchical multi-objective task scheduling approach for fast big data processing
    Zahra Jalalian
    Mohsen Sharifi
    The Journal of Supercomputing, 2022, 78 : 2307 - 2336
  • [35] Distributed and scalable Sybil identification based on nearest neighbour approximation using big data analysis techniques
    Chinnaiah Valliyammai
    Ramalingam Devakunchari
    Cluster Computing, 2019, 22 : 14461 - 14476
  • [36] Inferring HIV Transmission Network Determinants Using Agent-Based Models Calibrated to Multi-Data Sources
    Niyukuri, David
    Chibawara, Trust
    Nyasulu, Peter Suwirakwenda
    Delva, Wim
    MATHEMATICS, 2021, 9 (21)
  • [37] Near Infrared Spectral Imaging Based on Cloud Data and Wireless Network Sensing in Big Data Sports and Fitness Detection
    Minjin, Guo
    MOBILE NETWORKS & APPLICATIONS, 2024,
  • [38] A hybrid approach for scalable sub-tree anonymization over big data using Map Reduce on cloud
    Zhang, Xuyun
    Liu, Chang
    Nepal, Surya
    Yang, Chi
    Dou, Wanchun
    Chen, Jinjun
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (05) : 1008 - 1020
  • [39] Big data classification of remote sensing image based on cloud computing and convolutional neural network
    Wu, Xiaobo
    SOFT COMPUTING, 2022, 28 (Suppl 2) : 437 - 437
  • [40] Fast and Parallel Trust Computing Scheme Based on Big Data Analysis For Collaboration Cloud Service
    Li, Xiaoyong
    Yuan, Jie
    Ma, Huadong
    Yao, Wenbin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (08) : 1917 - 1931