A Scalable Multi-Data Sources Based Recursive Approximation Approach for Fast Error Recovery in Big Sensing Data on Cloud

被引:4
|
作者
Yang, Chi [1 ]
Xu, Xianghua [2 ]
Ramamohanarao, Kotagiri [3 ]
Chen, Jinjun [4 ]
机构
[1] Univ Wollongong, SCIT, Wollongong, NSW 2522, Australia
[2] Hangzhou Dianzi Univ, Hangzhou 310005, Zhejiang, Peoples R China
[3] Univ Melbourne, Melbourne, Vic 3010, Australia
[4] Swinburne Univ Technol, Melbourne, Vic 3122, Australia
基金
澳大利亚研究理事会;
关键词
Sensors; Big Data; Cloud computing; Reliability; Complex networks; Time series analysis; big sensing data; cloud; euclidean distance; error recovery;
D O I
10.1109/TKDE.2019.2895612
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big sensing data is commonly encountered from various surveillance or sensing systems. Sampling and transferring errors are commonly encountered during each stage of sensing data processing. How to recover from these errors with accuracy and efficiency is quite challenging because of high sensing data volume and unrepeatable wireless communication environment. While Cloud provides a promising platform for processing big sensing data, however scalable and accurate error recovery solutions are still need. In this paper, we propose a novel approach to achieve fast error recovery in a scalable manner on cloud. This approach is based on the prediction of a recovery replacement data by making multiple data sources based approximation. The approximation process will use coverage information carried by data units to limit the algorithm in a small cluster of sensing data instead of a whole data spectrum. Specifically, in each sensing data cluster, a Euclidean distance based approximation is proposed to calculate a time series prediction. With the calculated time series, a detected error can be recovered with a predicted data value. Through the experiment with real world meteorological data sets on cloud, we demonstrate that the proposed error recovery approach can achieve high accuracy in data approximation to replace the original data error. At the same time, with MapReduce based implementation for scalability, the experimental results also show significant efficiency on time saving.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 50 条
  • [41] Provisioning big data applications as services on containerised cloud: a microservices-based approach
    Gao Jing
    Li Wubin
    Zhao Zhuofeng
    Han Yanbo
    INTERNATIONAL JOURNAL OF SERVICES TECHNOLOGY AND MANAGEMENT, 2020, 26 (2-3) : 167 - 181
  • [42] Cluster Based Multi Layer User Authentication Data Center Storage Architecture for Big Data Security in Cloud Computing
    Ramasamy, S.
    Gnanamurthy, R. K.
    JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (01): : 159 - 171
  • [43] Optimizing healthcare big data privacy with scalable subtree-based L-Anonymization in cloud environments
    Natarajan, Aravindhraj
    Shanthi, N.
    WIRELESS NETWORKS, 2025, 31 (03) : 2727 - 2742
  • [44] Multi-source remote sensing image big data classification system design in cloud computing environment
    Tong X.-Y.
    Guo C.
    Cheng H.
    International Journal of Internet Manufacturing and Services, 2020, 7 (1-2) : 130 - 145
  • [45] GPU-based fast error recovery for high speed data communication in media technology
    Md Shohidul Islam
    Jong-Myon Kim
    Cluster Computing, 2015, 18 : 93 - 101
  • [46] GPU-based fast error recovery for high speed data communication in media technology
    Islam, Md Shohidul
    Kim, Jong-Myon
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (01): : 93 - 101
  • [47] Scalable Multi-Criteria Decision-Making: A MapReduce deployed Big Data Approach for Skill Analytics
    Bohlouli, Mahdi
    Schrage, Martin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020,
  • [48] BigDEC: A multi-algorithm Big Data tool based on the k-mer spectrum method for scalable short-read error correction
    Exposito, Roberto R.
    Gonzalez-Dominguez, Jorge
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 314 - 329
  • [49] Optimized Big Data Management across Multi-Cloud Data Centers: Software-Defined-Network-Based Analysis
    Chaudhary, Rajat
    Aujla, Gagangeet Singh
    Kumar, Neeraj
    Rodrigues, Joel J. P. C.
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (02) : 118 - 126
  • [50] A Hybrid Machine Learning Approach for Performance Modeling of Cloud-Based Big Data Applications
    Ataie, Ehsan
    Evangelinou, Athanasia
    Gianniti, Eugenio
    Ardagna, Danilo
    COMPUTER JOURNAL, 2022, 65 (12): : 3123 - 3140