IoT streaming data integration from multiple sources

被引:0
|
作者
Doan Quang Tu
A. S. M. Kayes
Wenny Rahayu
Kinh Nguyen
机构
[1] La Trobe University,
来源
Computing | 2020年 / 102卷
关键词
IoT streaming data integration; Timing alignment; De-duplication; Window-based integration; 68U35; 68-04; 94Axx;
D O I
暂无
中图分类号
学科分类号
摘要
The Internet of Things (IoT) has recently received considerable interest due to the development of smart technologies in today’s interconnected world. With the rapid advancement in Internet technologies and the proliferation of IoT sensors, myriad systems and applications generate data of a massive volume, variety and velocity which traditional databases and systems are unable to manage effectively. Many organizations need to deal with these massive datasets that encounter different types of data (e.g., IoT streaming data, static data) in different formats (e.g., structured, semi-structured) coming from multiple sources. Several data integration mechanisms have been designed to process mostly static data. Unfortunately, these techniques are not able to deal with and integrate IoT streaming datasets from multiple sources. In this paper, we identify the challenges of IoT Streaming Data Integration (ISDI) and present a formal approach for the real-time integration of such IoT streaming datasets. We address one of the important issues of timing conflict/alignment among streaming data coming from multiple sources. A generic window-based ISDI approach is proposed to deal with IoT data in different formats and algorithms are developed to integrate IoT streaming data from multiple sources. In particular, we extend the basic windowing algorithm for real-time data integration and to deal with the timing alignment issue. We also introduce a de-duplication algorithm to deal with data redundancy and to demonstrate the useful fragments of the integrated data. We conduct several sets of experiments and quantify the performance of our proposed window-based approach. In particular, we compare our local experimental results with a real setup for streaming data, using Apache Spark. The results of the experiments, which are performed on several IoT datasets, show the efficiency of our proposed solution in terms of processing time. The results are also used to provide an integrated data view to the users.
引用
收藏
页码:2299 / 2329
页数:30
相关论文
共 50 条
  • [1] IoT streaming data integration from multiple sources
    Tu, Doan Quang
    Kayes, A. S. M.
    Rahayu, Wenny
    Nguyen, Kinh
    [J]. COMPUTING, 2020, 102 (10) : 2299 - 2329
  • [2] Integration of IoT Streaming Data With Efficient Indexing and Storage Optimization
    Doan, Quang-Tu
    Kayes, A. S. M.
    Rahayu, Wenny
    Kinh Nguyen
    [J]. IEEE ACCESS, 2020, 8 : 47456 - 47467
  • [3] Lightweight data streaming from IoT devices
    Misic, Jelena
    Misic, Vojislav B.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [4] Conceptual Framework for entity integration from multiple data sources
    Orescanin, Drazen
    Tan, Ran
    Ao, Jing
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1232 - 1237
  • [5] Analysis, Integration and Visualization of Urban Data From Multiple Heterogeneous Sources
    Fortini, Pedro Magalhaes
    Davis, Clodoveu A., Jr.
    [J]. PROCEEDINGS OF THE 1ST ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ADVANCES IN RESILIENT AND INTELLIGENT CITIES (ARIC-2018), 2018, : 17 - 26
  • [6] SPARTAN: Semantic integration of big spatio-temporal data from streaming and archival sources
    Santipantakis, Georgios M.
    Glenis, Apostolos
    Patroumpas, Kostas
    Vlachou, Akrivi
    Doulkeridis, Christos
    Vouros, George A.
    Pelekis, Nikos
    Theodoridis, Yannis
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 110 : 540 - 555
  • [7] INTEGRATION OF MULTIPLE DATA SOURCES IN IMMIGRANT STUDIES
    SULLIVAN, TA
    TIENDA, M
    [J]. REVIEW OF PUBLIC DATA USE, 1984, 12 (04): : 233 - 244
  • [8] INTEGRATION OF MULTIPLE DATA SOURCES IN IMMIGRANT STUDIES
    TIENDA, M
    SULLIVAN, T
    [J]. POPULATION INDEX, 1984, 50 (03) : 413 - 414
  • [9] DATA QUALITY IN THE INTEGRATION AND ANALYSIS OF DATA FROM MULTIPLE SOURCES: SOME RESEARCH CHALLENGES
    Harding, J. L.
    [J]. 8TH INTERNATIONAL SYMPOSIUM ON SPATIAL DATA QUALITY, 2013, 40-2 (w1): : 59 - 63
  • [10] Integration of Association Rules and Clustering Models Obtained from Multiple Data Sources
    Morales Vega, Daymi
    Martin Rodriguez, Diana
    Wilford Rivera, Ingrid
    Rosete Suarez, Alejandro
    [J]. COMPUTACION Y SISTEMAS, 2012, 16 (02): : 175 - 189