Benchmarking large-scale data management for Internet of Things

被引:7
|
作者
Hendawi, Abdeltawab [1 ,2 ]
Gupta, Jayant [3 ]
Liu, Jiayi [6 ]
Teredesai, Ankur [7 ]
Ramakrishnan, Naveen [8 ]
Shah, Mohak [6 ]
El-Sappagh, Shaker [4 ,5 ]
Kwak, Kyung-Sup [4 ]
Ali, Mohamed [7 ]
机构
[1] Univ Rhode Isl, Dept Comp Sci & Stat, Kingston, RI 02881 USA
[2] Cairo Univ, Fac Comp & Informat, Giza, Egypt
[3] Univ Minnesota, Comp Sci & Engn, Minneapolis, MN USA
[4] Inha Univ, Dept Informat & Commun Engn, Incheon, South Korea
[5] Benha Univ, Fac Comp & Informat, Informat Syst Dept, Kaliobeya, Egypt
[6] LG Elect, Seoul, South Korea
[7] Univ Washington, Ctr Data Sci, Tacoma, WA USA
[8] Robert Bosch LLC, Ctr AI, Palo Alto, CA USA
来源
JOURNAL OF SUPERCOMPUTING | 2019年 / 75卷 / 12期
基金
新加坡国家研究基金会;
关键词
Benchmarking; NoSQL; Distributed data management; Parallel data management; Internet of things (IoT); MongoDB; Cassandra; HBase; CHALLENGES;
D O I
10.1007/s11227-019-02984-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the current era of the Internet of Things (IoT), massive number of sensors are used in our daily lives. Sensors are everywhere around us. They exist in our homes, work places, streets, cars, and even ourselves. Examples include home appliances, wearable devices, and medical sensors. These sensors generate huge amount of dynamic, heterogeneous, and unstructured data that need special handling beyond the capabilities of conventional relational databases. Thus, identification of suitable data management platform to store and query this data is necessary. Despite of its popularity and efficiency in processing various types of big data, there is no single-guided study of how NoSQL data stores will behave with the Internet of Things (IoT) datasets. IoT data have its own characteristics that make it special. IoT data come from various sensors, with a wide range of formats, high velocity, and require high throughput processing with low latency. NoSQL data stores are commonly used to provide flexibility and availability for big data handling. However, there is a lack of comprehensive studies about which NoSQL data store performs the best from the two scalability aspects (scale-up and scale-out) in a distributed and parallel processing environment. This paper benchmarks the commonly used NoSQL data stores (MongoDB, Cassandra, and HBase), and compares their performance with real industrial IoT dataset. In addition, we focus on comparing the throughput, latency, and run time of the evaluated NoSQL data stores.
引用
收藏
页码:8207 / 8230
页数:24
相关论文
共 50 条
  • [1] Benchmarking large-scale data management for Internet of Things
    Abdeltawab Hendawi
    Jayant Gupta
    Jiayi Liu
    Ankur Teredesai
    Naveen Ramakrishnan
    Mohak Shah
    Shaker El-Sappagh
    Kyung-Sup Kwak
    Mohamed Ali
    [J]. The Journal of Supercomputing, 2019, 75 : 8207 - 8230
  • [2] Blockchain for Large-Scale Internet of Things Data Storage and Protection
    Li, Ruinian
    Song, Tianyi
    Mei, Bo
    Li, Hong
    Cheng, Xiuzhen
    Sun, Limin
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2019, 12 (05) : 762 - 771
  • [3] Large-scale Offloading in the Internet of Things
    Flores, Huber
    Su, Xiang
    Kostakos, Vassilis
    Ding, Aaron Yi
    Nurmi, Petteri
    Tarkoma, Sasu
    Hui, Pan
    Li, Yong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2017,
  • [4] Novel Paradigm for Abstraction and Management of Heterogenous Things over Large-scale Internet of Things
    Park, Byeongjo
    Lee, Sang-Hoon
    Park, Soochang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [5] Special Issue on Large-Scale Internet of Things
    Guo, Song
    Liu, Jiajia
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2016, 3 (04): : 439 - 440
  • [6] Efficient Large-scale Medical Data (eHealth Big Data) Analytics in Internet of Things
    Plageras, Andreas P.
    Stergiou, Christos
    Kokkonis, George
    Psannis, Kostas E.
    Ishibashi, Yutaka
    Kim, Byung-Gyu
    Gupta, B. Brij
    [J]. 2017 IEEE 19TH CONFERENCE ON BUSINESS INFORMATICS (CBI), VOL 2, 2017, 2 : 21 - 27
  • [7] Large-scale heterogeneous terminal management technology for power Internet of Things platform
    Hong, Huijun
    Suo, Zhixin
    Wu, Heng
    Li, Dang
    Wang, Jiayan
    Lu, Hongzhi
    Zhang, Yu
    Lu, Hui
    [J]. 4TH INTERNATIONAL CONFERENCE ON INFORMATICS ENGINEERING AND INFORMATION SCIENCE (ICIEIS2021), 2022, 12161
  • [8] VIoLET: A Large-Scale Virtual Environment for Internet of Things
    Badiger, Shreyas
    Baheti, Shrey
    Simmhan, Yogesh
    [J]. EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 309 - 324
  • [9] DABPR: a large-scale internet of things-based data aggregation back pressure routing for disaster management
    Amiri, Iraj Sadegh
    Prakash, J.
    Balasaraswathi, M.
    Sivasankaran, V.
    Sundararajan, T. V. P.
    Hindia, M. H. D. Nour
    Tilwari, Valmik
    Dimyati, Kaharudin
    Henry, Ojukwu
    [J]. WIRELESS NETWORKS, 2020, 26 (04) : 2353 - 2374
  • [10] DABPR: a large-scale internet of things-based data aggregation back pressure routing for disaster management
    Iraj Sadegh Amiri
    J. Prakash
    M. Balasaraswathi
    V. Sivasankaran
    T. V. P. Sundararajan
    M. H. D. Nour Hindia
    Valmik Tilwari
    Kaharudin Dimyati
    Ojukwu Henry
    [J]. Wireless Networks, 2020, 26 : 2353 - 2374