Benchmarking large-scale data management for Internet of Things

被引:7
|
作者
Hendawi, Abdeltawab [1 ,2 ]
Gupta, Jayant [3 ]
Liu, Jiayi [6 ]
Teredesai, Ankur [7 ]
Ramakrishnan, Naveen [8 ]
Shah, Mohak [6 ]
El-Sappagh, Shaker [4 ,5 ]
Kwak, Kyung-Sup [4 ]
Ali, Mohamed [7 ]
机构
[1] Univ Rhode Isl, Dept Comp Sci & Stat, Kingston, RI 02881 USA
[2] Cairo Univ, Fac Comp & Informat, Giza, Egypt
[3] Univ Minnesota, Comp Sci & Engn, Minneapolis, MN USA
[4] Inha Univ, Dept Informat & Commun Engn, Incheon, South Korea
[5] Benha Univ, Fac Comp & Informat, Informat Syst Dept, Kaliobeya, Egypt
[6] LG Elect, Seoul, South Korea
[7] Univ Washington, Ctr Data Sci, Tacoma, WA USA
[8] Robert Bosch LLC, Ctr AI, Palo Alto, CA USA
来源
JOURNAL OF SUPERCOMPUTING | 2019年 / 75卷 / 12期
基金
新加坡国家研究基金会;
关键词
Benchmarking; NoSQL; Distributed data management; Parallel data management; Internet of things (IoT); MongoDB; Cassandra; HBase; CHALLENGES;
D O I
10.1007/s11227-019-02984-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the current era of the Internet of Things (IoT), massive number of sensors are used in our daily lives. Sensors are everywhere around us. They exist in our homes, work places, streets, cars, and even ourselves. Examples include home appliances, wearable devices, and medical sensors. These sensors generate huge amount of dynamic, heterogeneous, and unstructured data that need special handling beyond the capabilities of conventional relational databases. Thus, identification of suitable data management platform to store and query this data is necessary. Despite of its popularity and efficiency in processing various types of big data, there is no single-guided study of how NoSQL data stores will behave with the Internet of Things (IoT) datasets. IoT data have its own characteristics that make it special. IoT data come from various sensors, with a wide range of formats, high velocity, and require high throughput processing with low latency. NoSQL data stores are commonly used to provide flexibility and availability for big data handling. However, there is a lack of comprehensive studies about which NoSQL data store performs the best from the two scalability aspects (scale-up and scale-out) in a distributed and parallel processing environment. This paper benchmarks the commonly used NoSQL data stores (MongoDB, Cassandra, and HBase), and compares their performance with real industrial IoT dataset. In addition, we focus on comparing the throughput, latency, and run time of the evaluated NoSQL data stores.
引用
收藏
页码:8207 / 8230
页数:24
相关论文
共 50 条
  • [31] REVISITING UNKNOWN RFID TAG IDENTIFICATION IN LARGE-SCALE INTERNET OF THINGS
    Zhang, Daqiang
    He, Zongjian
    Qian, Yuming
    Wan, Jiafu
    Li, Di
    Zhao, Shengjie
    [J]. IEEE WIRELESS COMMUNICATIONS, 2016, 23 (05) : 24 - 29
  • [32] Approximate Cardinality Estimation (ACE) in large-scale Internet of Things deployments
    Cao, Qing
    Feng, Yunhe
    Lu, Zheng
    Qi, Hairong
    Tolbert, Leon M.
    Wan, Lipeng
    Wang, Zhibo
    Zhou, Wenjun
    [J]. AD HOC NETWORKS, 2017, 66 : 52 - 63
  • [33] CLOTHO: A Large-Scale Internet of Things-Based Crowd Evacuation Planning System for Disaster Management
    Xu, Xiaolong
    Zhang, Lei
    Sotiriadis, Stelios
    Asimakopoulou, Eleana
    Li, Maozhen
    Bessis, Nik
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (05): : 3559 - 3568
  • [34] Building a large-scale object-based active storage platform for data analytics in the internet of things
    Quanqing Xu
    Khin Mi Mi Aung
    Yongqing Zhu
    Khai Leong Yong
    [J]. The Journal of Supercomputing, 2016, 72 : 2796 - 2814
  • [35] Building a large-scale object-based active storage platform for data analytics in the internet of things
    Xu, Quanqing
    Aung, Khin Mi Mi
    Zhu, Yongqing
    Yong, Khai Leong
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (07): : 2796 - 2814
  • [36] Benchmarking for, large-scale placement and beyond
    Adya, AN
    Yildiz, MC
    Markov, IL
    Villarrubia, PG
    Parakh, PN
    Madden, PH
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2004, 23 (04) : 472 - 487
  • [37] Edge Computing and Social Internet of Things for Large-Scale Smart Environments Development
    Cicirelli, Franco
    Guerrieri, Antonio
    Spezzano, Giandomenico
    Vinci, Andrea
    Briante, Orazio
    Iera, Antonio
    Ruggeri, Giuseppe
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (04): : 2557 - 2571
  • [38] Promptly Pinpointing Mobile RFID Tags for Large-Scale Internet-of-Things
    Kim, Taekyung
    Shao, Chenglong
    Lee, Wonjun
    [J]. 2015 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2015, : 118 - 123
  • [39] Energy Efficient and Accurate Monitoring of Large-Scale Diffusive Objects in Internet of Things
    Oh, Seungmin
    Lee, Jeongcheol
    Park, Soochang
    [J]. IEEE COMMUNICATIONS LETTERS, 2017, 21 (03) : 612 - 615
  • [40] Detecting Malicious Components in Large-Scale Internet-of-Things Systems and Architectures
    Bordel, Borja
    Alcarria, Ramon
    Sanchez-de-Rivera, Diego
    [J]. RECENT ADVANCES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2017, 569 : 155 - 165