Applying spark based machine learning model on streaming big data for health status prediction

被引:66
|
作者
Nair, Lekha R. [1 ]
Shetty, Sujala D. [1 ]
Shetty, Siddhanth D. [1 ]
机构
[1] BITS Pilani, Dept Comp Sci, Dubai Campus,POB 345055, Dubai, U Arab Emirates
关键词
Big data machine learning; Streaming data processing; Tweet processing; Apache spark; Health informatics; TWITTER;
D O I
10.1016/j.compeleceng.2017.03.009
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning is one of the driving forces of science and commerce, but the proliferation of Big Data demands paradigm shifts from traditional methods in the application of machine learning techniques on this voluminous data having varying velocity. With the availability of large health care datasets and progressions in machine learning techniques, computers are now well equipped in diagnosing many health issues. This work aims at developing a real time remote health status prediction system built around open source Big Data processing engine, the Apache Spark, deployed in the cloud which focus on applying machine learning model on streaming Big Data. In this scalable system, the user tweets his health attributes and the application receives the same in real time, extracts the attributes and applies machine learning model to predict user's health status which is then directly messaged to him/her instantly for taking appropriate action. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:393 / 399
页数:7
相关论文
共 50 条
  • [31] A nodes scheduling model based on Markov chain prediction for big streaming data analysis
    Zhang, Qingchen
    Chen, Zhikui
    Yang, Laurence T.
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2015, 28 (09) : 1610 - 1619
  • [32] A Research Study on Running Machine Learning Algorithms on Big Data with Spark
    Kerestely, Arpad
    Baicoianu, Alexandra
    Bocu, Razvan
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 307 - 318
  • [33] Big data Predictive Analytics for Apache Spark using Machine Learning
    Junaid, Muhammad
    Wagan, Shiraz Ali
    Qureshi, Nawab Muhammad Faseeh
    Nam, Choon Sung
    Shin, Dong Ryeol
    [J]. 2020 GLOBAL CONFERENCE ON WIRELESS AND OPTICAL TECHNOLOGIES (GCWOT), 2020,
  • [34] A combined water quality pollution prediction model based on the Spark big data platform
    Sun, Zhihui
    Fan, Yiqing
    [J]. AQUA-WATER INFRASTRUCTURE ECOSYSTEMS AND SOCIETY, 2022, 71 (09) : 963 - 974
  • [35] An insight into tree based machine learning techniques for big data Analytics using Apache Spark
    Sheshasaayee, Ananthi
    Lakshmi, J. V. N.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 1740 - 1743
  • [36] Banking in Terms of Deposit Prediction Based on Machine Learning and Big Data Analytics
    Alessa, Nourah
    Majdua, Amal
    Alshehri, Sharifah
    Alhawiti, Maryam
    Aljohani, Resan
    Alhakamy, A'aeshah
    [J]. 2023 11TH INTERNATIONAL CONFERENCE ON CONTROL, MECHATRONICS AND AUTOMATION, ICCMA, 2023, : 69 - 74
  • [37] A Flood Prediction Method Based on Streaming Big Data Processing
    Li, Chenming
    Peng, Jianhua
    Wang, Huibin
    Yang, Simon X.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (IEEE ICIA 2017), 2017, : 898 - 902
  • [38] Big data mining optimization algorithm based on machine learning model
    Jiao, Changyi
    [J]. Revue d'Intelligence Artificielle, 2020, 34 (01) : 51 - 57
  • [39] Machine Learning and Big Data Implementation on Health Care data
    Sasubilli, Gopinadh
    Kumar, Abhishek
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 859 - 864
  • [40] Network Traffic Big Data Prediction Model Based On Combinatorial Learning
    Liu, Fei
    Li, Qianmu
    Liu, Yonzong
    [J]. 2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 252 - 256