Applying spark based machine learning model on streaming big data for health status prediction

被引:66
|
作者
Nair, Lekha R. [1 ]
Shetty, Sujala D. [1 ]
Shetty, Siddhanth D. [1 ]
机构
[1] BITS Pilani, Dept Comp Sci, Dubai Campus,POB 345055, Dubai, U Arab Emirates
关键词
Big data machine learning; Streaming data processing; Tweet processing; Apache spark; Health informatics; TWITTER;
D O I
10.1016/j.compeleceng.2017.03.009
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning is one of the driving forces of science and commerce, but the proliferation of Big Data demands paradigm shifts from traditional methods in the application of machine learning techniques on this voluminous data having varying velocity. With the availability of large health care datasets and progressions in machine learning techniques, computers are now well equipped in diagnosing many health issues. This work aims at developing a real time remote health status prediction system built around open source Big Data processing engine, the Apache Spark, deployed in the cloud which focus on applying machine learning model on streaming Big Data. In this scalable system, the user tweets his health attributes and the application receives the same in real time, extracts the attributes and applies machine learning model to predict user's health status which is then directly messaged to him/her instantly for taking appropriate action. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:393 / 399
页数:7
相关论文
共 50 条
  • [21] Data analysis in health and big data: A machine learning medical diagnosis model based on patients’ complaints
    Silahtaroğlu, Gökhan
    Yılmaztürk, Nevin
    [J]. Communications in Statistics - Theory and Methods, 2021, 50 (07): : 1547 - 1556
  • [22] Research on Visual Machine Learning Algorithms Based on Apache Spark in Big Data Environment
    Wang, Jialin
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 124 : 144 - 144
  • [23] Data analysis in health and big data: A machine learning medical diagnosis model based on patients' complaints
    Silahtaroglu, Gokhan
    Yilmazturk, Nevin
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (07) : 1547 - 1556
  • [24] Road Machinery Fault Prediction Based on Big Data and Machine Learning
    Xue Lige
    Song Zong Hua
    Shao Zhu Feng
    [J]. CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, : 536 - 540
  • [25] Prediction of Liver Diseases Based on Machine Learning Technique for Big Data
    El-Shafeiy, Engy A.
    El-Desouky, Ali I.
    Elghamrawy, Sally M.
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 362 - 374
  • [26] Flight Delay Prediction Based on Aviation Big Data and Machine Learning
    Gui, Guan
    Liu, Fan
    Sun, Jinlong
    Yang, Jie
    Zhou, Ziqi
    Zhao, Dongxu
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (01) : 140 - 150
  • [27] Applying Machine Learning to Big Data Streams An overview of challenges
    Augenstein, Christoph
    Spangenberg, Norman
    Franczyk, Bogdan
    [J]. 2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2017, : 25 - 29
  • [28] Big Data and Machine Learning in Health Care
    Beam, Andrew L.
    Kohane, Isaac S.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2018, 319 (13): : 1317 - 1318
  • [29] Prediction model of COPD acute exacerbation with big data by machine learning methods
    Rhee, Chin Kook
    Kim, Jin Woo
    Yoo, Kwang Ha
    Jung, Ki-Suck
    [J]. EUROPEAN RESPIRATORY JOURNAL, 2020, 56
  • [30] Manufacturing Systems at Scale with Big Data Streaming and Online Machine Learning
    Morariu, Octavian
    Morariu, Cristina
    Borangiu, Theodor
    Raileanu, Silviu
    [J]. SERVICE ORIENTATION IN HOLONIC AND MULTI-AGENT MANUFACTURING, 2018, 762 : 253 - 264