Design and Development of a Medical Big Data Processing System Based on Hadoop

被引:52
|
作者
Yao, Qin [1 ]
Tian, Yu [1 ]
Li, Peng-Fei [1 ]
Tian, Li-Li [2 ]
Qian, Yang-Ming [2 ]
Li, Jing-Song [1 ]
机构
[1] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Collaborat Innovat Ctr Diag & Treatment Infect Di, Engn Res Ctr EMR & Intelligent Expert Syst,Minist, Hangzhou 310003, Zhejiang, Peoples R China
[2] Navy Gen Hosp, Beijing, Peoples R China
关键词
Medical big data; Hadoop; MapReduce; User-generated content; Sqoop; Mahout; ALTERNATIVE MEDICINE; HEALTH; TECHNOLOGY; ACCEPTANCE; MAPREDUCE; BEHAVIOR; QUALITY; IMPACT;
D O I
10.1007/s10916-015-0220-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Secondary use of medical big data is increasingly popular in healthcare services and clinical research. Understanding the logic behind medical big data demonstrates tendencies in hospital information technology and shows great significance for hospital information systems that are designing and expanding services. Big data has four characteristics Volume, Variety, Velocity and Value (the 4 Vs) - that make traditional systems incapable of processing these data using standalones. Apache Hadoop MapReduce is a promising software framework for developing applications that process vast amounts of data in parallel with large clusters of commodity hardware in a reliable, fault-tolerant manner. With the Hadoop framework and MapReduce application program interface (API), we can more easily develop our own MapReduce applications to run on a Hadoop framework that can scale up from a single node to thousands of machines. This paper investigates a practical case of a Hadoop-based medical big data processing system. We developed this system to intelligently process medical big data and uncover some features of hospital information system user behaviors. This paper studies user behaviors regarding various data produced by different hospital information systems for daily work. In this paper, we also built a five-node Hadoop cluster to execute distributed MapReduce algorithms. Our distributed algorithms show promise in facilitating efficient data processing with medical big data in healthcare services and clinical research compared with single nodes. Additionally, with medical big data analytics, we can design our hospital information systems to be much more intelligent and easier to use by making personalized recommendations.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Research and analysis of big data based on hadoop
    Liu, Xiaohong
    Wang, Wangang
    Zhu, Guangfu
    [J]. Boletin Tecnico/Technical Bulletin, 2017, 55 (04): : 382 - 386
  • [32] Design of customer marketing big data processing system based on data mining clustering technology
    Wang, Jingzhe
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 100 - 104
  • [33] Big Data analysis in development of personalized medical system
    Shakhovska, Natalia
    Fedushko, Solomia
    Ml, Michal Gregus
    Melnykova, Natalia
    Shvorob, Iryna
    Syerov, Yuriy
    [J]. 10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS, 2019, 160 : 229 - 234
  • [34] Distributed Case-based Reasoning System Based on Big Data Platform Hadoop
    Wang, Chong-Yang
    Wang, Hong-Bing
    Liang, Yan-Rui
    [J]. 2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 629 - 634
  • [35] Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform
    Wang, Yajun
    Cheng, Shengming
    Zhang, Xinchen
    Leng, Junyu
    Liu, Jun
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [36] Design of Electric Power Data Management System Based on Hadoop
    Li, Yongheng
    Wang, Yongzhi
    Jin, Liang
    [J]. PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND INFORMATION TECHNOLOGY APPLICATIONS, 2016, 71 : 1090 - 1093
  • [37] A Design of Processing Platform for Smartphone Based on Big Data
    Zhang, Jie
    Zhang, Ke
    Zhou, Hengxin
    [J]. 2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2015, : 162 - 165
  • [38] Big Data Analysis: Recommendation System with Hadoop Framework
    Verma, Jai Prakash
    Patel, Bankim
    Patel, Atul
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION TECHNOLOGY CICT 2015, 2015, : 92 - 97
  • [39] Hadoop Distributed File System for Big data analysis
    Almansouri, Hatim Talal
    Masmoudi, Youssef
    [J]. PROCEEDINGS OF 2019 IEEE 4TH WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS' 19), 2019, : 257 - 261
  • [40] Hadoop Eco System for Big Data Security and Privacy
    Adluru, Pradeep
    Datla, Srikari Sindhoori
    Zhang, Xiaowen
    [J]. 2015 IEEE LONG ISLAND SYSTEMS, APPLICATIONS AND TECHNOLOGY CONFERENCE (LISAT), 2015,