Design and Development of a Medical Big Data Processing System Based on Hadoop

被引:52
|
作者
Yao, Qin [1 ]
Tian, Yu [1 ]
Li, Peng-Fei [1 ]
Tian, Li-Li [2 ]
Qian, Yang-Ming [2 ]
Li, Jing-Song [1 ]
机构
[1] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Collaborat Innovat Ctr Diag & Treatment Infect Di, Engn Res Ctr EMR & Intelligent Expert Syst,Minist, Hangzhou 310003, Zhejiang, Peoples R China
[2] Navy Gen Hosp, Beijing, Peoples R China
关键词
Medical big data; Hadoop; MapReduce; User-generated content; Sqoop; Mahout; ALTERNATIVE MEDICINE; HEALTH; TECHNOLOGY; ACCEPTANCE; MAPREDUCE; BEHAVIOR; QUALITY; IMPACT;
D O I
10.1007/s10916-015-0220-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Secondary use of medical big data is increasingly popular in healthcare services and clinical research. Understanding the logic behind medical big data demonstrates tendencies in hospital information technology and shows great significance for hospital information systems that are designing and expanding services. Big data has four characteristics Volume, Variety, Velocity and Value (the 4 Vs) - that make traditional systems incapable of processing these data using standalones. Apache Hadoop MapReduce is a promising software framework for developing applications that process vast amounts of data in parallel with large clusters of commodity hardware in a reliable, fault-tolerant manner. With the Hadoop framework and MapReduce application program interface (API), we can more easily develop our own MapReduce applications to run on a Hadoop framework that can scale up from a single node to thousands of machines. This paper investigates a practical case of a Hadoop-based medical big data processing system. We developed this system to intelligently process medical big data and uncover some features of hospital information system user behaviors. This paper studies user behaviors regarding various data produced by different hospital information systems for daily work. In this paper, we also built a five-node Hadoop cluster to execute distributed MapReduce algorithms. Our distributed algorithms show promise in facilitating efficient data processing with medical big data in healthcare services and clinical research compared with single nodes. Additionally, with medical big data analytics, we can design our hospital information systems to be much more intelligent and easier to use by making personalized recommendations.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Design and Development of a Medical Big Data Processing System Based on Hadoop
    Qin Yao
    Yu Tian
    Peng-Fei Li
    Li-Li Tian
    Yang-Ming Qian
    Jing-Song Li
    [J]. Journal of Medical Systems, 2015, 39
  • [2] Big medical data processing system based on hadoop
    Liu, W.
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 181 - 181
  • [3] Design of big data processing system architecture based on Hadoop Under the cloud computing
    Duan, Chunmei
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 6302 - 6306
  • [4] Retraction Note: Research on intelligent medical big data system based on Hadoop and blockchain
    Xiangfeng Zhang
    Yanmei Wang
    [J]. EURASIP Journal on Wireless Communications and Networking, 2023
  • [5] RETRACTED ARTICLE: Research on intelligent medical big data system based on Hadoop and blockchain
    Xiangfeng Zhang
    Yanmei Wang
    [J]. EURASIP Journal on Wireless Communications and Networking, 2021
  • [6] HaoLap: A Hadoop based OLAP system for big data
    Song, Jie
    Guo, Chaopeng
    Wang, Zhi
    Zhang, Yichan
    Yu, Ge
    Pierson, Jean-Marc
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 102 : 167 - 181
  • [7] Hadoop based Demography Big Data Management System
    Bukhari, Syeda Sana
    Park, JinHyuck
    Shin, Dong Ryeol
    [J]. 2018 19TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2018, : 93 - 98
  • [8] Design and development of real-time query platform for big data based on hadoop
    刘小利
    Xu Pandeng
    Liu Mingliang
    Zhu Guobin
    [J]. High Technology Letters, 2015, 21 (02) : 231 - 238
  • [9] A Semantically-Based Big Data Processing System Using Hadoop and Map-Reduce
    Wang Wanting
    Qin Zheng
    [J]. SOCIALLY AWARE ORGANISATIONS AND TECHNOLOGIES: IMPACT AND CHALLENGES, 2016, 477 : 246 - 247
  • [10] Efficient Big Data Processing in Hadoop MapReduce
    Dittrich, Jens
    Quiane-Ruiz, Jorge-Arnulfo
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2014 - 2015