Optimization and Application in Medical Big Document-Data of Apriori Algorithm based on MapReduce

被引:0
|
作者
Li Wei [1 ]
Liu Guangming [1 ]
Shao Yachao [2 ]
Liu Junlong [2 ]
Zuo You [2 ]
机构
[1] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[2] Natl Supercomp Ctr Tianjin, Tianjin, Peoples R China
关键词
component; Medical big data; NoSQL; MapReduce; Data Mining; Apriori; optimization;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
For the challenges of redundancy, multi-dimension, complex and heterogeneous in medical documents, and to solve the problem that the value hidden in the huge amounts of medical document-data can't be mined, this paper proposed a system called MSPM based on NOSQL and MapReduce. Through storage of key-value pairs, complex and heterogeneous datas are summed up in a unified and convenient format of transaction for Apriori. Then Apriori is executed in parallel through MapReduce. At last, with the strategies of generating all the candidate sets non-recursively and constraint count for candidate sets of interest, it can solve the problem of low speed, high overhead and poor effectiveness for Apriori algorithm in the application of medical data. Testing results has shown the algorithm of optimization is available.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Apriori Algorithm Optimization Study Based on MapReduce
    Li Chunqing
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1466 - 1470
  • [2] AN ALGORITHM OF APRIORI BASED ON MEDICAL BIG DATA AND CLOUD COMPUTING
    Cui, Xiaoyan
    Yang, Shimeng
    Wang, Daming
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 361 - 365
  • [3] Complex Statistical Analysis of Big Data: Implementation and Application of Apriori and FP-Growth Algorithm Based on MapReduce
    Rong, Zbuobo
    Xia, Dawen
    Zhang, Zili
    PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 968 - 972
  • [4] Apriori algorithm optimization based on Spark platform under big data
    Yu, Huafeng
    MICROPROCESSORS AND MICROSYSTEMS, 2021, 80
  • [5] Parallel Clustering Optimization Algorithm Based on MapReduce in Big Data Mining
    Zhang, Huajie
    Song, Lei
    Zhang, Sen
    IAENG International Journal of Applied Mathematics, 2023, 53 (01):
  • [6] Apriori Versions Based on MapReduce for Mining Frequent Patterns on Big Data
    Maria Luna, Jose
    Padillo, Francisco
    Pechenizkiy, Mykola
    Ventura, Sebastian
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (10) : 2851 - 2865
  • [7] Mining on Relationships in Big Data era using Improve Apriori Algorithm with MapReduce Approach
    Pandey, Kamlesh Kumar
    Shukla, Diwakar
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATION AND TELECOMMUNICATION (ICACAT), 2018,
  • [8] Performance optimization of MapReduce-based Apriori algorithm on Hadoop cluster
    Singh, Sudhakar
    Garg, Rakhi
    Mishra, P. K.
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 67 : 348 - 364
  • [9] Research and Optimization of Apriori Algorithm Based on Cloud Computing and Medical Large Data
    Song, Menghua
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (01): : 697 - 700
  • [10] Parallel implementation of Apriori algorithm based on MapReduce
    Li N.
    Zeng L.
    He Q.
    Shi Z.
    International Journal of Networked and Distributed Computing, 2013, 1 (2) : 89 - 96