A Discretization Algorithm for Meteorological Data and its Parallelization Based on Hadoop

被引:2
|
作者
Liu, Chao [1 ]
Jin, Wen [1 ]
Yu, Yuting [1 ]
Qiu, Taorong [1 ]
Bai, Xiaoming [1 ]
Zou, Shuilong [1 ]
机构
[1] Nanchang Inst Sci & Technol, Sch Elect & Informat Engn, Nanchang, Jiangxi, Peoples R China
关键词
D O I
10.1088/1742-6596/910/1/012011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In view of the large amount of meteorological observation data, the property is more and the attribute values are continuous values, the correlation between the elements is the need for the application of meteorological data, this paper is devoted to solving the problem of how to better discretize large meteorological data to more effectively dig out the hidden knowledge in meteorological data and research on the improvement of discretization algorithm for large scale data, in order to achieve data in the large meteorological data discretization for the follow-up to better provide knowledge to provide protection, a discretization algorithm based on information entropy and inconsistency of meteorological attributes is proposed and the algorithm is parallelized under Hadoop platform. Finally, the comparison test validates the effectiveness of the proposed algorithm for discretization in the area of meteorological large data.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] An improved discretization algorithm for lightning meteorological data discretization
    Qiu, Taorong
    Liu, Lu
    Duan, Longzhen
    Zhou, Shilin
    Journal of Convergence Information Technology, 2012, 7 (23) : 518 - 527
  • [2] An Improved Parallelization of K-means Algorithm based on HADOOP
    Guo, Yizhuo
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [3] A NOVEL MASS METEOROLOGICAL DATA STORAGE SYSTEM BASED ON HADOOP ECOSYSTEM
    Ji, Quanpeng
    FRESENIUS ENVIRONMENTAL BULLETIN, 2021, 30 (05): : 5332 - 5339
  • [4] Design and Implementation of Meteorological Big Data Platform Based on Hadoop and Elasticsearch
    Yin, He
    Deng Fengdong
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 705 - 710
  • [5] Design and Test of GIS Platform for Meteorological Data Analysis Based on Hadoop
    Li T.
    Feng Z.
    Sun S.
    Cheng W.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2019, 50 (01): : 180 - 188
  • [6] A Novel Density based Clustering Algorithm and Its Parallelization
    Li, Xiaokang
    Yu, Binbin
    Zhou, Yinghua
    Sun, Guangzhong
    2014 15TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2014), 2014, : 1 - 6
  • [7] Data mining association rule algorithm based on Hadoop
    Huang Suyu
    PROCEEDINGS OF 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2015), 2015, : 349 - 352
  • [8] A new data mining algorithm based on MapReduce and hadoop
    Yang, Xianfeng
    Lian, Liming
    International Journal of Signal Processing, Image Processing and Pattern Recognition, 2014, 7 (02) : 131 - 142
  • [9] The discretization algorithm for rough data and its application to intrusion detection
    Shi, Zhicai
    Xia, Yongxiang
    Wu, Fei
    Dai, Jian
    Journal of Networks, 2014, 9 (06) : 1380 - 1387
  • [10] Boolean Function Complementation Based Algorithm for Data Discretization
    Borowik, Grzegorz
    COMPUTER AIDED SYSTEMS THEORY, PT II, 2013, 8112 : 218 - 225