A Discretization Algorithm for Meteorological Data and its Parallelization Based on Hadoop

被引:2
|
作者
Liu, Chao [1 ]
Jin, Wen [1 ]
Yu, Yuting [1 ]
Qiu, Taorong [1 ]
Bai, Xiaoming [1 ]
Zou, Shuilong [1 ]
机构
[1] Nanchang Inst Sci & Technol, Sch Elect & Informat Engn, Nanchang, Jiangxi, Peoples R China
关键词
D O I
10.1088/1742-6596/910/1/012011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In view of the large amount of meteorological observation data, the property is more and the attribute values are continuous values, the correlation between the elements is the need for the application of meteorological data, this paper is devoted to solving the problem of how to better discretize large meteorological data to more effectively dig out the hidden knowledge in meteorological data and research on the improvement of discretization algorithm for large scale data, in order to achieve data in the large meteorological data discretization for the follow-up to better provide knowledge to provide protection, a discretization algorithm based on information entropy and inconsistency of meteorological attributes is proposed and the algorithm is parallelized under Hadoop platform. Finally, the comparison test validates the effectiveness of the proposed algorithm for discretization in the area of meteorological large data.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Metecloud: A private cloud platform for meteorological data storage using hadoop
    Xiaolong, X. (xlxu1988@gmail.com), 1600, Exeley Inc (06):
  • [32] An efficient algorithm and its parallelization for computing PageRank
    Qiao, Jonathan
    Jones, Brittany
    Thrall, Stacy
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 1, PROCEEDINGS, 2007, 4487 : 237 - +
  • [33] A NEW GRAPH TRICONNECTIVITY ALGORITHM AND ITS PARALLELIZATION
    MILLER, GL
    RAMACHANDRAN, V
    COMBINATORICA, 1992, 12 (01) : 53 - 76
  • [34] A data discretization algorithm based on improved chi-square statistic
    Sang, Yu
    Li, Ke-Qiu
    Yan, De-Qin
    Dalian Ligong Daxue Xuebao/Journal of Dalian University of Technology, 2012, 52 (03): : 443 - 447
  • [35] Improvement of recommendation algorithm based on Collaborative Deep Learning and its Parallelization on Spark
    Yang, Fan
    Wang, Huaqiong
    Fu, Jianjing
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 148 : 58 - 68
  • [36] Data Deduplication based on Hadoop
    Zhang, Dongzhan
    Liao, Chengfa
    Yan, Wenjing
    Tao, Ran
    Zheng, Wei
    2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2017, : 147 - 152
  • [37] Design of Distributed Communications Data Query Algorithm Based on the Cloud Computing of Hadoop
    Jun, Luo
    ADVANCED RESEARCH ON COMPUTER EDUCATION, SIMULATION AND MODELING, PT II, 2011, 176 (02): : 273 - 280
  • [38] Research on adaptive recommendation algorithm for big data mining based on Hadoop platform
    Zhang, Jinming
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2019, 12 (04) : 213 - 220
  • [39] An improved algorithm for clustering uncertain traffic data streams based on Hadoop platform
    Xu, Weixiang
    Li, Jiaojiao
    INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2019, 33 (19):
  • [40] Hadoop-based parallel algorithm for data mining in remote sensing images
    Wang Y.
    Liu Y.
    Jing W.
    International Journal of Performability Engineering, 2019, 15 (11): : 2860 - 2870