Hybrid Method for Cluster Analysis of Big Data

被引:0
|
作者
Dabas, Chetna [1 ]
Nigam, Gaurav Kumar [1 ]
机构
[1] Jaypee Inst Informat Technol, Noida, India
关键词
Analysis; Big data; Clustering; Hybrid method; GENE-EXPRESSION DATA; MODEL;
D O I
10.1007/978-981-15-0214-9_17
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In big data analytics, deep interest in a communication known as computer-mediated has cropped up. While using traditional techniques, it is difficult to handle the data which is magnanimous. Hence, there exists a need for improved methods to handle this data since the past methods do not fit properly in all kinds of situations. Normally, there are various steps for the handling of big data like acquisition, preprocessing, and processing and analysis of this data in order to retrieve proper semantics out of that amount of data. In a similar context, clustering has evolved as a popular approach for organizing and analysis of big data. In the present research work, a hybrid method for analysis of big data is proposed. The hybrid approach consists of the blending of K-means, Ward hierarchical along with the interpolation technique. The evaluation of and validation of the proposed approach has been carried out for the city dataset in R language. In the present work, the number of clusters and the size of the data get varied while carrying out the results. The results of the proposed work reflect impressive execution times of the proposed method over the existing ones. The proposed method also presents possible recommendation for extracting specific semantics for providing insights to business recommendations.
引用
收藏
页码:133 / 139
页数:7
相关论文
共 50 条
  • [31] A Big Data Analytics Method for Tourist Behaviour Analysis
    Miah, Shah Jahan
    Vu, Huy Quan
    Gammack, John
    McGrath, Michael
    [J]. INFORMATION & MANAGEMENT, 2017, 54 (06) : 771 - 785
  • [32] A hybrid algorithm with cluster analysis in modelling high dimensional data
    Tunga, Burcu
    [J]. DISCRETE APPLIED MATHEMATICS, 2018, 235 : 161 - 168
  • [33] A Hybrid Data Center Architecture for Big Data
    Rahman, Mohammad Naimur
    Esmailpour, Amir
    [J]. BIG DATA RESEARCH, 2016, 3 : 29 - 40
  • [34] The next big h in big data: Hybrid architectures
    Kobielus, James
    [J]. IBM Data Management Magazine, 2013, (05):
  • [35] A Hybrid Cluster-Lift Method for the Analysis of Research Activities
    Mirkin, Boris
    Nascimento, Susana
    Fenner, Trevor
    Pereira, Luis Moniz
    [J]. HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, PT 1, 2010, 6076 : 152 - +
  • [36] Computational Performance Analysis of Cluster-based Technologies for Big Data Analytics
    Khan, Mukhtakj
    Salman
    Iqbal, Nadeem
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2017, : 280 - 286
  • [37] An Effective and Adaptable K-means Algorithm for Big Data Cluster Analysis
    Hu, Haize
    Liu, Jianxun
    Zhang, Xiangping
    Fang, Mengge
    [J]. PATTERN RECOGNITION, 2023, 139
  • [38] An optimized cluster storage method for real-time big data in Internet of Things
    Li Tu
    Shuai Liu
    Yan Wang
    Chi Zhang
    Ping Li
    [J]. The Journal of Supercomputing, 2020, 76 : 5175 - 5191
  • [39] Optimization of Physical Education Model Based on Cluster Analysis in the Context of Big Data
    Yang, Hanxue
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [40] Quality-driven early stopping for explorative cluster analysis for big data
    Fritz, Manuel
    Behringer, Michael
    Schwarz, Holger
    [J]. SICS SOFTWARE-INTENSIVE CYBER-PHYSICAL SYSTEMS, 2019, 34 (2-3): : 129 - 140