Hybrid Method for Cluster Analysis of Big Data

被引:0
|
作者
Dabas, Chetna [1 ]
Nigam, Gaurav Kumar [1 ]
机构
[1] Jaypee Inst Informat Technol, Noida, India
关键词
Analysis; Big data; Clustering; Hybrid method; GENE-EXPRESSION DATA; MODEL;
D O I
10.1007/978-981-15-0214-9_17
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In big data analytics, deep interest in a communication known as computer-mediated has cropped up. While using traditional techniques, it is difficult to handle the data which is magnanimous. Hence, there exists a need for improved methods to handle this data since the past methods do not fit properly in all kinds of situations. Normally, there are various steps for the handling of big data like acquisition, preprocessing, and processing and analysis of this data in order to retrieve proper semantics out of that amount of data. In a similar context, clustering has evolved as a popular approach for organizing and analysis of big data. In the present research work, a hybrid method for analysis of big data is proposed. The hybrid approach consists of the blending of K-means, Ward hierarchical along with the interpolation technique. The evaluation of and validation of the proposed approach has been carried out for the city dataset in R language. In the present work, the number of clusters and the size of the data get varied while carrying out the results. The results of the proposed work reflect impressive execution times of the proposed method over the existing ones. The proposed method also presents possible recommendation for extracting specific semantics for providing insights to business recommendations.
引用
收藏
页码:133 / 139
页数:7
相关论文
共 50 条
  • [21] A Hybrid Outlier Detection Method for Health Care Big Data
    Yan, Ke
    You, Xiaoming
    Ji, Xiaobo
    Yin, Guangqiang
    Yang, Fan
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 157 - 162
  • [22] HSDP: A Hybrid Sampling Method for Imbalanced Big Data Based on Data Partition
    Chen, Liping
    Jiang, Jiabao
    Zhang, Yong
    COMPLEXITY, 2021, 2021
  • [23] A Big Data Analysis Method for Online Education
    Yu, Shidong
    Yang, Dongsheng
    Feng, Xue
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 291 - 294
  • [24] Big-Data Analysis, Cluster Analysis, and Machine-Learning Approaches
    Alonso-Betanzos, Amparo
    Bolon-Canedo, Veronica
    SEX-SPECIFIC ANALYSIS OF CARDIOVASCULAR FUNCTION, 2018, 1065 : 607 - 626
  • [25] Primary Education Evaluation in Brazil using Big Data and Cluster Analysis
    Ramos, Thiago Graca
    Ferreira Machado, Jean Cristian
    Vieira Cordeiro, Bruna Principe
    3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2015, 2015, 55 : 1031 - 1039
  • [26] Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing
    Lim, JongBeom
    Ahnh, Jong-Suk
    Lee, Kang-Woo
    ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2314 - 2319
  • [27] Performance Factor Analysis and Scope of Optimization for Big Data Processing on Cluster
    Godara, Hanuman
    Govil, M. C.
    Pilli, E. S.
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 418 - 423
  • [28] CloudVista: Interactive and Economical Visual Cluster Analysis for Big Data in the Cloud
    Xu, Huiqi
    Li, Zhen
    Guo, Shumin
    Chen, Keke
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 1886 - 1889
  • [29] HRCM: An Efficient Hybrid Referential Compression Method for Genomic Big Data
    Yao, Haichang
    Ji, Yimu
    Li, Kui
    Liu, Shangdong
    He, Jing
    Wang, Ruchuan
    BIOMED RESEARCH INTERNATIONAL, 2019, 2019
  • [30] A Product Recommendation Method Based on Big Data Analysis
    Li, Jiahang
    APPLICATIONS OF DECISION SCIENCE IN MANAGEMENT, ICDSM 2022, 2023, 260 : 137 - 144