An Efficient Distributed Database Clustering Algorithm for Big Data Processing

被引:0
|
作者
Sun, Qiao [1 ]
Fu, Lan-mei [1 ]
Deng, Bu-qiao [1 ]
Pei, Xu-bin [2 ]
Sun, Jia-song [3 ]
机构
[1] Beijing GuoDianTong Network Technol Co Ltd, Beijing, Peoples R China
[2] State Grid Zhejiang Elect Power Co Ltd, Hangzhou, Zhejiang, Peoples R China
[3] Tsinghua Univ, EE Dept, Beijing, Peoples R China
来源
2017 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC 2017) | 2017年
关键词
Distributed big data processing; Distributed database; Data clustering; Depth neural network; K-means;
D O I
10.23977/iccsc.2017.1012
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a distributed data clustering technique based on deep neural network. First, each record in the distributed database is taken as an input vector, and its characteristics are extracted and input to the input layer of the depth neural network. The weight of the connection is trained by BP algorithm, and the training of depth neural network output is realized by adjusting the weight. Finally, the data clustering results are judged according to the similarity of the current vector corresponding to the output data. Experimental results based on small-scale distributed systems show that this method has better test set accuracy than traditional k-means clustering method, and is more suitable for large-scale data clustering in the distributed environments.
引用
收藏
页码:70 / 74
页数:5
相关论文
共 50 条
  • [31] Computationally Efficient, Dynamic Distributed Algorithm of Sensor-based Big Data
    Al-kahtani, Mohammed S.
    Karim, Lutful
    Almhana, Jalal
    2017 13TH INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2017, : 759 - 763
  • [32] A DISTRIBUTED ENERGY EFFICIENT CLUSTERING ALGORITHM FOR DATA AGGREGATION IN WIRELESS SENSOR NETWORKS
    Shirazi, Seyed Mohammad Bagher Musavi
    Sabet, Maryam
    Pajoohan, Mohammad Reza
    IIUM ENGINEERING JOURNAL, 2018, 19 (01): : 72 - 90
  • [33] Study on Intelligent Analysis and Processing Technology of Computer Big Data Based on Clustering Algorithm
    Liu, Xiaoming
    Rokunojjaman, Md
    Kumar, Rakesh E. R.
    Nazila, Ragimova
    Vugar, Abdullayev
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2023, 16 (02) : 150 - 158
  • [34] An Efficient Clustering Technique for Big Data Mining
    Banait, Satish S.
    Sane, S. S.
    Talekar, Sopan A.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (03): : 702 - 717
  • [35] Big-Data Clustering with Genetic Algorithm
    Mortezanezhad, Afsaneh
    Daneshifar, Ebrahim
    2019 IEEE 5TH CONFERENCE ON KNOWLEDGE BASED ENGINEERING AND INNOVATION (KBEI 2019), 2019, : 702 - 706
  • [36] Research on incremental clustering algorithm for big data
    Yang X.
    Applied Mathematics and Nonlinear Sciences, 2023, 8 (02) : 169 - 180
  • [37] Quantum Supervised Clustering Algorithm for Big Data
    Bishwas, Arit Kumar
    Mani, Ashish
    Palade, Vasile
    2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [38] Batch Clustering Algorithm for Big Data Sets
    Alguliyev, Rasim
    Aliguliyev, Ramiz
    Bagirov, Adil
    Karimov, Rafael
    2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 79 - 82
  • [39] Distributed Database and Application Architecture for Big Data Solutions
    Misaki, Makoto
    Tsuda, Tomio
    Inoue, Shinji
    Sato, Shintaro
    Kayahara, Akihiro
    Imai, Shin-Ichi
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2017, 30 (04) : 328 - 332
  • [40] Performance Enhancement of Distributed Clustering for Big Data Analytics
    Mohamed, Omar Hesham
    Shehab, Mohamed Elemam
    El Fakharany, Essam
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 415 - 425