Neural Network-Based Deep Encoding for Mixed-Attribute Data Classification

被引:1
|
作者
Huang, Tinglin [1 ]
He, Yulin [1 ,2 ]
Dai, Dexin [1 ]
Wang, Wenting [1 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
基金
中国博士后科学基金; 国家重点研发计划;
关键词
Mixed-attribute data; Discrete-attribute data; Continuous-attribute data; One-hot encoding; Uncertainty; DISCRETIZATION; UNCERTAINTY; SELECTION;
D O I
10.1007/978-3-030-26142-9_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a neural network-based deep encoding (DE) method for the mixed-attribute data classification. DE method first uses the existing one-hot encoding (OE) method to encode the discrete-attribute data. Second, DE method trains an improved neural network to classify the OE-attribute data corresponding to the discrete-attribute data. The loss function of improved neural network not only includes the training error but also considers the uncertainty of hidden-layer output matrix (i.e., DE-attribute data), where the uncertainty is calculated with the re-substitution entropy. Third, the classification task is conducted based on the combination of previous continuous-attribute data and transformed DE-attribute data. Finally, we compare DE method with OE method by training support vector machine (SVM) and deep neural network (DNN) on 4 KEEL mixed-attribute data sets. The experimental results demonstrate the feasibility and effectiveness of DE method and show that DE method can help SVM and DNN obtain the better classification accuracies than the traditional OE method.
引用
收藏
页码:153 / 163
页数:11
相关论文
共 50 条
  • [1] Detecting Network Anomalies in Mixed-Attribute Data Sets
    Tran, Khoi-Nguyen
    Jin, Huidong
    [J]. THIRD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING: WKDD 2010, PROCEEDINGS, 2010, : 383 - 386
  • [2] Two layered Genetic Programming for mixed-attribute data classification
    Jabeen, Hajira
    Baig, Abdul Rauf
    [J]. APPLIED SOFT COMPUTING, 2012, 12 (01) : 416 - 422
  • [3] A novel dependency-oriented mixed-attribute data classification method
    He, Yu-Lin
    Ou, Gui-Liang
    Fournier-Viger, Philippe
    Huang, Joshua Zhexue
    Suganthan, Ponnuthurai Nagaratnam
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
  • [4] Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
    Liu, Shihua
    [J]. COMPLEXITY, 2022, 2022
  • [5] Missing Value Estimation for Mixed-Attribute Data Sets
    Zhu, Xiaofeng
    Zhang, Shichao
    Jin, Zhi
    Zhang, Zili
    Xu, Zhuoming
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (01) : 110 - 121
  • [6] A practical outlier detection approach for mixed-attribute data
    Bouguessa, Mohamed
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8637 - 8649
  • [7] Deep Neural Network-Based Gait Classification Using Wearable Inertial Sensor Data
    Jung, Dawoon
    Mau Dung Nguyen
    Han, Jooin
    Park, Mina
    Lee, Kwanhoon
    Yoo, Seonggeun
    Kim, Jinwook
    Mun, Kyung-Ryoul
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 3624 - 3628
  • [8] Random Mixed Field Model for Mixed-Attribute Data Restoration
    Li, Qiang
    Bian, Wei
    Xu, Richard Yi Da
    You, Jane
    Tao, Dacheng
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1244 - 1250
  • [9] Deep Neural Network-based Automatic Modulation Classification Technique
    Kim, Byeoungdo
    Kim, Jaekyum
    Chae, Hyunmin
    Yoon, Dongweon
    Choi, Jun Won
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC 2016): TOWARDS SMARTER HYPER-CONNECTED WORLD, 2016, : 579 - 582
  • [10] Clustering Mixed-Attribute Data using Random Walk
    Skabar, Andrew
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 988 - 997