Feature-based augmentation and classification for tabular data

被引:9
|
作者
Sathianarayanan, Balachander [1 ]
Samant, Yogesh Chandra Singh [1 ]
Guruprasad, Prahalad S. Conjeepuram [1 ]
Hariharan, Varshin B. [1 ]
Manickam, Nirmala Devi [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Coimbatore, Tamil Nadu, India
关键词
Distribution functions;
D O I
10.1049/cit2.12123
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating synthetic samples for a tabular data is a strenuous task. Most of the time, the columns (features) in the dataset may not follow an ideal distribution function. The objective of the proposed algorithm, Histogram Augmentation Technique (HAT), is to generate a dataset whose distribution is similar to that of the original dataset. This augmentation is achieved based on individual columns, where separate algorithms are designed for continuous and discrete columns. Humans also use features of an object for interpretation. When humans make a judgement, they notice prominent features and characterise the perceived object. However, conventional Machine Learning classifiers are designed and trained on the basis of samples. Taking the features as the basis for classification, Feature Importance Classifier (FIC) has been attempted in this work. FIC treats every feature independent of each other, and ranks the features based on its dependence with the classified label. It has been found that the FIC has the highest accuracy and has improved the accuracy by 5.54% on average, when it's compared to other classifiers. The suggested algorithms have been experimented on five datasets and compared with two augmentation algorithms and four state-of-the-art ML classification algorithms.
引用
收藏
页码:481 / 491
页数:11
相关论文
共 50 条
  • [31] Classification of schizophrenia using feature-based morphometry
    U. Castellani
    E. Rossato
    V. Murino
    M. Bellani
    G. Rambaldelli
    C. Perlini
    L. Tomelleri
    M. Tansella
    P. Brambilla
    Journal of Neural Transmission, 2012, 119 : 395 - 404
  • [32] Feature-based data assimilation in geophysics
    Morzfeld, Matthias
    Adams, Jesse
    Lunderman, Spencer
    Orozco, Rafael
    NONLINEAR PROCESSES IN GEOPHYSICS, 2018, 25 (02) : 355 - 374
  • [33] Feature-Based Data Stream Clustering
    Asbagh, Mohsen Jafari
    Abolhassani, Hassan
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 363 - 368
  • [34] Benchmarking Data Augmentation Techniques for Tabular Data
    Machado, Pedro
    Fernandes, Bruno
    Novais, Paulo
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2022, 2022, 13756 : 104 - 112
  • [35] Feature-Based Attention and Feature-Based Expectation
    Summerfield, Christopher
    Egner, Tobias
    TRENDS IN COGNITIVE SCIENCES, 2016, 20 (06) : 401 - 404
  • [36] Advancements in Image Feature-Based Classification of Motor Imagery EEG Data: A Comprehensive Review
    Yilmaz, Cagatay Murat
    Yilmaz, Bahar Hatipoglu
    TRAITEMENT DU SIGNAL, 2023, 40 (05) : 1857 - 1868
  • [37] Visualizing High Dimensional Feature Space for Feature-Based Information Classification
    Wang, Xiaokun
    Yang, Li
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II, 2016, 9787 : 540 - 550
  • [38] A Parallel Feature Expansion Classification Model with Feature-based Attention Mechanism
    Yu, Yingchao
    Hao, Kuangrong
    Tang, Xue-Song
    Wang, Tong
    Liu, Xiaoyan
    Ding, Yongsheng
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 362 - 367
  • [39] A Feature-Based Classification of Triple Graph Grammar Variants
    Weidmann, Nils
    Oppermann, Robin
    Robrecht, Patrick
    PROCEEDINGS OF THE 12TH ACM SIGPLAN INTERNATIONAL CONFERENCE ON SOFTWARE LANGUAGE ENGINEERING (SLE '19), 2019, : 1 - 14
  • [40] An Overview of Feature-Based Methods for Digital Modulation Classification
    Hazza, Alharbi
    Shoaib, Mobien
    Alshebeili, Saleh A.
    Fahad, Alturki
    2013 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA'13), 2013,