TLTD: Transfer Learning for Tabular Data

被引:7
|
作者
Bragilovski, Maxim [1 ]
Kapri, Zahi [1 ]
Rokach, Lior [1 ]
Levy-Tzedek, Shelly [2 ,3 ,4 ]
机构
[1] Ben Gurion Univ Negev, Dept Software & Informat Syst Engn, IL-8410501 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Recanati Sch Community Hlth Profess, Dept Phys Therapy, Beer Sheva, Israel
[3] Ben Gurion Univ Negev, Zlotowski Ctr Neurosci, Beer Sheva, Israel
[4] Univ Freiburg, Freiburg Inst Adv Studies FRIAS, Freiburg, Germany
关键词
Deep-learning; Feature-extraction; Tabular datasets;
D O I
10.1016/j.asoc.2023.110748
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Neural Networks (DNNs) have become effective for various machine learning tasks. DNNs are known to achieve high accuracy with unstructured data in which each data sample (e.g., image) consists of many raw features (e.g., pixels) of the same type. The effectiveness of this approach diminishes for structured (tabular) data. In most cases, decision tree-based models such as Random Forest (RF) or Gradient Boosting Decision Trees (GBDT) outperform DNNs. In addition, DNNs tend to perform poorly when the number of samples in the dataset is small. This paper introduces Transfer Learning for Tabular Data (TLTD) which utilizes a novel learning architecture designed to extract new features from structured datasets. Using the DNN's learning capabilities on images, we convert the tabular data into images, then use the distillation technique to achieve better learning. We evaluated our approach with 25 structured datasets, and compared the outcomes to those of RF, eXtreme Gradient Boosting (XGBoost), Tabnet, KNN, and TabPFN. The results demonstrate the usefulness of the TLTD approach.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] SMARTboost Learning for Tabular Data
    Giordani, Paolo
    JOURNAL OF FINANCIAL ECONOMETRICS, 2024,
  • [2] TaCLe: Learning Constraints in Tabular Data
    Paramonov, Sergey
    Kolb, Samuel
    Guns, Tias
    De Raedt, Luc
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2511 - 2514
  • [3] Learning constraints in spreadsheets and tabular data
    Kolb, Samuel
    Paramonov, Sergey
    Guns, Tias
    De Raedt, Luc
    MACHINE LEARNING, 2017, 106 (9-10) : 1441 - 1468
  • [4] Learning Semantic Annotations for Tabular Data
    Chen, Jiaoyan
    Jimenez-Ruiz, Ernesto
    Horrocks, Ian
    Sutton, Charles
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2088 - 2094
  • [5] Learning constraints in spreadsheets and tabular data
    Samuel Kolb
    Sergey Paramonov
    Tias Guns
    Luc De Raedt
    Machine Learning, 2017, 106 : 1441 - 1468
  • [6] Prostate Cancer Diagnosis via Visual Representation of Tabular Data and Deep Transfer Learning
    El-Melegy, Moumen
    Mamdouh, Ahmed
    Ali, Samia
    Badawy, Mohamed
    El-Ghar, Mohamed Abou
    Alghamdi, Norah Saleh
    El-Baz, Ayman
    BIOENGINEERING-BASEL, 2024, 11 (07):
  • [7] Revisiting Deep Learning Models for Tabular Data
    Gorishniy, Yury
    Rubachev, Ivan
    Khrulkov, Valentin
    Babenko, Artem
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] LocalGLMnet: interpretable deep learning for tabular data
    Richman, Ronald
    Wuethrich, Mario, V
    SCANDINAVIAN ACTUARIAL JOURNAL, 2023, 2023 (01) : 71 - 95
  • [9] Local Contrastive Feature Learning for Tabular Data
    Gharibshah, Zhabiz
    Zhu, Xingquan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3963 - 3967
  • [10] Is Deep Learning on Tabular Data Enough? An Assessment
    Fayaz, Sheikh Amir
    Zaman, Majid
    Kaul, Sameer
    Butt, Muheet Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 466 - 473