Learning Enhanced Representations for Tabular Data via Neighborhood Propagation

被引:0
|
作者
Du, Kounianhua [1 ,3 ]
Zhang, Weinan [1 ]
Zhou, Ruiwen [1 ,3 ]
Wang, Yangkun [1 ,3 ]
Zhao, Xilong [1 ,3 ]
Jin, Jiarui [1 ,3 ]
Gan, Quan [2 ]
Zhang, Zheng [2 ]
Wipf, David [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai, Peoples R China
[2] Amazon, Seattle, WA USA
[3] Shanghai AI Lab, Amazon Web Serv, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prediction over tabular data is an essential and fundamental problem in many important downstream tasks. However, existing methods either treat a data instance of the table independently as input or do not jointly utilize multi-row features and labels to directly change and enhance target data representations. In this paper, we propose to 1) construct a hypergraph from relevant data instance retrieval to model the cross-row and cross-column patterns of those instances, and 2) perform message Propagation to Enhance the target data instance representations for Tabular prediction tasks. Specifically, our tailored message propagation step benefits from both the fusion of label and features during propagation, as well as locality-aware high-order feature interactions. Experiments on two important tabular data prediction tasks validate the superiority of the proposed PET model relative to other baselines. Additionally, we demonstrate the effectiveness of the model components and the feature enhancement ability of PET via various ablation studies and visualizations. The code is available at https://github.com/KounianhuaDu/PET.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Recent deep learning methods for tabular data
    Hwang, Yejin
    Song, Jongwoo
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2023, 30 (02) : 215 - 226
  • [22] TabAttention: Learning Attention Conditionally on Tabular Data
    Grzeszczyk, Michal K.
    Plotka, Szymon
    Rebizant, Beata
    Kosinska-Kaczynska, Katarzyna
    Lipa, Michal
    Brawura-Biskupski-Samaha, Robert
    Korzeniowski, Przemyslaw
    Trzcinski, Tomasz
    Sitek, Arkadiusz
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII, 2023, 14226 : 347 - 357
  • [23] Investigating latent representations and generalization in deep neural networks for tabular data
    Couplet, Edouard
    Lambert, Pierre
    Verleysen, Michel
    Lee, John A.
    de Bodt, Cyril
    NEUROCOMPUTING, 2024, 597
  • [24] Semi-Supervised Learning with Data Augmentation for Tabular Data
    Fang, Junpeng
    Tang, Caizhi
    Cui, Qing
    Zhu, Feng
    Li, Longfei
    Zhou, Jun
    Zhu, Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3928 - 3932
  • [25] Learning Graph Representations with Embedding Propagation
    Garcia-Duran, Alberto
    Niepert, Mathias
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [26] The effects of data quality on machine learning performance on tabular data
    Mohammed, Sedir
    Budach, Lukas
    Feuerpfeil, Moritz
    Ihde, Nina
    Nathansen, Andrea
    Noack, Nele
    Patzlaff, Hendrik
    Naumann, Felix
    Harmouch, Hazar
    INFORMATION SYSTEMS, 2025, 132
  • [27] Contrastive learning enhanced deep neural network with serial regularization for high-dimensional tabular data
    Wu, Yao
    Zhu, Donghua
    Wang, Xuefeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [28] Tabular data: Deep learning is not all you need
    Shwartz-Ziv, Ravid
    Armon, Amitai
    INFORMATION FUSION, 2022, 81 : 84 - 90
  • [29] Machine learning for question answering from tabular data
    Khalid, Mahboob Alam
    Jijkoun, Valentin
    de Rijke, Maarten
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 392 - +
  • [30] Dense Representation Learning and Retrieval for Tabular Data Prediction
    Zheng, Lei
    Li, Ning
    Chen, Xianyu
    Gan, Quan
    Zhang, Weinan
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3559 - 3569