Learning Enhanced Representations for Tabular Data via Neighborhood Propagation

被引:0
|
作者
Du, Kounianhua [1 ,3 ]
Zhang, Weinan [1 ]
Zhou, Ruiwen [1 ,3 ]
Wang, Yangkun [1 ,3 ]
Zhao, Xilong [1 ,3 ]
Jin, Jiarui [1 ,3 ]
Gan, Quan [2 ]
Zhang, Zheng [2 ]
Wipf, David [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai, Peoples R China
[2] Amazon, Seattle, WA USA
[3] Shanghai AI Lab, Amazon Web Serv, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prediction over tabular data is an essential and fundamental problem in many important downstream tasks. However, existing methods either treat a data instance of the table independently as input or do not jointly utilize multi-row features and labels to directly change and enhance target data representations. In this paper, we propose to 1) construct a hypergraph from relevant data instance retrieval to model the cross-row and cross-column patterns of those instances, and 2) perform message Propagation to Enhance the target data instance representations for Tabular prediction tasks. Specifically, our tailored message propagation step benefits from both the fusion of label and features during propagation, as well as locality-aware high-order feature interactions. Experiments on two important tabular data prediction tasks validate the superiority of the proposed PET model relative to other baselines. Additionally, we demonstrate the effectiveness of the model components and the feature enhancement ability of PET via various ablation studies and visualizations. The code is available at https://github.com/KounianhuaDu/PET.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] TABBIE Pretrained Representations of Tabular Data
    Iida, Hiroshi
    Dung Thai
    Manjunatha, Varun
    Iyyer, Mohit
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3446 - 3456
  • [2] HYTREL: Hypergraph-enhanced Tabular Data Representation Learning
    Chen, Pei
    Sarkar, Soumajyoti
    Lausen, Leonard
    Srinivasan, Balasubramaniam
    Zha, Sheng
    Huang, Ruihong
    Karypis, George
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Towards Efficient Learning of GNNs on High-Dimensional Multilayered Representations of Tabular Data
    A. V. Medvedev
    A. G. Djakonov
    Doklady Mathematics, 2023, 108 : S265 - S271
  • [4] Towards Efficient Learning of GNNs on High-Dimensional Multilayered Representations of Tabular Data
    Medvedev, A. V.
    Djakonov, A. G.
    DOKLADY MATHEMATICS, 2023, 108 (SUPPL 2) : S265 - S271
  • [5] SMARTboost Learning for Tabular Data
    Giordani, Paolo
    JOURNAL OF FINANCIAL ECONOMETRICS, 2024,
  • [6] TaCLe: Learning Constraints in Tabular Data
    Paramonov, Sergey
    Kolb, Samuel
    Guns, Tias
    De Raedt, Luc
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2511 - 2514
  • [7] Learning constraints in spreadsheets and tabular data
    Kolb, Samuel
    Paramonov, Sergey
    Guns, Tias
    De Raedt, Luc
    MACHINE LEARNING, 2017, 106 (9-10) : 1441 - 1468
  • [8] Learning Semantic Annotations for Tabular Data
    Chen, Jiaoyan
    Jimenez-Ruiz, Ernesto
    Horrocks, Ian
    Sutton, Charles
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2088 - 2094
  • [9] Learning constraints in spreadsheets and tabular data
    Samuel Kolb
    Sergey Paramonov
    Tias Guns
    Luc De Raedt
    Machine Learning, 2017, 106 : 1441 - 1468
  • [10] TLTD: Transfer Learning for Tabular Data
    Bragilovski, Maxim
    Kapri, Zahi
    Rokach, Lior
    Levy-Tzedek, Shelly
    APPLIED SOFT COMPUTING, 2023, 147