DeepTable: a permutation invariant neural network for table orientation classification

被引:7
|
作者
Habibi, Maryam [1 ]
Starlinger, Johannes [1 ]
Leser, Ulf [1 ]
机构
[1] Humboldt Univ, Berlin, Germany
关键词
Information discovery; Tabular data; Table orientation classification; Deep learning; Machine learning;
D O I
10.1007/s10618-020-00711-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables are a common way to present information in an intuitive and concise manner. They are used extensively in media such as scientific articles or web pages. Automatically analyzing the content of tables bears special challenges. One of the most basic tasks is determination of the orientation of a table: In column tables, columns represent one entity with the different attribute values present in the different rows; row tables are vice versa, and matrix tables give information on pairs of entities. In this paper, we address the problem of classifying a given table into one of the three layouts horizontal (for row tables), vertical (for column tables), and matrix. We describe DeepTable, a novel method based on deep neural networks designed for learning from sets. Contrary to previous state-of-the-art methods, this basis makes DeepTable invariant to the permutation of rows or columns, which is a highly desirable property as in most tables the order of rows and columns does not carry specific information. We evaluate our method using a silver standard corpus of 5500 tables extracted from biomedical articles where the layout was determined heuristically. DeepTable outperforms previous methods in both precision and recall on our corpus. In a second evaluation, we manually labeled a corpus of 300 tables and were able to confirm DeepTable to reach superior performance in the table layout classification task. The codes and resources introduced here are available at.
引用
收藏
页码:1963 / 1983
页数:21
相关论文
共 50 条
  • [21] Neural network for invariant recognition
    Oparin, AN
    Plekhanova, IV
    Soloviov, NG
    SECOND INTERNATIONAL CONFERENCE ON OPTICAL INFORMATION PROCESSING, 1996, 2969 : 132 - 135
  • [22] Automatic cardiac MRI segmentation and permutation-invariant pathology classification using deep neural networks and point clouds
    Chang, Yakun
    Jung, Cheolkon
    NEUROCOMPUTING, 2020, 418 : 270 - 279
  • [23] On permutation symmetries of hopfield model neural network
    Dong, JY
    Xu, SC
    Chen, ZX
    Wu, BX
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2001, 6 (02) : 129 - 136
  • [24] System Invariant Method for Ultrasonic Flaw Classification in Weldments Using Residual Neural Network
    Park, Jinhyun
    Lee, Seung-Eun
    Kim, Hak-Joon
    Song, Sung-Jin
    Kang, Sung-Sik
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [25] Burst Image Deblurring Using Permutation Invariant Convolutional Neural Networks
    Aittala, Miika
    Durand, Fredo
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 748 - 764
  • [26] Permutation Invariant Training of Generative Adversarial Network for Monaural Speech Separation
    Chen, Lianwu
    Yu, Meng
    Qian, Yanmin
    Su, Dan
    Yu, Dong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 302 - 306
  • [27] Improving Graph Neural Network with Learnable Permutation Pooling
    Jin, Yu
    Jaja, Joseph F.
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 682 - 689
  • [28] A neural network to enhance local search in the permutation flowshop
    El-Bouri, A
    Balakrishnan, S
    Popplewell, N
    COMPUTERS & INDUSTRIAL ENGINEERING, 2005, 49 (01) : 182 - 196
  • [29] Fuzzy ARTMAP classification of invariant features derived using angle of rotation from a neural network
    Raveendran, P
    Palaniappan, R
    Omatu, S
    INFORMATION SCIENCES, 2000, 130 (1-4) : 67 - 84
  • [30] INVARIANT FEATURE EXTRACTION FOR IMAGE CLASSIFICATION VIA MULTI-CHANNEL CONVOLUTIONAL NEURAL NETWORK
    Mei, Shaohui
    Jiang, Ruoqiao
    Ji, Jingyu
    Sun, Jun
    Peng, Yang
    Zhang, Yifan
    2017 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2017), 2017, : 491 - 495