DeepTable: a permutation invariant neural network for table orientation classification

被引:7
|
作者
Habibi, Maryam [1 ]
Starlinger, Johannes [1 ]
Leser, Ulf [1 ]
机构
[1] Humboldt Univ, Berlin, Germany
关键词
Information discovery; Tabular data; Table orientation classification; Deep learning; Machine learning;
D O I
10.1007/s10618-020-00711-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables are a common way to present information in an intuitive and concise manner. They are used extensively in media such as scientific articles or web pages. Automatically analyzing the content of tables bears special challenges. One of the most basic tasks is determination of the orientation of a table: In column tables, columns represent one entity with the different attribute values present in the different rows; row tables are vice versa, and matrix tables give information on pairs of entities. In this paper, we address the problem of classifying a given table into one of the three layouts horizontal (for row tables), vertical (for column tables), and matrix. We describe DeepTable, a novel method based on deep neural networks designed for learning from sets. Contrary to previous state-of-the-art methods, this basis makes DeepTable invariant to the permutation of rows or columns, which is a highly desirable property as in most tables the order of rows and columns does not carry specific information. We evaluate our method using a silver standard corpus of 5500 tables extracted from biomedical articles where the layout was determined heuristically. DeepTable outperforms previous methods in both precision and recall on our corpus. In a second evaluation, we manually labeled a corpus of 300 tables and were able to confirm DeepTable to reach superior performance in the table layout classification task. The codes and resources introduced here are available at.
引用
收藏
页码:1963 / 1983
页数:21
相关论文
共 50 条
  • [1] DeepTable: a permutation invariant neural network for table orientation classification
    Maryam Habibi
    Johannes Starlinger
    Ulf Leser
    Data Mining and Knowledge Discovery, 2020, 34 : 1963 - 1983
  • [2] Minimal Neural Network Models for Permutation Invariant Agents
    Pedersen, Joachim Winther
    Risi, Sebastian
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 130 - 138
  • [3] Neural network for invariant image classification
    Patra, PK
    JOURNAL OF THE INSTITUTION OF ELECTRONICS AND TELECOMMUNICATION ENGINEERS, 1996, 42 (4-5): : 281 - 290
  • [4] Neural decoders with permutation invariant structure
    Chen, Xiangyu
    Ye, Min
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (08): : 5481 - 5503
  • [5] Permutation invariant polynomial neural network approach to fitting potential energy surfaces
    Jiang, Bin
    Guo, Hua
    JOURNAL OF CHEMICAL PHYSICS, 2013, 139 (05):
  • [6] CLASSIFICATION OF INVARIANT IMAGE REPRESENTATIONS USING A NEURAL NETWORK
    KHOTANZAD, A
    LU, JH
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (06): : 1028 - 1038
  • [7] Classification of Plants Using Invariant Features and a Neural Network
    Amlekar, Manisha M.
    Ali, Mouad M. H.
    Gaikwad, Ashok T.
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS, ICTIS 2018, VOL 2, 2019, 107 : 127 - 136
  • [8] Classification of periodic variable stars with novel cyclic-permutation invariant neural networks
    Zhang, Keming
    Bloom, Joshua S.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 505 (01) : 515 - 522
  • [9] Translation-invariant optical neural network for image classification
    Hoda Sadeghzadeh
    Somayyeh Koohi
    Scientific Reports, 12
  • [10] Translation-invariant optical neural network for image classification
    Sadeghzadeh, Hoda
    Koohi, Somayyeh
    SCIENTIFIC REPORTS, 2022, 12 (01)