DeepTable: a permutation invariant neural network for table orientation classification

被引:7
|
作者
Habibi, Maryam [1 ]
Starlinger, Johannes [1 ]
Leser, Ulf [1 ]
机构
[1] Humboldt Univ, Berlin, Germany
关键词
Information discovery; Tabular data; Table orientation classification; Deep learning; Machine learning;
D O I
10.1007/s10618-020-00711-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables are a common way to present information in an intuitive and concise manner. They are used extensively in media such as scientific articles or web pages. Automatically analyzing the content of tables bears special challenges. One of the most basic tasks is determination of the orientation of a table: In column tables, columns represent one entity with the different attribute values present in the different rows; row tables are vice versa, and matrix tables give information on pairs of entities. In this paper, we address the problem of classifying a given table into one of the three layouts horizontal (for row tables), vertical (for column tables), and matrix. We describe DeepTable, a novel method based on deep neural networks designed for learning from sets. Contrary to previous state-of-the-art methods, this basis makes DeepTable invariant to the permutation of rows or columns, which is a highly desirable property as in most tables the order of rows and columns does not carry specific information. We evaluate our method using a silver standard corpus of 5500 tables extracted from biomedical articles where the layout was determined heuristically. DeepTable outperforms previous methods in both precision and recall on our corpus. In a second evaluation, we manually labeled a corpus of 300 tables and were able to confirm DeepTable to reach superior performance in the table layout classification task. The codes and resources introduced here are available at.
引用
收藏
页码:1963 / 1983
页数:21
相关论文
共 50 条
  • [41] Neural learning invariant to network size changes
    de Angulo, VR
    Torras, C
    ARTIFICIAL NEURAL NETWORKS-ICANN 2001, PROCEEDINGS, 2001, 2130 : 33 - 40
  • [42] A NEURAL NETWORK FOR INVARIANT PATTERN-RECOGNITION
    BIENENSTOCK, E
    VONDERMALSBURG, C
    EUROPHYSICS LETTERS, 1987, 4 (01): : 121 - 126
  • [43] Wafer map failure pattern classification using geometric transformation-invariant convolutional neural network
    Iljoo Jeong
    Soo Young Lee
    Keonhyeok Park
    Iljeok Kim
    Hyunsuk Huh
    Seungchul Lee
    Scientific Reports, 13
  • [44] Wafer map failure pattern classification using geometric transformation-invariant convolutional neural network
    Jeong, Iljoo
    Lee, Soo Young
    Park, Keonhyeok
    Kim, Iljeok
    Huh, Hyunsuk
    Lee, Seungchul
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [45] An experimental study on upper limb position invariant EMG signal classification based on deep neural network
    Mukhopadhyay, Anand Kumar
    Samui, Suman
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 55
  • [46] Classification of neural network hardware
    Computer Engineering Dep Eastern, Mediterranean Univ, Magosa, Cyprus
    Neural Network World, 1 (11-27):
  • [47] A neural network for texture classification
    Branca, A
    Tafuri, M
    Attolico, G
    Distante, A
    MELECON '96 - 8TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, PROCEEDINGS, VOLS I-III: INDUSTRIAL APPLICATIONS IN POWER SYSTEMS, COMPUTER SCIENCE AND TELECOMMUNICATIONS, 1996, : 653 - 656
  • [48] Mirror invariant convolutional neural networks for image classification
    Lu, Shufang
    Li, Yan
    Wang, Minqian
    Gao, Fei
    IET IMAGE PROCESSING, 2022, 16 (06) : 1626 - 1635
  • [49] Complete Nuclear Permutation Inversion Invariant Artificial Neural Network (CNPI-ANN) Diabatization for the Accurate Treatment of Vibronic Coupling Problems
    Williams, David M. G.
    Eisfeld, Wolfgang
    JOURNAL OF PHYSICAL CHEMISTRY A, 2020, 124 (37): : 7608 - 7621
  • [50] Permutation invariant polynomial neural network approach to fitting potential energy surfaces. III. Molecule-surface interactions
    Jiang, Bin
    Guo, Hua
    JOURNAL OF CHEMICAL PHYSICS, 2014, 141 (03):