Interpreting data from scanned tables

被引:4
|
作者
Farrukh, Waleed [1 ]
Foncubierta-Rodriguez, Antonio [2 ]
Ciubotaru, Anca-Nicoleta [2 ]
Jaume, Guillaume [2 ]
Bekas, Costas [2 ]
Goksel, Orcun [1 ]
Gabrani, Maria [2 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] IBM Res, Ruschlikon, Switzerland
关键词
D O I
10.1109/ICDAR.2017.250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Densely-packed but structured scientific data are typically presented in the form of tables, which often appear in raster image form. To interpret data from scanned tables, understanding their hierarchical structure is vital. To further address the vast variability of table representations, we propose a fully automatic methodology that uses a bottom -up reasoning that is independent on the existence of representation features, such as lines. We evaluate our approach on the ICDAR 2013 dataset and demonstrate its effectiveness on detecting tables cells and their content and classifying header and data cells. For detecting the cell hierarchy, we demonstrate results on synthetic data due to lack of ground truth.
引用
收藏
页码:5 / 6
页数:2
相关论文
共 50 条
  • [1] Reproducing tables in scanned documents
    Jahan, M. A. C. Akmal
    Ragel, Roshan G.
    [J]. JOURNAL OF THE NATIONAL SCIENCE FOUNDATION OF SRI LANKA, 2016, 44 (04): : 367 - 377
  • [2] Locating Tables in Scanned Documents for Reconstructing and Republishing
    Jahan, M. A. C. Akmal
    Ragel, Roshan G.
    [J]. 2014 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS), 2014,
  • [3] Interpreting Medical Tables as Linked Data for Generating Meta-Analysis Reports
    Mulwad, Varish
    Finin, Tim
    Joshi, Anupam
    [J]. 2014 IEEE 15TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2014, : 677 - 686
  • [4] INTERPRETING GRAPHS AND TABLES - SELBY,PH
    ELASHOFF, JD
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1977, 72 (357) : 235 - 236
  • [5] INTERPRETING GRAPHS AND TABLES - SELBY,PH
    RANDALL, DG
    [J]. JOURNAL OF BUSINESS COMMUNICATION, 1977, 14 (02): : 59 - 59
  • [6] Modelling decision tables from data
    Wets, G
    Vanthienen, J
    Timmermans, H
    [J]. RESEARCH AND DEVELOPMENT IN KNOWLEDGE DISCOVERY AND DATA MINING, 1998, 1394 : 412 - 413
  • [7] Using standardised tables for interpreting Loglinear models
    Hendrickx J.
    [J]. Quality and Quantity, 2005, 38 (5): : 603 - 620
  • [8] EXPECTANCY TABLES: A METHOD OF INTERPRETING CORRELATION COEFFICIENTS
    Bittner, Reign H.
    Wilder, Carlton E.
    [J]. JOURNAL OF EXPERIMENTAL EDUCATION, 1946, 14 (03): : 245 - 252
  • [9] Interpreting Results in 2 x 2 Tables
    Sauerbrei, W.
    Blettner, M.
    [J]. DEUTSCHES ARZTEBLATT INTERNATIONAL, 2009, 106 (48): : 795 - 800
  • [10] Using standardised tables for interpreting loglinear models
    Hendrickx, J
    [J]. QUALITY & QUANTITY, 2004, 38 (05) : 603 - 620