Extracting Structured Scholarly Information from the Machine Translation Literature

被引:0
|
作者
Choi, Eunsol [1 ,4 ]
Horvat, Matic [2 ,4 ]
May, Jonathan [3 ]
Knight, Kevin [3 ]
Marcu, Daniel [3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Univ Cambridge, Cambridge, England
[3] Informat Sci Inst, Los Angeles, CA USA
[4] ISI, Los Angeles, CA USA
关键词
Information Extraction; Scientific Literature; Structured Prediction;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: "Which are the best published results reported on the NIST 09 Chinese to English dataset?" and "What are the most important methods for speeding up phrase-based decoding?" Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
引用
收藏
页码:421 / 425
页数:5
相关论文
共 50 条
  • [21] FROM SCHOLARLY MACHINES TO THE SCHOLARLY - MACHINE
    Pajon, Patrick
    TRICTRAC-JOURNAL OF WORLD MYTHOLOGY AND FOLKLORE, 2010, 3 (01) : 30 - 40
  • [22] Machine Translation and Global Research: Towards Improved Machine Translation Literacy in the Scholarly Community
    LeBlanc, Matthieu
    JOURNAL OF SPECIALISED TRANSLATION, 2021, (35): : 233 - 234
  • [23] Recognition techniques for extracting information from semi-structured documents
    Della Ventura, A
    Gagliardi, I
    Zonta, B
    DOCUMENT RECOGNITION AND RETRIEVAL VIII, 2001, 4307 : 130 - 137
  • [24] Extracting Structured Information from the Textual Description of Geometry Word Problems
    Boob, Archana
    Bodakhe, Prajakta
    Radke, Mansi A.
    Deshpande, Umesh A.
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 31 - 37
  • [25] A strategy for extracting information from semi-structured web pages
    Shaker, Mahmoud
    Ibrahim, Hamidah
    Mustapha, Aida
    Abdullah, Lili Nurliyana
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2010, 6 (04) : 304 - 318
  • [26] Enhancing scholarly literature with compound information
    Cleeren, Maarten
    Hoctor, Timothy
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [27] Extracting contrastive information from negation patterns in biomedical literature
    Korea Advanced Institute of Science and Technology
    不详
    ACM Trans. Asian Lang. Inf. Process., 2006, 1 (44-60):
  • [28] Recent progress in automatically extracting information from the pharmacogenomic literature
    Garten, Yael
    Coulet, Adrien
    Altman, Russ B.
    PHARMACOGENOMICS, 2010, 11 (10) : 1467 - 1489
  • [29] A Framework for Extracting Information from Semi-Structured Web Data Sources
    Shaker, Malunoud
    Ibrahim, Hamidah
    Mustapha, Aida
    Abdullah, Lili Nurliyana
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 27 - 31
  • [30] Dr. Inventor Framework: Extracting Structured Information from Scientific Publications
    Ronzano, Francesco
    Saggion, Horacio
    DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 209 - 220