Extracting Structured Scholarly Information from the Machine Translation Literature

被引:0
|
作者
Choi, Eunsol [1 ,4 ]
Horvat, Matic [2 ,4 ]
May, Jonathan [3 ]
Knight, Kevin [3 ]
Marcu, Daniel [3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Univ Cambridge, Cambridge, England
[3] Informat Sci Inst, Los Angeles, CA USA
[4] ISI, Los Angeles, CA USA
关键词
Information Extraction; Scientific Literature; Structured Prediction;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Understanding the experimental results of a scientific paper is crucial to understanding its contribution and to comparing it with related work. We introduce a structured, queryable representation for experimental results and a baseline system that automatically populates this representation. The representation can answer compositional questions such as: "Which are the best published results reported on the NIST 09 Chinese to English dataset?" and "What are the most important methods for speeding up phrase-based decoding?" Answering such questions usually involves lengthy literature surveys. Current machine reading for academic papers does not usually consider the actual experiments, but mostly focuses on understanding abstracts. We describe annotation work to create an initial hscientific paper; experimental results representationi corpus. The corpus is composed of 67 papers which were manually annotated with a structured representation of experimental results by domain experts. Additionally, we present a baseline algorithm that characterizes the difficulty of the inference task.
引用
收藏
页码:421 / 425
页数:5
相关论文
共 50 条
  • [1] Better Neural Machine Translation by Extracting Linguistic Information from BERT
    Shavarani, Hassan S.
    Sarkar, Anoop
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2772 - 2783
  • [2] Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature
    Coelho, Luis Pedro
    Ahmed, Amr
    Arnold, Andrew
    Kangas, Joshua
    Sheikh, Abdul-Saboor
    Xing, Eric P.
    Cohen, William W.
    Murphy, Robert F.
    LINKING LITERATURE, INFORMATION, AND KNOWLEDGE FOR BIOLOGY, 2010, 6004 : 23 - +
  • [3] A machine learning framework for extracting information from biological pathway images in the literature
    Kwon, Mun Su
    Lee, Junkyu
    Kim, Hyun Uk
    METABOLIC ENGINEERING, 2024, 86 : 1 - 11
  • [4] Extracting Information about Research Resources from Scholarly Papers
    Saji, Ayahito
    Matsubara, Shigeki
    FROM BORN-PHYSICAL TO BORN-VIRTUAL: AUGMENTING INTELLIGENCE IN DIGITAL LIBRARIES, ICADL 2022, 2022, 13636 : 440 - 448
  • [5] EXTRACTING STRUCTURED INFORMATION FROM PATHOLOGY REPORTS USING NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING
    Odisho, Anobel
    Park, Briton
    Altieri, Nicholas
    Murdoch, William
    Carroll, Peter
    Coopberberg, Matthew
    Yu, Bin
    JOURNAL OF UROLOGY, 2019, 201 (04): : E1031 - E1032
  • [6] Extracting information from the literature by text mining
    Kostoff, RN
    DeMarco, RA
    ANALYTICAL CHEMISTRY, 2001, 73 (13) : 370A - 378A
  • [7] Extracting kinetic information from literature with KineticRE
    Freitas, Ana Alao
    Costa, Hugo
    Rocha, Miguel
    Rocha, Isabel
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2015, 12 (04): : 282
  • [8] Extracting and synthesizing information from a literature review
    Foster, Roxie L.
    JOURNAL FOR SPECIALISTS IN PEDIATRIC NURSING, 2013, 18 (02) : 85 - 88
  • [9] Extracting information automatically from biological literature
    Blaschke, C
    Hoffmann, R
    Oliveros, JC
    Valencia, A
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2001, 2 (05): : 310 - 313
  • [10] Extracting parallel phrases from comparable data for machine translation
    Hewavitharana, Sanjika
    Vogel, Stephan
    NATURAL LANGUAGE ENGINEERING, 2016, 22 (04) : 549 - 573