Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature

Cited by: 0
Authors
Gabor, Kata [1]
Zargayouna, Haifa [1]
Buscaldi, Davide [1]
Tellier, Isabelle [2]
Charnois, Thierry [1]
Affiliations
[1] Univ Paris 13, Sorbonne Paris Cite, CNRS, LIPN, UMR 7030, Villetaneuse, France
[2] Univ Sorbonne Paris Cite, Univ Sorbonne Nouvelle Paris 3, PSL Res Univ, CNRS, ENS Paris, LaTTiCe, UMR 8094, Paris, France
Keywords
semantic annotation; semantic relations; ACL Anthology
DOI
Not available
Chinese Library Classification number
H [Language, Writing]
Discipline classification code
05
Abstract
This paper describes the process of creating a corpus annotated for concepts and semantic relations in the scientific domain. A part of the ACL Anthology Corpus was selected for annotation, but the annotation process itself is not specific to the computational linguistics domain and could be applied to any scientific corpus. Concepts were identified and annotated fully automatically, based on a combination of terminology extraction and available ontological resources. A typology of semantic relations between concepts is also proposed. This typology, consisting of 18 domain-specific and 3 generic relations, is the result of a corpus-based investigation of the text sequences occurring between concepts in sentences. A sample of 500 abstracts from the corpus is currently being manually annotated with these semantic relations. Only explicit relations are taken into account, so that the data could serve to train or evaluate pattern-based semantic relation classification systems.
Pages: 3694-3701
Page count: 8
Related papers
50 in total
  • [1] Automatic Annotation of Semantic Term Types in the Complete ACL Anthology Reference Corpus
    Schumann, Anne-Kathrin
    Alonso, Hector Martinez
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018: 3707-3715
  • [2] Automatic Semantic Annotation of Polish Dialogue Corpus
    Mykowiecka, Agnieszka
    Marciniak, Malgorzata
    Glowinska, Katarzyna
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 625 - 632
  • [3] The ACL anthology network corpus
    Dragomir R. Radev
    Pradeep Muthukrishnan
    Vahed Qazvinian
    Amjad Abu-Jbara
    Language Resources and Evaluation, 2013, 47 : 919 - 944
  • [4] The ACL anthology network corpus
    Radev, Dragomir R.
    Muthukrishnan, Pradeep
    Qazvinian, Vahed
    Abu-Jbara, Amjad
    LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (04) : 919 - 944
  • [5] Research and development of semantic annotation platform for scientific literature
    Liu, Yao
    Zhang, Ziyuan
    Huang, Yi
    ICIC Express Letters, 2016, 10 (07): 1787-1794
  • [6] Corpus Annotation as a Scientific Task
    Scott, Donia
    Barone, Rossano
    Koeling, Rob
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012: 1481-1485
  • [7] Semantic disambiguation in Automatic Semantic Annotation
    Qi, Xin
    Xiao, Min
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL IV, 2010: 64-67
  • [8] Semantic Disambiguation in Automatic Semantic Annotation
    Qi, Xin
    Xiao, Min
    APPLIED INFORMATICS AND COMMUNICATION, PT 4, 2011, 227 : 135 - 142
  • [9] Semantic annotation of (Czech) corpus texts
    Pala, K
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 56 - 61
  • [10] Semantic annotation of Nouns in Sensem Corpus
    Castellon, Irene
    Lloberes, Marina
    Fisas, Beatriz
    Julia, Albert
    Rigau, German
    Climent, Salvador
    Coll-Florit, Marta
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): 315-316