Genres in the Prague Discourse Treebank

20175004519436
Affiliation: (1) Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics, Malostranské nám. 25, Prague 1, 118 00, Czech Republic
Source: 1600 (year) / European Media Laboratory GmbH (EML); Holmes Semantic Solutions; IMMI; KDictionaries; VoiceBox Technologies (volume) / European Language Resources Association (ELRA) (issue)
Abstract
We present a project to classify the documents of the Prague Discourse Treebank (Czech journalistic texts) by genre. Our main interest lies in opening the possibility to observe how text coherence is realized in different types (in the genre sense) of language data and, in the future, in exploring ways of using genres as a feature for multi-sentence-level language technologies. In the paper, we first describe the motivation and the concept of the genre annotation, and briefly introduce the Prague Discourse Treebank. Then, we elaborate on the process of manual annotation of genres in the treebank, from the annotators' manual work to post-annotation checks and inter-annotator agreement measurements. The annotated genres are subsequently analyzed together with the discourse relations already annotated in the treebank: we present the distributions of the annotated genres and the results of studying how the distributions of discourse relations differ across the individual genres.