DiSeg 1.0: The first system for Spanish discourse segmentation

Cited by: 13
Authors
da Cunha, Iria [1, 2, 3]
SanJuan, Eric [2]
Torres-Moreno, Juan-Manuel [2, 3, 4]
Lloberes, Marina [5]
Castellon, Irene [5]
Affiliations
[1] Univ Pompeu Fabra, Inst Univ Linguist Aplicada, Barcelona 08018, Spain
[2] Univ Avignon & Pays Vaucluse, Lab Informat Avignon, F-84911 Avignon 9, France
[3] Univ Nacl Autonoma Mexico, Inst Ingn, Mexico City 04510, DF, Mexico
[4] Ecole Polytech, Montreal, PQ H3C 3A7, Canada
[5] Univ Barcelona, E-08007 Barcelona, Spain
Keywords
Discourse parsing; Discourse segmentation; Shallow parsing; Rhetorical Structure Theory
DOI
10.1016/j.eswa.2011.06.058
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Discourse parsing is currently a very prominent research topic. However, there is no discourse parser for Spanish texts. The first stage in developing such a tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and evaluate its performance against a gold standard corpus, divided into a medical and a terminological subcorpus. We obtain promising results, which indicate that discourse segmentation is possible using shallow parsing. © 2011 Elsevier Ltd. All rights reserved.
Pages: 1671-1678
Number of pages: 8