Toolbox for annotating spontaneous speech corpora (Computational Linguistics Lab - UAM)

被引:0
|
作者
Moreno Sandoval, Antonio [1 ]
Guirao Miras, Jose Ma. [2 ]
Torre Toledano, Doroteo [3 ]
机构
[1] UAM, Lab Lingtifst Informat, Madrid, Spain
[2] UGranada, Dept Lenguajes & Sistemas Informat, Granada, Spain
[3] UAM, Dept Ingenierfa Informat, Madrid, Spain
来源
关键词
Corpus annotation; phonology; syllable; lemmatization; PoS tagging;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
We show a toolbox for linguistic annotation (including phonology, sillabification, part of speech, lemma and morphological features) especially adapted to Spanish spoken corpora. These tools have been developed and validated against several spontaneous speech corpora compiled by the Laboratorio de Linguistica Informatica-UAM: C-ORAL-ROM, CHIEDE, CORLEC
引用
收藏
页码:301 / 302
页数:2
相关论文
共 27 条
  • [1] Annotating discourse markers in spontaneous speech corpora on an example for the Slovenian language
    Verdonik, Darinka
    Rojc, Matej
    Stabej, Marko
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2007, 41 (02) : 147 - 180
  • [2] Annotating discourse markers in spontaneous speech corpora on an example for the Slovenian language
    Darinka Verdonik
    Matej Rojc
    Marko Stabej
    [J]. Language Resources and Evaluation, 2007, 41 : 147 - 180
  • [3] CORPORA FOR COMPUTATIONAL LINGUISTICS
    Orsan, Constantin
    Le An Ha
    Evans, Richard
    Hasler, Laura
    Mitkov, Ruslan
    [J]. ILHA DO DESTERRO-A JOURNAL OF ENGLISH LANGUAGE LITERATURES IN ENGLISH AND CULTURAL STUDIES, 2007, 52 : 65 - 101
  • [4] Encoding, Annotating, Theorizing Rapprochement of Literary Studies and Linguistics by way of Corpora
    Braun, Manuel
    [J]. LILI-ZEITSCHRIFT FUR LITERATURWISSENSCHAFT UND LINGUISTIK, 2013, 43 (172): : 83 - 90
  • [5] Part of speech and tagging in Computational Linguistics
    Oliveira, Claudia
    de Freitas, Maria Claudia
    [J]. CALIDOSCOPIO, 2006, 4 (03): : 179 - 188
  • [6] CLASSIC AND MODERN SPANISH CORPORA: BETWEEN PHILOLOGY AND COMPUTATIONAL LINGUISTICS
    Calderon Campos, Miguel
    [J]. RLA-REVISTA DE LINGUISTICA TEORICA Y APLICADA, 2019, 57 (02): : 41 - 64
  • [7] Automatic language recognition on spontaneous speech: The ATVS-UAM system
    Toledano, Doroteo T.
    Ignacio, Lopez-Moreno
    Mateos, Ismael
    Alejandro, Abejon
    Ramos, Daniel
    Gonzalez-Rodriguez, Joaquin
    [J]. AES: Journal of the Audio Engineering Society, 2009, 57 (10): : 788 - 806
  • [8] Automatic Language Recognition on Spontaneous Speech: The ATVS-UAM System
    Toledano, Doroteo T.
    Lopez-Moreno, Ignacio
    Mateos, Ismael
    Abejon, Alejandro
    Ramos, Daniel
    Gonzalez-Rodriguez, Joaquin
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2009, 57 (10): : 788 - 806
  • [9] Praaline: An Open-Source System for Managing, Annotating, Visualising and Analysing Speech Corpora
    Christodoulides, George
    [J]. 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2018, : 111 - 115
  • [10] CAN YOU SEE WHOSE SPEECH IS OVERLAPPING + COMPUTER CORPORA IN LINGUISTICS
    MEYER, CF
    MORRIS, RA
    BLACHMAN, E
    [J]. VISIBLE LANGUAGE, 1994, 28 (02) : 110 - 133