Creating and Working with Corpus of Spoken Lithuanian

Authors
Kamandulyte-Merfeldiene, Laura [1]
Godliauskas, Povilas [2]
Affiliations
[1] Vytautas Magnus Univ, Dept Lithuanian Language, Kaunas, Lithuania
[2] Vytautas Magnus Univ, Kaunas, Lithuania
Keywords
Corpus of Spoken Lithuanian; spontaneous speech; ADS; CHILDES;
DOI
10.3233/978-1-61499-442-8-179
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this article we give a general overview of the development of the Corpus of Spoken Lithuanian. We consider the methodology of the corpus as well as the process of transcribing and coding the collected data. In addition, we provide a brief analysis based on the collected corpus data of adult-directed speech (ADS). The analysis covers four aspects: distribution of parts of speech, inflectional changes, features of syntax, and usage of loanwords. At the end of the paper we provide conclusions and future expectations.
Pages: 179 / +
Page count: 2
Related papers
50 in total
  • [1] Creating a corpus of spoken English of Polish EFL learners
    Baczkowska, A
    PALC'99: PRACTICAL APPLICATIONS IN LANGUAGE CORPORA, 2000, 1 : 221 - 231
  • [2] CORPORA OF SPOKEN LITHUANIAN
    Dabasinskiene, Ineta
    Kamandulyte, Laura
    EESTI RAKENDUSLINGVISTIKA UHINGU AASTARAAMAT, 2009, 5 : 67 - 77
  • [3] Lithuanian-Latvian-Lithuanian Parallel Corpus
    Utka, Andrius
    Levane-Petrova, Kristine
    Bielinskiene, Agne
    Kovalevskaite, Jolanta
    Rimkute, Erika
    Vevere, Daira
    HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 260 - +
  • [4] Building and working with a spoken corpus of wine tasting situations: the OenoLex Burgundy project
    Gautier, Laurent
    Hohota, Valentina
    STUDIA UNIVERSITATIS BABES-BOLYAI PHILOLOGIA, 2014, 59 (04) : 157 - 173
  • [5] The Wenzhou Spoken Corpus
    Newman, John
    Lin, Jingxia
    Butler, Terry
    Zhang, Eric
    CORPORA, 2007, 2 (01) : 97 - 109
  • [6] Spock - a Spoken Corpus Client
    Janssen, Maarten
    Freitas, Tiago
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008 : 3473 - 3478
  • [7] Where are the corpus on spoken French?
    Cappeau, Paul
    Gadet, Francoise
    REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2007, 12 (01) : 129 - 133
  • [8] The AUTONOMATA Spoken Names Corpus
    van den Heuvel, Henk
    Martens, Jean-Pierre
    D'hoore, Bart
    D'hanens, Kristof
    Konings, Nanneke
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008 : 140 - 143
  • [9] The design of the Spoken Dutch Corpus
    Oostdijk, N
    NEW FRONTIERS OF CORPUS RESEARCH, 2002, (36) : 105 - 112
  • [10] The corpus of Spanish spoken in Tunja
    Calderon Noguera, Donald Freddy
    CUADERNOS DE LINGUISTICA HISPANICA, 2008, 12 : 17 - 30