Creating and Working with Corpus of Spoken Lithuanian

被引:0
|
作者
Kamandulyte-Merfeldiene, Laura [1 ]
Godliauskas, Povilas [2 ]
机构
[1] Vytautas Magnus Univ, Dept Lithuanian Language, Kaunas, Lithuania
[2] Vytautas Magnus Univ, Kaunas, Lithuania
关键词
Corpus of Spoken Lithuanian; spontaneous speech; ADS; CHILDES;
D O I
10.3233/978-1-61499-442-8-179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we give a general overview on the development of the Corpus of Spoken Lithuanian. We consider the methodology of the corpus as well as the process of transcribing and coding the collected data. In addition, we provide a brief analysis based on the collected corpus data of spoken adult speech (ADS). The analysis is conducted according to four aspects: distribution of parts of speech, inflectional changes, features of syntax, and usage of loanwords. In the end of the paper we provide conclusions and future expectations.
引用
收藏
页码:179 / +
页数:2
相关论文
共 50 条
  • [21] How to expand the corpus of spoken English
    张光华
    校园英语, 2015, (13) : 44 - 44
  • [22] The spoken corpus of Cameroon Pidgin English
    Ozon, Gabriel
    Ayafor, Miriam
    Green, Melanie
    Fitzgerald, Sarah
    WORLD ENGLISHES, 2017, 36 (03) : 427 - 447
  • [23] Macrosyntactic Segmenters of a French spoken corpus
    Wang, Ilaine
    Kahane, Sylvain
    Tellier, Isabelle
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3891 - 3896
  • [24] Corpus of Contemporary Lithuanian Language - the Standardised Way
    Rimkute, Erika
    Kovalevskaite, Jolanta
    Melninkaite, Vida
    Utka, Andrius
    Vitkute-Adzgauskiene, Daiva
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, 2010, 219 : 154 - 160
  • [25] Coreference Annotation Scheme and Corpus for Lithuanian Language
    Zitkus, Voldemaras
    Butkiene, Rita
    2018 FIFTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2018, : 245 - 250
  • [26] Overcoming challenges in corpus construction: The Spoken British National Corpus 2014
    Hanks, Elizabeth
    REGISTER STUDIES, 2023, 5 (01) : 136 - 142
  • [27] Sustaining a Corpus for Spoken Turkish Discourse: Accessibility and Corpus Management Issues
    Ruhi, Sukriye
    Eroz-Tuga, Betil
    Hatipoglu, Ciler
    Isik-Guler, Hale
    Acar, M. Gunes Can
    Eryilmaz, Kerem
    Can, Humeyra
    Karakas, Ozlem
    Karadas, Derya Cokal
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : F44 - F48
  • [28] Overcoming Challenges in Corpus Construction: The Spoken British National Corpus 2014
    Liu, Siqi
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (04) : 537 - 540
  • [29] Overcoming Challenges in Corpus Construction: The Spoken British National Corpus 2014
    Wang, Jiawei
    INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2020, 25 (04) : 504 - 510
  • [30] Overcoming Challenges in Corpus Construction: The Spoken British National Corpus 2014
    Wang, Yawen
    Sun, Peijian Paul
    SYSTEM, 2021, 101