Creating and Working with Corpus of Spoken Lithuanian

Cited by: 0
Authors
Kamandulyte-Merfeldiene, Laura [1]
Godliauskas, Povilas [2]
Affiliations
[1] Vytautas Magnus Univ, Dept Lithuanian Language, Kaunas, Lithuania
[2] Vytautas Magnus Univ, Kaunas, Lithuania
Keywords
Corpus of Spoken Lithuanian; spontaneous speech; ADS; CHILDES
DOI
10.3233/978-1-61499-442-8-179
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article we give a general overview of the development of the Corpus of Spoken Lithuanian. We describe the methodology of the corpus as well as the process of transcribing and coding the collected data. In addition, we provide a brief analysis based on the collected corpus data of spoken adult speech (ADS). The analysis covers four aspects: distribution of parts of speech, inflectional changes, features of syntax, and usage of loanwords. At the end of the paper we present conclusions and future expectations.
Pages: 179 / +
Page count: 2