Corpus-based research of spoken language: the state-of-the-art for Czech and English

被引:0
|
作者
Cermakova, Anna [1 ]
Koprivova, Marie [1 ]
机构
[1] Ustav Pro Jazyk Cesky AV CR, Vvi, Letenska 4, Prague 11851 1, Czech Republic
来源
SLOVO A SLOVESNOST | 2018年 / 79卷 / 03期
关键词
spoken language research; corpus linguistics; spoken Czech; basic descriptive unit for spoken language; ANNOTATION SCHEME; GRAMMAR; SYNTAX; CONVERSATION; TRANSCRIPTION; DISCOURSE;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The article aims to review corpus-based research on spoken language, emphasizing issues in description and conceptualization of the grammar of spoken language in relation to the grammar of written language. The review first briefly looks at the development of spoken corpora, from simply transcribed corpora without sound alignment to today's sophisticated multi-modal corpora. The main part of the article deals with issues concerning the metalanguage for the description of spoken language, the choice of its basic descriptive unit, the status of basic linguistic categories such as part-of-speech, and typical lexical and grammatical devices. The existing extensive research on spoken English is reviewed and in line with it, illustrative examples based on Czech spoken corpora are provided. These are further contrasted with examples from written data to enhance the inherent differences between spoken and written language and the need to adjust the metalanguage of the description.
引用
收藏
页码:217 / 240
页数:24
相关论文
共 50 条