Corpus-Based Analysis of Lexical Features of Mongolian Language Policy Text

被引:0
|
作者
Annaer [1 ]
Dahubaiyila [1 ]
机构
[1] Inner Mongolia Univ, Sch Mongolian Studies, Coll Rd 235, Hohhot 010021, Peoples R China
来源
关键词
Mongolian language policy; Policy text analysis; Distribution of parts of speech; Lexical density; Word frequency;
D O I
10.1007/978-981-97-0586-3_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Like other policy texts, language policy texts also need policy text analysis. Leveraging a corpus of 100 policy documents, this study investigates various linguistic attributes, including the distribution of parts of speech, type-token ratio, and lexical density, through data comparative analysis. Furthermore, the paper categorizes the corpus into twelve distinct types, encompassing instructions, decisions, notices, reports, regulations, ways, rules, methods, summaries, plans, speeches and papers. Employing natural language processing techniques, the study also utilizes frequency statistics and wordclouds to provide both word frequency statistical tables and visual wordcloud representations of the Mongolian language policy text corpus.
引用
收藏
页码:331 / 341
页数:11
相关论文
共 50 条