Corpus-Based Analysis of Lexical Features of Mongolian Language Policy Text

Cited by: 0
Authors
Annaer [1]
Dahubaiyila [1]
Affiliation
[1] Inner Mongolia Univ, Sch Mongolian Studies, Coll Rd 235, Hohhot 010021, Peoples R China
Keywords
Mongolian language policy; Policy text analysis; Distribution of parts of speech; Lexical density; Word frequency
DOI
10.1007/978-981-97-0586-3_26
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Like other policy texts, language policy texts warrant systematic text analysis. Drawing on a corpus of 100 policy documents, this study examines lexical attributes, including the distribution of parts of speech, type-token ratio, and lexical density, through comparative analysis of the data. The corpus is divided into twelve document types: instructions, decisions, notices, reports, regulations, ways, rules, methods, summaries, plans, speeches, and papers. Using natural language processing techniques, the study also produces word frequency tables and word cloud visualizations of the Mongolian language policy text corpus.
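The measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `lexical_stats`, the tag set `CONTENT_TAGS`, and the English sample tokens are assumptions made for illustration, and the input is assumed to be already segmented and part-of-speech tagged. Type-token ratio is computed as distinct words over total tokens, and lexical density as content-word tokens over total tokens.

```python
from collections import Counter

# Assumed set of content-word tags (hypothetical; the paper's tagset is not given here).
CONTENT_TAGS = {"NOUN", "VERB", "ADJ", "ADV"}


def lexical_stats(tagged_tokens, top_n=3):
    """Compute basic lexical measures from a list of (word, pos) pairs."""
    words = [word for word, _ in tagged_tokens]
    total = len(words)
    if total == 0:
        raise ValueError("tagged_tokens must not be empty")

    word_freq = Counter(words)
    pos_counts = Counter(pos for _, pos in tagged_tokens)
    content_tokens = sum(1 for _, pos in tagged_tokens if pos in CONTENT_TAGS)

    return {
        # Distinct word forms divided by total tokens.
        "type_token_ratio": len(word_freq) / total,
        # Content-word tokens divided by total tokens.
        "lexical_density": content_tokens / total,
        # Share of tokens carrying each part-of-speech tag.
        "pos_distribution": {pos: n / total for pos, n in pos_counts.items()},
        # Most frequent word forms with their counts.
        "top_words": word_freq.most_common(top_n),
    }


# Illustrative English sample; a real run would use tagged Mongolian text.
sample = [
    ("policy", "NOUN"), ("of", "ADP"), ("language", "NOUN"), ("and", "CCONJ"),
    ("policy", "NOUN"), ("text", "NOUN"), ("is", "AUX"), ("analyzed", "VERB"),
]
stats = lexical_stats(sample)
print(stats["type_token_ratio"], stats["lexical_density"], stats["top_words"][0])
```

The word counts in `top_words` are the kind of frequency table that a word cloud renderer would take as input. Type-token ratio is sensitive to text length, so comparisons across document types of different sizes would need equal-sized samples or a normalized variant.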
Pages: 331-341
Page count: 11