Corpus-Based Analysis of Lexical Features of Mongolian Language Policy Text

Times cited: 0
Authors
Annaer [1]
Dahubaiyila [1]
Affiliation
[1] Inner Mongolia Univ, Sch Mongolian Studies, Coll Rd 235, Hohhot 010021, Peoples R China
Source
Keywords
Mongolian language policy; Policy text analysis; Distribution of parts of speech; Lexical density; Word frequency
DOI
10.1007/978-981-97-0586-3_26
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Language policy texts, like other policy texts, call for systematic policy text analysis. Drawing on a corpus of 100 policy documents, this study examines several lexical attributes, including the distribution of parts of speech, type-token ratio, and lexical density, through comparative data analysis. The corpus is divided into twelve document types: instructions, decisions, notices, reports, regulations, ways, rules, methods, summaries, plans, speeches, and papers. Using natural language processing techniques, the study also produces word frequency tables and word cloud visualizations of the Mongolian language policy text corpus.
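The measures named in the abstract have standard definitions: type-token ratio is the number of distinct word forms divided by the number of tokens, and lexical density is the share of tokens that are content words. The following is a minimal sketch of those definitions, not the authors' pipeline. The function name `lexical_stats`, the tag set `CONTENT_TAGS`, and the assumption that the input is already segmented and POS-tagged are all hypothetical; the record does not state which tagger or tag set the paper uses.

```python
from collections import Counter

# Hypothetical content-word tags; the paper's actual tag set is not given here.
CONTENT_TAGS = {"NOUN", "VERB", "ADJ", "ADV"}


def lexical_stats(tagged_tokens, top_n=3):
    """Compute basic lexical measures from (word, pos_tag) pairs."""
    words = [word for word, _ in tagged_tokens]
    total = len(words)
    if total == 0:
        # Avoid division by zero on an empty document.
        return {"ttr": 0.0, "lexical_density": 0.0,
                "pos_distribution": {}, "top_words": []}

    word_freq = Counter(words)
    pos_counts = Counter(tag for _, tag in tagged_tokens)
    content_tokens = sum(count for tag, count in pos_counts.items()
                         if tag in CONTENT_TAGS)

    return {
        # Type-token ratio: distinct word forms / total tokens.
        "ttr": len(word_freq) / total,
        # Lexical density: content-word tokens / total tokens.
        "lexical_density": content_tokens / total,
        # Relative frequency of each part-of-speech tag.
        "pos_distribution": {tag: count / total
                             for tag, count in pos_counts.items()},
        # Most frequent word forms, as input for a frequency table.
        "top_words": word_freq.most_common(top_n),
    }


# Toy input; a real run would use tagged Mongolian policy text.
sample = [("policy", "NOUN"), ("of", "ADP"),
          ("language", "NOUN"), ("policy", "NOUN")]
stats = lexical_stats(sample)
```

Type-token ratio falls as texts get longer, so comparing it across the twelve document types would normally require samples of similar length or a length-normalized variant.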
Pages: 331-341
Page count: 11
Related papers
50 in total
  • [21] Corpus-based predictive text input
    Komatsu, H
    Takabayashi, S
    Masui, T
    Proceedings of the 2005 International Conference on Active Media Technology (AMT 2005), 2005, : 75 - 80
  • [22] A Comparative Corpus-Based Analysis of Lexical Collocations used in EFL textbooks
    Molavi, Ahmad
    Koosha, Mansour
    Hosseini, Hossein
    LATIN AMERICAN JOURNAL OF CONTENT & LANGUAGE INTEGRATED-LACLIL, 2014, 7 (01): : 66 - 81
  • [23] A Corpus-based Study on Lexical Analysis of English Tour Guide Commentary
    孔燕
    海外英语, 2016, (23) : 246 - 248
  • [24] The language of public health-a corpus-based analysis
    Millar, N.
    Budgell, B.
    JOURNAL OF PUBLIC HEALTH-HEIDELBERG, 2008, 16 (05): : 369 - 374
  • [25] A Corpus-Based Study of Lexical Features of Nonnative English from the Perspective of the Belt and Road
    Ling Zhenghua
    PROCEEDINGS OF SYMPOSIUM OF POLICING DIPLOMACY AND THE BELT & ROAD INITIATIVE, 2016, 2016, : 246 - 251
  • [26] A Corpus-based Evaluation of Lexical Components of a Domain-specific Text to Knowledge Mapping Prototype
    Shams, Rushdi
    Elsayed, Adel
    2008 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY: ICCIT 2008, VOLS 1 AND 2, 2008, : 457 - +
  • [27] Corpus-based Generation of Prosodic Features from Text Based on Generation Process Model
    Hirose, Keikichi
    Ochi, Keiko
    Minematsu, Nobuaki
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1033 - +
  • [28] Does simplification hold true for machine translations? A corpus-based analysis of lexical diversity in text varieties across genres
    Niu, Jiang
    Jiang, Yue
    HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2024, 11 (01):
  • [29] A Corpus-Based Assessment of French CEFR Lexical Content
    Kusseling, Francoise
    Lonsdale, Deryle
    CANADIAN MODERN LANGUAGE REVIEW-REVUE CANADIENNE DES LANGUES VIVANTES, 2013, 69 (04): : 436 - 461
  • [30] Parallel Text Identification Using Lexical and Corpus Features for the English-Maori Language Pair
    Mohaghegh, Mahsa
    Sarrafzadeh, Abdolhossein
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 910 - 915