Text Value and Linguistic Characterization in Chinese Language Literature Based on Text Mining Techniques

被引:0
|
作者
Liu M. [1 ]
Hu S. [1 ]
Qing W. [1 ]
机构
[1] Department of Teacher Education, Nanchong Vocational and Technical College, Sichuan, Nanchong
关键词
Chinese language; Labeled-LDA; literature; Semantic network; Textual disambiguation; Vector space modeling;
D O I
10.2478/amns-2024-0486
中图分类号
学科分类号
摘要
This study applies text mining techniques to deeply analyze Chinese language and literature’s text value and linguistic features. The study adopts the methods of textual disambiguation, vector space modeling, semantic network and Labeled-LDA model. Taking the novels of Yu Hua and Ge Fei as an example, it reveals the differences between the two writers in linguistic features such as using punctuation, average word length, and sentence discrete degree. The study provides a comprehensive heat score for the novels based on three dimensions: reading base group, reading gain, and reading discussion. The results show that the frequency of period use in Yu Hua’s works is decentralized, while Ge Fei’s works are more concentrated. Ge Fei’s average word length is slightly higher, showing a tendency to use multi-syllabic words. The novel popularity and heat scores conform to a power law distribution, reflecting the Pareto rule that 80% of the popularity is concentrated on 20% of the hot novels. This study provides a new perspective on Chinese language and literature through the application of text mining technology, and its methods and tools can effectively enhance the effectiveness and efficiency of teaching. © 2023 published by Sciendo.
引用
收藏
相关论文
共 50 条
  • [31] Text Mining: Techniques, Applications, and Challenges
    Justicia de la Torre, C.
    Sanchez, D.
    Blanco, I
    Martin-Bautista, M. J.
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2018, 26 (04) : 553 - 582
  • [32] A SURVEY ON CLASSIFICATION TECHNIQUES FOR TEXT MINING
    Brindha, S.
    Sukumaran, S.
    Prabha, K.
    2016 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2016,
  • [33] Text Mining: Techniques, Applications and Issues
    Talib, Ramzan
    Hanif, Muhammad Kashif
    Ayesha, Shaeela
    Fatima, Fakeeha
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (11) : 414 - 418
  • [34] Text mining techniques for patent analysis
    Tseng, Yuen-Hsien
    Lin, Chi-Jen
    Lin, Yu-I
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (05) : 1216 - 1247
  • [35] Integrating IFC and CityGML Model at Schema Level by Using Linguistic and Text Mining Techniques
    Ding, Xiaohui
    Yang, Ji
    Liu, Lingjia
    Huang, Wumeng
    Wu, Peng
    IEEE ACCESS, 2020, 8 : 56429 - 56440
  • [36] LINGUISTIC CHARACTERIZATION OF TEXT ON ENVIRONMENTAL TOPICS
    Moiseenko, Anna, V
    NAUCHNYI DIALOG, 2019, (10): : 204 - 214
  • [37] The language in the Genesis (linguistic analysis of the Biblical text)
    Pflug, G
    MUTTERSPRACHE, 2004, 114 (02): : 151 - 162
  • [38] Approaches for text mining of mHealth literature
    Ozaydin, Bunyamin
    Zengul, Ferhat
    Oner, Nurettin
    Delen, Dursun
    MHEALTH, 2022, 8 (02)
  • [39] EXPERIMENTAL TECHNIQUES IN THE LINGUISTIC AND GRAMMATICAL EXPLOITATION OF TEXT
    Opris, Miruna
    17TH INTERNATIONAL CONFERENCE THE KNOWLEDGE-BASED ORGANIZATION, CONFERENCE PROCEEDINGS 2: ECONOMIC, SOCIAL AND ADMINISTRATIVE APPROACHES TO THE KNOWLEDGE-BASED ORGANIZATION, 2011, 2 : 957 - 961
  • [40] Blogger's Interest Mining Based on Chinese Text Classification
    Yang, Suhua
    Yan, Jianzhuo
    Gao, Chen
    Tan, Guohua
    NONLINEAR MATHEMATICS FOR UNCERTAINTY AND ITS APPLICATIONS, 2011, 100 : 611 - 618