Text mining of syntactic complexity in L2 writing: an LDA topic modeling approach

被引:0
|
作者
Huang, Zhiyun [1 ]
Jiang, Zhanhao [1 ]
机构
[1] Southeast Univ, Nanjing, Peoples R China
关键词
syntactic complexity; second language writing; topic modeling; Latent Dirichlet Allocation; LINGUISTIC FEATURES; QUALITY; FLUENCY; ACCURACY; LANGUAGE; PROFICIENCY; WRITERS; TOOLS; PAIR;
D O I
10.1515/iral-2024-0132
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The present study provides an overview of syntactic complexity (SC) in second language (L2) writing research, employing the Latent Dirichlet Allocation (LDA) topic modeling approach. Analyzing 470 abstracts sourced from the WoS Core Collection (2005-2023) and Scopus (1978-2023), this investigation explores the broad spectrum of research themes in SC in L2 writing. By employing perplexity and coherence tests, the study identifies the optimal number of topics in the model as five: writing quality, proficiency, genres, SC development, and task-based conditions. The finding reveals that: (1) In cross-sectional studies, the interplay among writing quality, proficiency levels, and genres often involves studying one variable as reliant on the others. Additionally, there is a growing inclination toward regarding phrasal structures as more dependable complexity indicators. (2) Longitudinal investigations predominantly emphasize the evolution of SC, yet there's a scarcity of exploration in continuation writing tasks. (3) Regarding measurement metrics, a transition from broader SC evaluations to more intricate assessments is evident, though several studies lack thorough examinations, particularly in continuation tasks.
引用
收藏
页数:26
相关论文
共 50 条