Unsupervised machine learning to classify language dimensions to constitute the linguistic complexity of mathematical word problems

Cited by: 1
Authors
Bednorz, David [1]
Kleine, Michael [2]
Affiliations
[1] IPN Leibniz Inst Science & Math Educ, Dept Math Educ, Kiel, Germany
[2] Bielefeld Univ, Dept Math Educ, Bielefeld, Germany
Keywords
language dimensions; mathematical word problems; linguistic complexity; machine learning; unsupervised machine learning; ACADEMIC-LANGUAGE; MINORITY-STUDENTS; TEXT; LEARNERS; COMPREHENSION; PERFORMANCE; KNOWLEDGE
DOI
10.29333/iejme/12588
Chinese Library Classification
G40 [Education]
Discipline classification codes
040101; 120403
Abstract
This study examines the language dimensions of mathematical word problems and classifies word problems according to these dimensions using unsupervised machine learning (ML) techniques. Previous research suggests that language dimensions matter for mathematical word problems because they influence the problems' linguistic complexity. Depending on that complexity, students can encounter language obstacles when solving mathematical word problems. Much research in mathematics education analyzes linguistic complexity on the basis of theoretically constructed language dimensions. To date, however, it has been unclear what empirical relationships exist between the linguistic features of mathematical word problems. To address this issue, we used unsupervised ML techniques to reveal latent linguistic structures among 17 linguistic features of 342 mathematical word problems and to classify the problems. The models showed that three- and five-dimensional linguistic structures have the highest explanatory power. Additionally, the authors consider a four-dimensional solution. Mathematical word problems from the three-dimensional solution can be classified into two groups, and those from the three- and five-dimensional solutions into three groups. The findings reveal latent linguistic structures and groups that could have implications for the linguistic complexity of mathematical word problems and that differ from the theoretically derived language dimensions. The results therefore point to new design principles for interventions and materials for language education in mathematics learning and teaching.
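The abstract does not name the specific unsupervised techniques, so the following is only a minimal sketch of the general workflow it describes: reduce a 342 x 17 matrix of linguistic features to a few latent dimensions, then group the word problems in that reduced space. Principal component analysis and k-means are stand-ins chosen for illustration, not the authors' confirmed methods, and the feature matrix is random placeholder data rather than the study's data. All function names are hypothetical.

```python
import numpy as np


def standardize(X):
    """Z-score each feature column (zero mean, unit variance)."""
    mu = X.mean(axis=0)
    sd = X.std(axis=0)
    sd[sd == 0] = 1.0  # avoid division by zero for constant features
    return (X - mu) / sd


def pca(Z, n_components):
    """Project centered data onto its leading principal components via SVD."""
    U, S, _ = np.linalg.svd(Z, full_matrices=False)
    explained = (S ** 2) / np.sum(S ** 2)  # share of variance per component
    scores = U[:, :n_components] * S[:n_components]
    return scores, explained[:n_components]


def assign(points, centers):
    """Return the index of the nearest center for every point."""
    dist = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
    return dist.argmin(axis=1)


def kmeans(points, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm; an assumed stand-in for the grouping step."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign(points, centers)
        new_centers = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(points, centers), centers


# Placeholder data: 342 word problems x 17 linguistic features (random, NOT real).
rng = np.random.default_rng(42)
X = rng.normal(size=(342, 17))
Z = standardize(X)

# The abstract reports three groups for the three- and five-dimensional solutions.
for n_dim, k in [(3, 3), (5, 3)]:
    scores, explained = pca(Z, n_dim)
    labels, _ = kmeans(scores, k)
    print(n_dim, "dimensions:", "explained variance share =",
          round(float(explained.sum()), 3),
          "group sizes =", np.bincount(labels, minlength=k))
```

With real data, the number of dimensions and groups would be chosen by comparing explanatory power across candidate solutions, as the abstract indicates, rather than fixed in advance as here.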
Pages: 16