Identifying Mixture Components From Large-Scale Keystroke Log Data

被引:3
|
作者
Li, Tingxuan [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Educ, Shanghai, Peoples R China
来源
FRONTIERS IN PSYCHOLOGY | 2021年 / 12卷
关键词
computer-based assessment; keystroke log data; cognitive; writing; finite mixture model (FMM); WRITING INSTRUCTION; MAXIMUM-LIKELIHOOD; WORKING-MEMORY; PAUSES; METAANALYSIS; STUDENTS; BEHAVIOR;
D O I
10.3389/fpsyg.2021.628660
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
In a computer-based writing assessment, massive keystroke log data can provide real-time information on students' writing behaviors during text production. This research aims to quantify the writing process from a cognitive standpoint. The hope is that the quantification may contribute to establish a writing profile for each student to represent a student's learning status. Such profiles may contain richer information to influence the ongoing and future writing instruction. Educational Testing Service (ETS) administered the assessment and collected a large sample of student essays. The sample used in this study contains nearly 1,000 essays collected across 24 schools in 18 U.S. states. Using a mixture of lognormal models, the main findings show that the estimated parameters on pause data are meaningful and interpretable with low-to-high cognitive processes. These findings are also consistent across two writing genres. Moreover, the mixture model captures aspects of the writing process not examined otherwise: (1) for some students, the model comparison criterion favored the three-component model, whereas for other students, the criterion favored the four-component model; and (2) students with low human scores have a wide range of values on the mixing proportion parameter, whereas students with higher scores do not possess this pattern.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Towards Automated Log Parsing for Large-Scale Log Data Analysis
    He, Pinjia
    Zhu, Jieming
    He, Shilin
    Li, Jian
    Lyu, Michael R.
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2018, 15 (06) : 931 - 944
  • [2] Identifying Beneficial Learning Behaviors from Large-Scale Interaction Data
    Cristus, Miruna
    Tackstrom, Oscar
    Tan, Lingyi
    Pacifici, Valentino
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT II, 2020, 12164 : 371 - 375
  • [3] Queries over Large-scale Log Data of Hybrid Granularities
    Zhao, Gansen
    Zhuang, Xutian
    Wang, Xinming
    Nie, Ruihua
    Liao, Zhirui
    Lin, Chengchuang
    Li, Zhenyu
    [J]. 2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2016, : 240 - 246
  • [4] Identifying Recent Adaptations in Large-Scale Genomic Data
    Grossman, Sharon R.
    Andersen, Kristian G.
    Shlyakhter, Ilya
    Tabrizi, Shervin
    Winnicki, Sarah
    Yen, Angela
    Park, Daniel J.
    Griesemer, Dustin
    Karlsson, Elinor K.
    Wong, Sunny H.
    Cabili, Moran
    Adegbola, Richard A.
    Bamezai, Rameshwar N. K.
    Hill, Adrian V. S.
    Vannberg, Fredrik O.
    Rinn, John L.
    Lander, Eric S.
    Schaffner, Stephen F.
    Sabeti, Pardis C.
    [J]. CELL, 2013, 152 (04) : 703 - 713
  • [5] Identifying Skype Traffic in a Large-Scale Flow Data Repository
    Trammell, Brian
    Boschi, Elisa
    Procissi, Gregorio
    Callegari, Christian
    Dorfinger, Peter
    Schatzmann, Dominik
    [J]. TRAFFIC MONITORING AND ANALYSIS: THIRD INTERNATIONAL WORKSHOP, TMA 2011, 2011, 6613 : 72 - +
  • [6] Identifying nearest neighbors in a large-scale incident data archive
    Qi, Y
    Smith, BL
    [J]. INFORMATION SYSTEMS AND TECHNOLOGY, 2004, (1879): : 89 - 98
  • [7] Growth mixture modeling: Application to reading achievement data from a large-scale assessment
    Bilir, Mustafa Kuzey
    Binici, Salih
    Kamata, Akihito
    [J]. MEASUREMENT AND EVALUATION IN COUNSELING AND DEVELOPMENT, 2008, 41 (02) : 104 - 119
  • [8] Estimating Variance Components from Sparse Data Matrices in Large-Scale Educational Assessments
    DeMars, Christine
    [J]. APPLIED MEASUREMENT IN EDUCATION, 2015, 28 (01) : 1 - 13
  • [9] Scalable Algorithms for Bayesian Inference of Large-Scale Models from Large-Scale Data
    Ghattas, Omar
    Isaac, Tobin
    Petra, Noemi
    Stadler, Georg
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 3 - 6
  • [10] Identifying large-scale patterns of unpredictability and response to insolation in atmospheric data
    Fernando Arizmendi
    Marcelo Barreiro
    Cristina Masoller
    [J]. Scientific Reports, 7