Evaluating Content-Related Validity Evidence Using a Text-Based Machine Learning Procedure

被引:8
|
作者
Anderson, Daniel [1 ]
Rowley, Brock [1 ]
Stegenga, Sondra [2 ]
Irvin, P. Shawn [1 ]
Rosenberg, Joshua M. [3 ]
机构
[1] Univ Oregon, Behav Res & Teaching, Eugene, OR 97403 USA
[2] Univ Utah, Special Educ, Salt Lake City, UT 84112 USA
[3] Univ Tennessee, Theory & Practice Teacher Educ, Knoxville, TN 37996 USA
关键词
machine learning; text-mining; textual congruence; validity;
D O I
10.1111/emip.12314
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Validity evidence based on test content is critical to meaningful interpretation of test scores. Within high-stakes testing and accountability frameworks, content-related validity evidence is typically gathered via alignment studies, with panels of experts providing qualitative judgments on the degree to which test items align with the representative content standards. Various summary statistics are then calculated (e.g., categorical concurrence, balance of representation) to aid in decision-making. In this paper, we propose an alternative approach for gathering content-related validity evidence that capitalizes on the overlap in vocabulary used in test items and the corresponding content standards, which we define as textual congruence. We use a text-based, machine learning model, specifically topic modeling, to identify clusters of related content within the standards. This model then serves as the basis from which items are evaluated. We illustrate our method by building a model from the Next Generation Science Standards, with textual congruence evaluated against items within the Oregon statewide alternate assessment. We discuss the utility of this approach as a source of triangulating and diagnostic information and show how visualizations can be used to evaluate the overall coverage of the content standards across the test items.
引用
收藏
页码:53 / 64
页数:12
相关论文
共 50 条
  • [1] Stock Market Prediction using Text-based Machine Learning
    Jordan, Tristan
    Elgazzar, Heba
    [J]. 2020 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS 2020), 2020, : 322 - 326
  • [2] Evaluating text-based similarity measures for musical content
    Garay, A
    [J]. SECOND INTERNATIONAL CONFERENCE ON WEB DELIVERING OF MUSIC, PROCEEDINGS, 2002, : 49 - 55
  • [3] Gathering content-related validity evidence for the School Leaders Licensure Assessment
    Reese C.M.
    Tannenbaum R.J.
    [J]. Journal of Personnel Evaluation in Education, 1999, 13 (3): : 263 - 282
  • [4] Text-based paper-level classification procedure for non-traditional sciences using a machine learning approach
    Daniela Moctezuma
    Carlos López-Vázquez
    Lucas Lopes
    Norton Trevisan
    José Pérez
    [J]. Knowledge and Information Systems, 2024, 66 : 1503 - 1520
  • [5] Text-based paper-level classification procedure for non-traditional sciences using a machine learning approach
    Moctezuma, Daniela
    Lopez-Vazquez, Carlos
    Lopes, Lucas
    Trevisan, Norton
    Perez, Jose
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (02) : 1503 - 1520
  • [6] Machine learning in bank merger prediction: A text-based approach 
    Katsafados, Apostolos G.
    Leledakis, George N.
    Pyrgiotakis, Emmanouil G.
    Androutsopoulos, Ion
    Fergadiotis, Manos
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 312 (02) : 783 - 797
  • [7] Stemming Text-based Web Page Classification using Machine Learning Algorithms: A Comparison
    Razali, Ansari
    Daud, Salwani Mohd
    Zin, Nor Azan Mat
    Shahidi, Faezehsadat
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 570 - 576
  • [8] An empirical study of text-based machine learning models for vulnerability detection
    Kollin Napier
    Tanmay Bhowmik
    Shaowei Wang
    [J]. Empirical Software Engineering, 2023, 28
  • [9] An empirical study of text-based machine learning models for vulnerability detection
    Napier, Kollin
    Bhowmik, Tanmay
    Wang, Shaowei
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (02)
  • [10] Machine Learning Technique for Fake News Detection Using Text-Based Word Vector Representation
    Gaurav, Akshat
    Gupta, B. B.
    Hsu, Ching-Hsien
    Castiglione, Arcangelo
    Chui, Kwok Tai
    [J]. COMPUTATIONAL DATA AND SOCIAL NETWORKS, CSONET 2021, 2021, 13116 : 340 - 348