Insights for Curriculum Development: Identifying Emerging Data Science Topics through Analysis of Q&A Communities

Cited by: 11
Authors
Karbasian, Habib [1]
Johri, Aditya [1]
Affiliations
[1] George Mason Univ, Fairfax, VA 22030 USA
Keywords
online Q&A platforms; text mining; topic modeling; StackExchange; Reddit; curriculum development
DOI
10.1145/3328778.3366817
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Updating curricula in new computer science domains is a critical challenge faced by many instructors and programs. In this paper we present an approach for identifying emerging topics and issues in Data Science by using Question and Answer (Q&A) sites as a resource. Q&A sites provide a useful online platform for discussion of topics, and through the sharing of information they become a valuable corpus of knowledge. We applied latent Dirichlet allocation (LDA), a statistical topic modeling technique, to analyze data science related threads from two popular Q&A communities, Stack Exchange and Reddit. We uncovered both important topics and useful examples that can be incorporated into teaching. In addition to technical topics, our analysis also identified topics related to professional development. We believe that approaches such as these are critical in order to update curricula and bridge the workplace-school divide in the teaching of newer topics such as data science. Given the pace of technical development and frequent changes in the field, this is an inventive and effective method to keep teaching up to date. We also discuss the limitations of this approach, whereby topics of importance such as data ethics are largely missing from online discussions.
Pages: 192-198
Page count: 7