A Data Science Course for Undergraduates: Thinking With Data

被引:65
|
作者
Baumer, Ben [1 ]
机构
[1] Smith Coll, Stat & Data Sci, Northampton, MA 01063 USA
来源
AMERICAN STATISTICIAN | 2015年 / 69卷 / 04期
关键词
Computational statistics; Data science; Data visualization; Data wrangling; Machine learning; Statistical computing; Undergraduate curriculum; STATISTICS;
D O I
10.1080/00031305.2015.1081105
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Data science is an emerging interdisciplinary field that combines elements of mathematics, statistics, computer science, and knowledge in a particular application domain for the purpose of extracting meaningful information from the increasingly sophisticated array of data available in many settings. These data tend to be nontraditional, in the sense that they are often live, large, complex, and/or messy. A first course in statistics at the undergraduate level typically introduces students to a variety of techniques to analyze small, neat, and clean datasets. However, whether they pursue more formal training in statistics or not, many of these students will end up working with data that are considerably more complex, and will need facility with statistical computing techniques. More importantly, these students require a framework for thinking structurally about data. We describe an undergraduate course in a liberal arts environment that provides students with the tools necessary to apply data science. The course emphasizes modern, practical, and useful skills that cover the full data analysis spectrum, from asking an interesting question to acquiring, managing, manipulating, processing, querying, analyzing, and visualizing data, as well communicating findings in written, graphical, and oral forms. Supplementary materials for this article are available online.[Received June 2014. Revised July 2015.]
引用
收藏
页码:334 / 342
页数:9
相关论文
共 50 条
  • [1] Integrating Data Science into a General Education Information Technology Course An Approach to Developing Data Savvy Undergraduates
    Haynes, Malcolm
    Groen, Joshua
    Sturzinger, Eric
    Zhu, Danny
    Shafer, Justin
    Mcgee, Timothy
    [J]. PROCEEDINGS OF THE 20TH ANNUAL CONFERENCE ON INFORMATION TECHNOLOGY EDUCATION (SIGITE '19), 2019, : 183 - 188
  • [2] A First Course in Data Science
    Yan, Donghui
    Davis, Gary E.
    [J]. JOURNAL OF STATISTICS EDUCATION, 2019, 27 (02): : 99 - 109
  • [3] Human Centered Data Science Ungrading in an Introductory Data Science Course
    Theobold, Allison S.
    [J]. PROCEEDINGS OF THE 2023 CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, ITICSE 2023, VOL 1, 2023, : 327 - 333
  • [4] Computational Thinking in the Era of Data Science
    Mike, Koby
    Ragonis, Noa
    Rosenberg-Kima, Rinat B.
    Hazzan, Orit
    [J]. COMMUNICATIONS OF THE ACM, 2022, 65 (08) : 33 - 35
  • [5] Data Science Ethos Lifecycle: Interplay of Ethical Thinking and Data Science Practice
    Boenig-Liptsin, Margarita
    Tanweer, Anissa
    Edmundson, Ari
    [J]. JOURNAL OF STATISTICS AND DATA SCIENCE EDUCATION, 2022, 30 (03): : 228 - 240
  • [6] Data Assignments in Substantive Courses: Getting Undergraduates Excited and Interested in Data Science
    Atkeson, Lonna Rae
    [J]. PS-POLITICAL SCIENCE & POLITICS, 2022, 55 (01) : 206 - 209
  • [7] Using the WorldWide Telescope to Develop Science Data Literacy in STEM Undergraduates: A Conceptual Framework and Course Design
    Guo, Qing
    Chen, Yuqing
    Qiao, Cuilan
    Yu, Yunwei
    [J]. JOURNAL OF SCIENCE EDUCATION AND TECHNOLOGY, 2024,
  • [8] Thinking by classes in data science: the symbolic data analysis paradigm
    Diday, Edwin
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2016, 8 (05): : 172 - 205
  • [9] Anthropological thinking in data science education: Thinking within context
    Binah-Pollak, Avital
    Hazzan, Orit
    Mike, Koby
    Hacohen, Ronit Lis
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (11) : 14245 - 14260
  • [10] Why not teach a science ethics course to undergraduates
    MacGowan, Catherine E.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2009, 237