S, R, and Data Science

被引:0
|
作者
Chambers, John M. [1 ]
机构
[1] Stanford Univ, Line 390 Serra Mall, Line Stanford, CA 94305 USA
来源
R JOURNAL | 2020年 / 12卷 / 01期
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data science is increasingly important and challenging. It requires computational tools and programming environments that handle big data and difficult computations, while supporting creative, high-quality analysis. The R language and related software play a major role in computing for data science. R is featured in most programs for training in the field. R packages provide tools for a wide range of purposes and users. The description of a new technique, particularly from research in statistics, is frequently accompanied by an R package, greatly increasing the usefulness of the description. The history of R makes clear its connection to data science. R was consciously designed to replicate in open-source software the contents of the S software. S in turn was written by data analysis researchers at Bell Labs as part of the computing environment for research in data analysis and collaborations to apply that research, rather than as a separate project to create a programming language. The features of S and the design decisions made for it need to be understood in this broader context of supporting effective data analysis (which would now be called data science). These characteristics were all transferred to R and remain central to its effectiveness. Thus, R can be viewed as based historically on a domain-specific language for the domain of data science.
引用
收藏
页码:462 / 476
页数:15
相关论文
共 50 条
  • [1] S, R, and Data Science
    Chambers, John M.
    [J]. PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2020, 4
  • [2] Modern Data Science with R
    Cruze, Nathan
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2017, 33 (04) : 1087 - 1089
  • [3] Modern Data Science with R
    Liu, Shuangzhe
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2018, 86 (01) : 162 - 162
  • [4] Textual data science with R
    Sanchez, Brisa N.
    [J]. BIOMETRICS, 2019, 75 (04) : 1415 - 1416
  • [5] Modern data science with R
    Hoyer, Annika
    [J]. BIOMETRICAL JOURNAL, 2021, 63 (08) : 1748 - 1749
  • [6] R for health data science
    Greenacre, Michael
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2022, 185 : S765 - S766
  • [7] Textual Data Science with R
    Benoit, Kenneth R.
    [J]. AMERICAN STATISTICIAN, 2021, 75 (04): : 453 - 454
  • [8] Modern data science with R
    Shalabh
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2022, 185 (02) : 735 - 736
  • [9] Data science in education using R
    Michela, Esther
    Moore, Robert L.
    [J]. TECHTRENDS, 2021, 65 (03) : 402 - 403
  • [10] R - a Global Sensation in Data Science
    Caragea, Nicoleta
    Alexandru, Antoniade-Ciprian
    Dobre, Ana Maria
    [J]. ROMANIAN STATISTICAL REVIEW, 2014, (02) : 7 - 16