Data Science and Prediction

Times cited: 447
Author
Dhar, Vasant [1]
Affiliation
[1] NYU, Stern Sch Business, Ctr Business Analyt, New York, NY 10012 USA
DOI
10.1145/2500499
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Vasant Dhar states that data science, or big data, is gaining significance because of its potential to automate the creation of actionable knowledge and to provide predictive models for use by both humans and computers. Data science implies a focus on data and on the systematic study of the organization, properties, and analysis of data and its role in inference, including confidence in the inference. It differs from statistics and other existing disciplines in several important ways. The emphasis on prediction is particularly strong in the machine learning and knowledge discovery in databases (KDD) communities. This emphasis on predictive accuracy implicitly favors 'simple' theories over more complex ones, because the accuracy of sparser models tends to be more robust on future data. The requirement of predictive accuracy on observations that will occur in the future is a key consideration in data science.
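The abstract's point about sparser models and future data can be illustrated with a small sketch. It is not taken from the article: the data-generating process, the two model choices, and all function names are assumptions made for illustration. A two-parameter least-squares line and an interpolating polynomial with one parameter per training point are both fitted to past observations, then scored on observations that come later in time.

```python
# Illustrative sketch only: synthetic data and hypothetical names, not from the article.

def fit_line(xs, ys):
    """Ordinary least-squares line (two parameters); returns a predictor."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return lambda x: slope * x + intercept


def fit_interpolant(xs, ys):
    """Lagrange polynomial through every training point (one parameter per point)."""
    def predict(x):
        total = 0.0
        for i, (xi, yi) in enumerate(zip(xs, ys)):
            basis = 1.0
            for j, xj in enumerate(xs):
                if j != i:
                    basis *= (x - xj) / (xi - xj)
            total += yi * basis
        return total
    return predict


def mse(predict, xs, ys):
    """Mean squared error of a predictor on the given observations."""
    return sum((predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def observe(t):
    """Assumed process: linear trend 2t + 1 plus alternating noise of +/- 0.5."""
    return 2 * t + 1 + (0.5 if t % 2 == 0 else -0.5)


train_x = list(range(10))        # past observations, t = 0..9
test_x = list(range(10, 13))     # future observations, t = 10..12
train_y = [observe(t) for t in train_x]
test_y = [observe(t) for t in test_x]

line = fit_line(train_x, train_y)
interp = fit_interpolant(train_x, train_y)

line_train = mse(line, train_x, train_y)
line_test = mse(line, test_x, test_y)
interp_train = mse(interp, train_x, train_y)
interp_test = mse(interp, test_x, test_y)

print("line:        train MSE", line_train, "future MSE", line_test)
print("interpolant: train MSE", interp_train, "future MSE", interp_test)
```

In this toy setting the interpolant matches the past perfectly because it also fits the noise, and its error on the future observations is far larger than that of the line. The example shows the tendency the abstract describes under these assumptions; it does not show that simpler models always win.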
Pages: 64-73
Page count: 10