Data Science: A Comprehensive Overview

被引:192
|
作者
Cao, Longbing [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & IT, UTS Adv Analyt Inst, POB 123 Broadway, Sydney, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Big data; data analysis; data analytics; advanced analytics; big data analytics; data science; data engineering; data scientist; statistics; computing; informatics; data DNA; data innovation; data economy; data industry; data service; data profession; data education; BIG DATA; STATISTICS; ANALYTICS; FUTURE; GUIDE;
D O I
10.1145/3076253
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The 21st century has ushered in the age of big data and data economy, in which data DNA, which carries important knowledge, insights, and potential, has become an intrinsic constituent of all data-based organisms. An appropriate understanding of data DNA and its organisms relies on the new field of data science and its keystone, analytics. Although it is widely debated whether big data is only hype and buzz, and data science is still in a very early phase, significant challenges and opportunities are emerging or have been inspired by the research, innovation, business, profession, and education of data science. This article provides a comprehensive survey and tutorial of the fundamental aspects of data science: the evolution from data analysis to data science, the data science concepts, a big picture of the era of data science, the major challenges and directions in data innovation, the nature of data analytics, new industrialization and service opportunities in the data economy, the profession and competency of data education, and the future of data science. This article is the first in the field to draw a comprehensive big picture, in addition to offering rich observations, lessons, and thinking about data science and analytics.
引用
收藏
页数:42
相关论文
共 50 条
  • [1] The science of rural human settlements: a comprehensive overview
    Liu, Junyou
    Zheng, Bohong
    Tang, Haifang
    [J]. FRONTIERS IN ENVIRONMENTAL SCIENCE, 2023, 11
  • [2] Data science and AI in FinTech: an overview
    Longbing Cao
    Qiang Yang
    Philip S. Yu
    [J]. International Journal of Data Science and Analytics, 2021, 12 : 81 - 99
  • [3] Data science and AI in FinTech: an overview
    Cao, Longbing
    Yang, Qiang
    Yu, Philip S.
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2021, 12 (02) : 81 - 99
  • [4] Wildfire science is at a loss for comprehensive data
    Bowman, David
    [J]. NATURE, 2018, 560 (7716) : 7 - 7
  • [5] Wildfire science is at a loss for comprehensive data
    David Bowman
    [J]. Nature, 2018, 560 : 7 - 7
  • [6] An Overview of data science uses in bioimage informatics
    Chessel, Anatole
    [J]. METHODS, 2017, 115 : 110 - 118
  • [7] Informatics and data science: an overview for the information professional
    Cervone, H. Frank
    [J]. DIGITAL LIBRARY PERSPECTIVES, 2016, 32 (01) : 7 - 10
  • [8] MTI science, data products and ground data processing overview
    Szymanski, JJ
    Atkins, W
    Balick, L
    Borel, CC
    Clodius, WB
    Christensen, W
    Davis, AB
    Echohawk, JC
    Galbraith, A
    Hirsch, K
    Krone, JB
    Little, C
    Maclachlan, P
    Morrison, A
    Pollock, K
    Pope, P
    Novak, C
    Ramsey, K
    Riddle, E
    Rohde, C
    Roussel-Dupre, D
    Smith, BW
    Smith, K
    Starkovich, K
    Theiler, J
    Weber, PG
    [J]. ALGORITHMS FOR MULTISPECTRAL, HYPERSPECTRAL AND ULTRASPECTRAL IMAGERY VII, 2001, 4381 : 195 - 203
  • [9] A comprehensive overview of RDF for spatial and spatiotemporal data management
    Zhang, Fu
    Lu, Qingzhe
    Du, Zhenjun
    Chen, Xu
    Cao, Chunhong
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2021, 36
  • [10] Modern data science for analytical chemical data - A comprehensive review
    Szymanska, Ewa
    [J]. ANALYTICA CHIMICA ACTA, 2018, 1028 : 1 - 10