The paradox of big data

被引:2
|
作者
Smith, Gary [1 ]
机构
[1] Pomona Coll, Dept Econ, 425 N Coll Ave, Claremont, CA 91711 USA
关键词
Data mining; Big data; Holdout data; KNOWLEDGE DISCOVERY; REPLICABILITY; SCIENCE;
D O I
10.1007/s42452-020-2862-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background The data deluge seemingly makes it more likely that data mining will discover new, heretofore unknown relationships. Findings Monte Carlo simulations demonstrate the paradox of big data: the data deluge makes it more likely that the patterns and relationships discovered by data mining are spurious. Conclusion Models are more likely to be reliable if expert opinion is used in their specification, instead of viewing human expertise as an unhelpful constraint on knowledge discovery.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] The paradox of big data
    Gary Smith
    [J]. SN Applied Sciences, 2020, 2
  • [2] The Big Data Paradox in Clinical Practice
    Msaouel, Pavlos
    [J]. CANCER INVESTIGATION, 2022, 40 (07) : 567 - 576
  • [3] The big data paradox: More data, less confidence
    Sigmon, Paula Wiles
    [J]. IBM Data Management Magazine, 2013, (06):
  • [4] Zadig's paradox Big data and security
    Dupuy, Jean-Pierre
    [J]. ESPRIT, 2019, (12): : 115 - 122
  • [5] The paradox of asthma: neglect, burden and big data
    Stelmach, Rafael
    Cruz, Alvaro Augusto
    [J]. JORNAL BRASILEIRO DE PNEUMOLOGIA, 2017, 43 (03) : 159 - 160
  • [6] The Big Health Data-Intelligent Machine Paradox
    Miller, Douglas
    [J]. AMERICAN JOURNAL OF MEDICINE, 2018, 131 (11): : 1272 - 1275
  • [7] A Paradox in Rounding Errors Approximate Computing for Big Data
    Lin, Tsau-Young
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2567 - U998
  • [8] The Big Paradox
    de Hart, Alvaro Acosta
    [J]. REVISTA COLOMBIANA DE CANCEROLOGIA, 2009, 13 (01): : 5 - 7
  • [9] Revisiting the paradox of replication: Is the solution to the paradox big data style research or something else?
    Oh, In-Sue
    [J]. INDUSTRIAL AND ORGANIZATIONAL PSYCHOLOGY-PERSPECTIVES ON SCIENCE AND PRACTICE, 2022, 15 (04): : 533 - 536
  • [10] THE EFFICIENCY PARADOX What Big Data Can't Do
    Beckerman, Gal
    [J]. NEW YORK TIMES BOOK REVIEW, 2018, 123 (23): : 14 - 14