Correlation Analysis of Big Data to Support Machine Learning

被引:8
|
作者
Pandey, Rajiv [1 ]
Dhoundiyal, Manoj [2 ]
Kumar, Amrendra [2 ]
机构
[1] Amity Univ, Amity Inst Informat Technol, Lucknow, Uttar Pradesh, India
[2] Amity Univ, IT Dept, Lucknow, Uttar Pradesh, India
来源
2015 FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT2015) | 2015年
关键词
Quantitative Variables; R; Correlation analysis; Big Data; Linear Model; Linear Regression;
D O I
10.1109/CSNT.2015.32
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The large size and complexity of datasets in Big Data need specialized statistical tools for analysis and we use R for correlation analysis of our data set. This paper explores the correlation analysis through best fit linear regression of quantitative variables with help of the demonstration based on scatter plots and linear regression best fit line. The analysis demonstrated in this paper is scalable to Big Data in any other context where the quantitative variables are clearly delineated. R provides multiple techniques and inferences to statistical analysis of dataset, this paper however explores the correlation between quantitative variable establishing the extent of dependability between them using R functions. The correlation and best fit line functions of R i.e. cor() and abline( lmout) respectively are significantly explored.
引用
收藏
页码:996 / 999
页数:4
相关论文
共 50 条
  • [41] Machine Learning Challenges in Big Data Era
    Veganzones-Bodon, Miguel
    DYNA, 2019, 94 (05): : 478 - 479
  • [42] Machine Learning for Astronomical Big Data Processing
    Xu, Long
    Yan, Yihua
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [43] Machine learning in 'big data': handle with care
    Loring, Zak
    Mehrotra, Suchit
    Piccini, Jonathan P.
    EUROPACE, 2019, 21 (09): : 1284 - 1285
  • [44] Machine Learning and Computational Intelligence in Big Data
    Anagnostopoulos, Christos
    Kolomvatsos, Kostas
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 873 - 874
  • [45] Machine learning on big data: Opportunities and challenges
    Zhou, Lina
    Pan, Shimei
    Wang, Jianwu
    Vasilakos, Athanasios V.
    NEUROCOMPUTING, 2017, 237 : 350 - 361
  • [46] Big Data and Machine Learning Framework in Healthcare
    Dogaru, Delia Ioana
    Dumitrache, Ioan
    2019 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2019,
  • [47] Green Computing for Big Data and Machine Learning
    Barua, Hrishav Bakul
    Mondal, Kartick Chandra
    Khatua, Sunirmal
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 348 - 351
  • [48] PivotalR: A Package for Machine Learning on Big Data
    Qian, Hai
    R JOURNAL, 2014, 6 (01): : 57 - 67
  • [49] Machine learning for Big Data analytics in plants
    Ma, Chuang
    Zhang, Hao Helen
    Wang, Xiangfeng
    TRENDS IN PLANT SCIENCE, 2014, 19 (12) : 798 - 808
  • [50] Big data and machine learning for materials science
    Rodrigues J.F., Jr.
    Florea L.
    de Oliveira M.C.F.
    Diamond D.
    Oliveira O.N., Jr.
    Discover Materials, 1 (1):