Correlation Analysis of Big Data to Support Machine Learning

被引:8
|
作者
Pandey, Rajiv [1 ]
Dhoundiyal, Manoj [2 ]
Kumar, Amrendra [2 ]
机构
[1] Amity Univ, Amity Inst Informat Technol, Lucknow, Uttar Pradesh, India
[2] Amity Univ, IT Dept, Lucknow, Uttar Pradesh, India
来源
2015 FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT2015) | 2015年
关键词
Quantitative Variables; R; Correlation analysis; Big Data; Linear Model; Linear Regression;
D O I
10.1109/CSNT.2015.32
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The large size and complexity of datasets in Big Data need specialized statistical tools for analysis and we use R for correlation analysis of our data set. This paper explores the correlation analysis through best fit linear regression of quantitative variables with help of the demonstration based on scatter plots and linear regression best fit line. The analysis demonstrated in this paper is scalable to Big Data in any other context where the quantitative variables are clearly delineated. R provides multiple techniques and inferences to statistical analysis of dataset, this paper however explores the correlation between quantitative variable establishing the extent of dependability between them using R functions. The correlation and best fit line functions of R i.e. cor() and abline( lmout) respectively are significantly explored.
引用
收藏
页码:996 / 999
页数:4
相关论文
共 50 条
  • [31] PV Forecasting Using Support Vector Machine Learning in a Big Data Analytics Context
    Preda, Stefan
    Oprea, Simona-Vasilica
    Bara, Adela
    Belciu , Anda
    SYMMETRY-BASEL, 2018, 10 (12):
  • [32] Machine Learning and Big Data Analytics in Support of Fleet Safety During Severe Weather
    Spielman, Zachary
    Gertman, David I.
    Liu, Haoran
    Pray, Ira
    Traiteur, Justin
    Wold, Scott
    Wysmuller, Steven
    ADVANCES IN HUMAN ASPECTS OF TRANSPORTATION, 2018, 597 : 662 - 671
  • [33] Big data analytics and machine learning: A retrospective overview and bibliometric analysis
    Zhang, Justin Zuopeng
    Srivastava, Praveen Ranjan
    Sharma, Dheeraj
    Eachempati, Prajwal
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
  • [34] Big Data Analysis and Decision Support System Based on Deep Learning
    Yan Y.
    Yang H.
    Computer-Aided Design and Applications, 2024, 21 (S13): : 62 - 74
  • [35] Perspectives on Big Data, Cloud-Based Data Analysis and Machine Learning Systems
    Marozzo, Fabrizio
    Talia, Domenico
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (02)
  • [36] Performance Analysis of Machine Learning Algorithms for Big Data Classification: ML and Al-Based Algorithms for Big Data Analysis
    Punia, Sanjeev Kumar
    Kumar, Manoj
    Stephan, Thompson
    Deverajan, Ganesh Gopal
    Patan, Rizwan
    INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS, 2021, 12 (04) : 60 - 75
  • [37] Machine learning on big data for future computing
    Jeong, Young-Sik
    Hassan, Houcine
    Sangaiah, Arun Kumar
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (06): : 2925 - 2929
  • [38] Machine learning on big data for future computing
    Young-Sik Jeong
    Houcine Hassan
    Arun Kumar Sangaiah
    The Journal of Supercomputing, 2019, 75 : 2925 - 2929
  • [39] Machine Learning With Big Data: Challenges and Approaches
    L'Heureux, Alexandra
    Grolinger, Katarina
    Elyamany, Hany F.
    Capretz, Miriam A. M.
    IEEE ACCESS, 2017, 5 : 7776 - 7797
  • [40] Machine Learning Meets Big Spatial Data
    Sabek, Ibrahim
    Mokbel, Mohamed F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 1982 - 1985