Improved data clustering methods and integrated A-FP algorithm for crop yield prediction

被引:7
|
作者
Vani, P. Suvitha [1 ]
Rathi, S. [2 ]
机构
[1] Sri Shakthi Inst Engn & Technol, Dept Comp Sci & Engn, Coimbatore 641062, Tamil Nadu, India
[2] Govt Coll Technol, Dept Comp Sci & Engn, Coimbatore 641013, Tamil Nadu, India
关键词
Big data analysis; Crop yield prediction; Data preprocessing; Sparse and densely data clustering; PLMDC;
D O I
10.1007/s10619-021-07350-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data analysis is the process of gathering, managing and analyzing a large volume of data to determine patterns and other valuable information. Agricultural data can be a significant area of big data applications. The big data analysis for agricultural data can comprise the various data from both internal systems and outside sources like weather data, soil data, and crop data. Though big data analysis has led to advances in different industries, it has not yet been extensively used in agriculture. Several machine learning techniques are developed to cluster the data for the prediction of crop yield. However, it has low accuracy and low quality of the clustering. To improve clustering accuracy with less complexity, a Proximity Likelihood Maximization Data Clustering (PLMDC) technique is developed for both sparse and densely distributed agricultural big data to enhance the accuracy of crop yield prediction for farmers. In this process, unnecessary data is cleansed from the sparse and dense based agricultural data using a logical linear regression model. After that, the presented clustering method is executed depending on the similarity and weight-based Manhattan distance. The genetic algorithm (GA) is applied with a good fitness function to select the features from the clustered data. Finally, the decision support system is computed by the A-FP growth algorithm to predict the crop yields according to their selected features such as weather features and crop features. The results of the proposed PLMDC technique are better in case of clustering accuracy of both spare and densely distributed data with minimum time and space complexity. Based on the results observations, the PLMDC technique is more efficient than the existing methods.
引用
收藏
页码:117 / 131
页数:15
相关论文
共 50 条
  • [31] Cotton Crop Yield Prediction using Data Mining Technique
    Patel, Amiksha Ashok
    Kathiriya, Dhaval
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (01) : 725 - 731
  • [32] A Study on Various Data Mining Techniques for Crop Yield Prediction
    Gandge, Yogesh
    Sandhya
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 420 - 423
  • [33] Data analytics in ensemble learning for effective crop yield prediction
    Tripathi, Deeksha
    Biswas, Saroj K.
    Baruah, Barnana
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (03):
  • [34] Bitter Melon Crop Yield Prediction using Machine Learning Algorithm
    Villanueva, Marizel B.
    Salenga, Ma. Louella M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (03) : 1 - 6
  • [35] Application of improved ant algorithm in spatial data clustering
    Hazhong Qian
    Fang Wu
    Lei Ge
    Bo Chen
    Huilian Wang
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 798 - 801
  • [36] An Improved Mean Imputation Clustering Algorithm for Incomplete Data
    Shi, Hong
    Wang, Pingxin
    Yang, Xin
    Yu, Hualong
    NEURAL PROCESSING LETTERS, 2022, 54 (05) : 3537 - 3550
  • [37] An Improved Mean Imputation Clustering Algorithm for Incomplete Data
    Hong Shi
    Pingxin Wang
    Xin Yang
    Hualong Yu
    Neural Processing Letters, 2022, 54 : 3537 - 3550
  • [38] Data Clustering Using Improved Fire Fly Algorithm
    Sadeghzadeh, Mehdi
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 801 - 809
  • [39] Spatial data clustering using an improved Evolutionary Algorithm
    Tang, Yiping
    Long, Wenxing
    Hu, Chuan
    SECOND INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING, 2010, 7546
  • [40] Improved Black Hole optimization algorithm for data clustering
    Deeb, Hasan
    Sarangi, Archana
    Mishra, Debahuti
    Sarangi, Shubhendu Kumar
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5020 - 5029