Application of machine-learning algorithms to predict calving difficulty in Holstein dairy cattle

被引:1
|
作者
Avizheh, Mahdieh [1 ]
Dadpasand, Mohammad [1 ]
Dehnavi, Elena [2 ]
Keshavarzi, Hamideh [3 ]
机构
[1] Shiraz Univ, Sch Agr, Dept Anim Sci, Shiraz, Iran
[2] AGBU, Armidale, NSW 2351, Australia
[3] Commonwealth Sci & Ind Res Org CSIRO, Agr & Food, Armidale, NSW 2350, Australia
关键词
cost-sensitive technique; dairy cow; difficult calving; down-sampling; herd-cow factors; imbalanced dataset; machine-learning algorithms; predictive models; RISK-FACTORS; FRIESIAN HEIFERS; BIRTH-WEIGHT; DYSTOCIA; COWS; PREVALENCE; STILLBIRTH; INSEMINATION; ASSISTANCE; MORTALITY;
D O I
10.1071/AN22461
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Context. An ability to predict calving difficulty could help farmers make better farm-management decisions, thereby improving dairy farm profitability and welfare. Aims. This study aimed to predict calving difficulty in Iranian dairy herds using machine-learning (ML) algorithms and to evaluate sampling methods to deal with imbalanced datasets. Methods. For this purpose, the history records of cows that calved between 2011 and 2021 on two commercial dairy farms were used. Using WEKA software, four commonly used ML algorithms, namely naive Bayes, random forest, decision trees, and logistic regression, were applied to the dataset. The calving difficulty was considered as a binary trait with 0, normal or unassisted calving, and 1, difficult calving, i.e. receiving any help during parturition from farm personnel involvement to surgical intervention. The average rate of difficult calving was 18.7%, representing an imbalanced dataset. Therefore, down-sampling and cost-sensitive techniques were implemented to tackle this problem. Different models were evaluated on the basis of F-measure and the area under the curve. Key results. The results showed that sampling techniques improved the predictive model (P = 0.07, and P = 0.03, for down-sampling and cost-sensitive techniques respectively). F-measure ranged from 0.387 (decision tree) to 0.426 (logistic regression) with the balanced dataset. However, when applied to the original imbalanced dataset, naive Bayes had the best performance of up to 0.388 in terms of F-measure. Conclusions. Overall, sampling techniques improved the prediction model compared with original imbalanced dataset. Although prediction models performed worse than expected (due to an imbalanced dataset, and missing values), the implementation of ML algorithms can still lead to an effective method of predicting calving difficulty. Implications. This research indicated the capability of ML algorithms to predict the incidence of calving difficulty within a balanced dataset, but that more explanatory variables (e.g. genetic information) are required to improve the prediction based on an unbalanced original dataset.
引用
收藏
页码:1095 / 1104
页数:10
相关论文
共 50 条
  • [41] Application of Machine-Learning Models to Predict Tacrolimus Stable Dose in Renal Transplant Recipients
    Tang, Jie
    Liu, Rong
    Zhang, Yue-Li
    Liu, Mou-Ze
    Hu, Yong-Fang
    Shao, Ming-Jie
    Zhu, Li-Jun
    Xin, Hua-Wen
    Feng, Gui-Wen
    Shang, Wen-Jun
    Meng, Xiang-Guang
    Zhang, Li-Rong
    Ming, Ying-Zi
    Zhang, Wei
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [42] Application of four machine-learning methods to predict short-horizon wind energy
    Bouabdallaoui, Doha
    Haidi, Touria
    Elmariami, Faissal
    Derri, Mounir
    Mellouli, El Mehdi
    [J]. GLOBAL ENERGY INTERCONNECTION-CHINA, 2023, 6 (06): : 726 - 737
  • [43] Application of four machine-learning methods to predict short-horizon wind energy
    Bouabdallaoui D.
    Haidi T.
    Elmariami F.
    Derri M.
    Mellouli E.M.
    [J]. Global Energy Interconnection, 2023, 6 (06) : 726 - 737
  • [44] Application of Machine-Learning Models to Predict Tacrolimus Stable Dose in Renal Transplant Recipients
    Jie Tang
    Rong Liu
    Yue-Li Zhang
    Mou-Ze Liu
    Yong-Fang Hu
    Ming-Jie Shao
    Li-Jun Zhu
    Hua-Wen Xin
    Gui-Wen Feng
    Wen-Jun Shang
    Xiang-Guang Meng
    Li-Rong Zhang
    Ying-Zi Ming
    Wei Zhang
    [J]. Scientific Reports, 7
  • [45] Application of four machine-learning methods to predict short-horizon wind energy
    Doha Bouabdallaoui
    Touria Haidi
    Faissal Elmariami
    Mounir Derri
    El Mehdi Mellouli
    [J]. Global Energy Interconnection, 2023, 6 (06) : 726 - 737
  • [46] Machine-learning regression in evolutionary algorithms and image registration
    Spanakis, Constantinos
    Mathioudakis, Emmanouil
    Kampanis, Nikos
    Tsiknakis, Manolis
    Marias, Kostas
    [J]. IET IMAGE PROCESSING, 2019, 13 (05) : 843 - 849
  • [47] Predicting Perovskite Performance with Multiple Machine-Learning Algorithms
    Li, Ruoyu
    Deng, Qin
    Tian, Dong
    Zhu, Daoye
    Lin, Bin
    [J]. CRYSTALS, 2021, 11 (07)
  • [48] An integrated machine-learning model to predict nucleosome architecture
    Sala, Alba
    Labrador, Mireia
    Buitrago, Diana
    De Jorge, Pau
    Battistini, Federica
    Heath, Isabelle Brun
    Orozco, Modesto
    [J]. NUCLEIC ACIDS RESEARCH, 2024, 52 (17) : 10132 - 10143
  • [49] Machine-learning models predict produced water properties
    Procyk, Alex
    [J]. OIL & GAS JOURNAL, 2023, 121 (10) : 48 - 53
  • [50] Adapting machine-learning algorithms to design gene circuits
    Tom W. Hiscock
    [J]. BMC Bioinformatics, 20