The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models

被引:499
|
作者
Syfert, Mindy M. [1 ,2 ]
Smith, Matthew J. [2 ]
Coomes, David A. [1 ]
机构
[1] Univ Cambridge, Dept Plant Sci, Forest Ecol & Conservat Grp, Cambridge, England
[2] Microsoft Res, Sci Computat Lab, Computat Ecol & Environm Sci Grp, Cambridge, England
来源
PLOS ONE | 2013年 / 8卷 / 02期
关键词
PSEUDO-ABSENCES; TREE FERNS; CLIMATE; CONSERVATION; SIZE; EVAPOTRANSPIRATION; INFORMATION; PATTERNS; IMPACTS; ERRORS;
D O I
10.1371/journal.pone.0055158
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Species distribution models (SDMs) trained on presence-only data are frequently used in ecological research and conservation planning. However, users of SDM software are faced with a variety of options, and it is not always obvious how selecting one option over another will affect model performance. Working with MaxEnt software and with tree fern presence data from New Zealand, we assessed whether (a) choosing to correct for geographical sampling bias and (b) using complex environmental response curves have strong effects on goodness of fit. SDMs were trained on tree fern data, obtained from an online biodiversity data portal, with two sources that differed in size and geographical sampling bias: a small, widely-distributed set of herbarium specimens and a large, spatially clustered set of ecological survey records. We attempted to correct for geographical sampling bias by incorporating sampling bias grids in the SDMs, created from all georeferenced vascular plants in the datasets, and explored model complexity issues by fitting a wide variety of environmental response curves (known as "feature types" in MaxEnt). In each case, goodness of fit was assessed by comparing predicted range maps with tree fern presences and absences using an independent national dataset to validate the SDMs. We found that correcting for geographical sampling bias led to major improvements in goodness of fit, but did not entirely resolve the problem: predictions made with clustered ecological data were inferior to those made with the herbarium dataset, even after sampling bias correction. We also found that the choice of feature type had negligible effects on predictive performance, indicating that simple feature types may be sufficient once sampling bias is accounted for. Our study emphasizes the importance of reducing geographical sampling bias, where possible, in datasets used to train SDMs, and the effectiveness and essentialness of sampling bias correction within MaxEnt.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Effects of simulated observation errors on the performance of species distribution models
    Fernandes, Rui F.
    Scherrer, Daniel
    Guisan, Antoine
    DIVERSITY AND DISTRIBUTIONS, 2019, 25 (03) : 400 - 413
  • [42] Effects of species and habitat positional errors on the performance and interpretation of species distribution models
    Osborne, Patrick E.
    Leitao, Pedro J.
    DIVERSITY AND DISTRIBUTIONS, 2009, 15 (04) : 671 - 681
  • [43] Habitat effects and sampling bias on Phanerozoic reef distribution
    Kiessling, W
    FACIES, 2005, 51 (1-4) : 24 - 32
  • [44] Bias in butterfly distribution maps: the effects of sampling effort
    Dennis, Roger L. H.
    Sparks, Tim H.
    Hardy, Peter B.
    JOURNAL OF INSECT CONSERVATION, 1999, 3 (01) : 33 - 42
  • [45] Bias in Butterfly Distribution Maps: The Effects of Sampling Effort
    Roger L.H. Dennis
    Tim H. Sparks
    Peter B. Hardy
    Journal of Insect Conservation, 1999, 3 : 33 - 42
  • [46] Habitat effects and sampling bias on Phanerozoic reef distribution
    Wolfgang Kiessling
    Facies, 2005, 51 : 24 - 32
  • [47] Evaluating the predictive performance of stacked species distribution models applied to plant species selection in ecological restoration
    Gaston, Aitor
    Garcia-Vinas, Juan I.
    ECOLOGICAL MODELLING, 2013, 263 : 103 - 108
  • [48] To mix or not to mix: comparing the predictive performance of mixture models vs. separate species distribution models
    Hui, Francis K. C.
    Warton, David I.
    Foster, Scott D.
    Dunstan, Piers K.
    ECOLOGY, 2013, 94 (09) : 1913 - 1919
  • [49] Testing whether ensemble modelling is advantageous for maximising predictive performance of species distribution models
    Hao, Tianxiao
    Elith, Jane
    Lahoz-Monfort, Jose J.
    Guillera-Arroita, Gurutzeta
    ECOGRAPHY, 2020, 43 (04) : 549 - 558
  • [50] Assessing the effect of prevalence on the predictive performance of species distribution models using simulated data
    Santika, Truly
    GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2011, 20 (01): : 181 - 192