The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models

被引:499
|
作者
Syfert, Mindy M. [1 ,2 ]
Smith, Matthew J. [2 ]
Coomes, David A. [1 ]
机构
[1] Univ Cambridge, Dept Plant Sci, Forest Ecol & Conservat Grp, Cambridge, England
[2] Microsoft Res, Sci Computat Lab, Computat Ecol & Environm Sci Grp, Cambridge, England
来源
PLOS ONE | 2013年 / 8卷 / 02期
关键词
PSEUDO-ABSENCES; TREE FERNS; CLIMATE; CONSERVATION; SIZE; EVAPOTRANSPIRATION; INFORMATION; PATTERNS; IMPACTS; ERRORS;
D O I
10.1371/journal.pone.0055158
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Species distribution models (SDMs) trained on presence-only data are frequently used in ecological research and conservation planning. However, users of SDM software are faced with a variety of options, and it is not always obvious how selecting one option over another will affect model performance. Working with MaxEnt software and with tree fern presence data from New Zealand, we assessed whether (a) choosing to correct for geographical sampling bias and (b) using complex environmental response curves have strong effects on goodness of fit. SDMs were trained on tree fern data, obtained from an online biodiversity data portal, with two sources that differed in size and geographical sampling bias: a small, widely-distributed set of herbarium specimens and a large, spatially clustered set of ecological survey records. We attempted to correct for geographical sampling bias by incorporating sampling bias grids in the SDMs, created from all georeferenced vascular plants in the datasets, and explored model complexity issues by fitting a wide variety of environmental response curves (known as "feature types" in MaxEnt). In each case, goodness of fit was assessed by comparing predicted range maps with tree fern presences and absences using an independent national dataset to validate the SDMs. We found that correcting for geographical sampling bias led to major improvements in goodness of fit, but did not entirely resolve the problem: predictions made with clustered ecological data were inferior to those made with the herbarium dataset, even after sampling bias correction. We also found that the choice of feature type had negligible effects on predictive performance, indicating that simple feature types may be sufficient once sampling bias is accounted for. Our study emphasizes the importance of reducing geographical sampling bias, where possible, in datasets used to train SDMs, and the effectiveness and essentialness of sampling bias correction within MaxEnt.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Choice of predictors and complexity for ecosystem distribution models: effects on performance and transferability
    Naas, Adam Eindride
    Keetz, Lasse Torben
    Halvorsen, Rune
    Horvath, Peter
    Mienna, Ida Marielle
    Simensen, Trond
    Bryn, Anders
    ECOGRAPHY, 2024, 2024 (08)
  • [32] Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria
    Warren, Dan L.
    Seifert, Stephanie N.
    ECOLOGICAL APPLICATIONS, 2011, 21 (02) : 335 - 342
  • [33] Accounting for preferential sampling in species distribution models
    Grazia Pennino, Maria
    Paradinas, Iosu
    Illian, Janine B.
    Munoz, Facundo
    Maria Bellido, Jose
    Lopez-Quilez, Antonio
    Conesa, David
    ECOLOGY AND EVOLUTION, 2019, 9 (01): : 653 - 663
  • [34] The effects of model and data complexity on predictions from species distributions models
    Garcia-Callejas, David
    Araujo, Miguel B.
    ECOLOGICAL MODELLING, 2016, 326 : 4 - 12
  • [35] Testing the predictive performance of distribution models
    Bahn, Volker
    McGill, Brian J.
    OIKOS, 2013, 122 (03) : 321 - 331
  • [36] SPATIAL SCALE EFFECTS OF SAMPLING ON THE INTERPOLATION OF SPECIES DISTRIBUTION MODELS IN THE SOUTHWESTERN AMAZON
    de Melo Figueiredo, Symone Maria
    Venticinque, Eduardo Martins
    Figueiredo, Evandro Orfano
    REVISTA ARVORE, 2016, 40 (04): : 617 - 625
  • [37] Optimising occurrence data in species distribution models: sample size, positional uncertainty, and sampling bias matter
    Moudry, Vitezslav
    Bazzichetto, Manuele
    Remelgado, Ruben
    Devillers, Rodolphe
    Lenoir, Jonathan
    Mateo, Ruben G.
    Lembrechts, Jonas J.
    Sillero, Neftali
    Lecours, Vincent
    Cord, Anna F.
    Bartak, Vojtech
    Balej, Petr
    Rocchini, Duccio
    Torresani, Michele
    Arenas-Castro, Salvador
    Man, Matej
    Prajzlerova, Dominika
    Gdulova, Katerina
    Prosek, Jiri
    Marchetto, Elisa
    Zarzo-Arias, Alejandra
    Gabor, Lukas
    Leroy, Francois
    Martini, Matilde
    Malavasi, Marco
    Cazzolla Gatti, Roberto
    Wild, Jan
    Simova, Petra
    ECOGRAPHY, 2024, 2024 (12)
  • [38] Effective strategies for correcting spatial sampling bias in species distribution models without independent test data
    Baker, David J.
    Maclean, Ilya M. D.
    Gaston, Kevin J.
    DIVERSITY AND DISTRIBUTIONS, 2024, 30 (03)
  • [39] Equivalence of MAXENT and Poisson Point Process Models for Species Distribution Modeling in Ecology
    Renner, Ian W.
    Warton, David I.
    BIOMETRICS, 2013, 69 (01) : 274 - 281
  • [40] Predicting the Distribution of the Invasive Species Leptocybe invasa: Combining MaxEnt and Geodetector Models
    Zhang, Hua
    Song, Jinyue
    Zhao, Haoxiang
    Li, Ming
    Han, Wuhong
    INSECTS, 2021, 12 (02) : 1 - 18