The Effects of Sampling Bias and Model Complexity on the Predictive Performance of MaxEnt Species Distribution Models

被引:499
|
作者
Syfert, Mindy M. [1 ,2 ]
Smith, Matthew J. [2 ]
Coomes, David A. [1 ]
机构
[1] Univ Cambridge, Dept Plant Sci, Forest Ecol & Conservat Grp, Cambridge, England
[2] Microsoft Res, Sci Computat Lab, Computat Ecol & Environm Sci Grp, Cambridge, England
来源
PLOS ONE | 2013年 / 8卷 / 02期
关键词
PSEUDO-ABSENCES; TREE FERNS; CLIMATE; CONSERVATION; SIZE; EVAPOTRANSPIRATION; INFORMATION; PATTERNS; IMPACTS; ERRORS;
D O I
10.1371/journal.pone.0055158
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Species distribution models (SDMs) trained on presence-only data are frequently used in ecological research and conservation planning. However, users of SDM software are faced with a variety of options, and it is not always obvious how selecting one option over another will affect model performance. Working with MaxEnt software and with tree fern presence data from New Zealand, we assessed whether (a) choosing to correct for geographical sampling bias and (b) using complex environmental response curves have strong effects on goodness of fit. SDMs were trained on tree fern data, obtained from an online biodiversity data portal, with two sources that differed in size and geographical sampling bias: a small, widely-distributed set of herbarium specimens and a large, spatially clustered set of ecological survey records. We attempted to correct for geographical sampling bias by incorporating sampling bias grids in the SDMs, created from all georeferenced vascular plants in the datasets, and explored model complexity issues by fitting a wide variety of environmental response curves (known as "feature types" in MaxEnt). In each case, goodness of fit was assessed by comparing predicted range maps with tree fern presences and absences using an independent national dataset to validate the SDMs. We found that correcting for geographical sampling bias led to major improvements in goodness of fit, but did not entirely resolve the problem: predictions made with clustered ecological data were inferior to those made with the herbarium dataset, even after sampling bias correction. We also found that the choice of feature type had negligible effects on predictive performance, indicating that simple feature types may be sufficient once sampling bias is accounted for. Our study emphasizes the importance of reducing geographical sampling bias, where possible, in datasets used to train SDMs, and the effectiveness and essentialness of sampling bias correction within MaxEnt.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] A quantitative synthesis of the importance of variables used in MaxEnt species distribution models
    Bradie, Johanna
    Leung, Brian
    JOURNAL OF BIOGEOGRAPHY, 2017, 44 (06) : 1344 - 1361
  • [22] A comprehensive evaluation of predictive performance of 33 species distribution models at species and community levels
    Norberg, Anna
    Abrego, Nerea
    Blanchet, F. Guillaume
    Adler, Frederick R.
    Anderson, Barbara J.
    Anttila, Jani
    Araujo, Miguel B.
    Dallas, Tad
    Dunson, David
    Elith, Jane
    Foster, Scott D.
    Fox, Richard
    Franklin, Janet
    Godsoe, William
    Guisan, Antoine
    O'Hara, Bob
    Hill, Nicole A.
    Holt, Robert D.
    Hui, Francis K. C.
    Husby, Magne
    Kalas, John Atle
    Lehikoinen, Aleksi
    Luoto, Miska
    Mod, Heidi K.
    Newell, Graeme
    Renner, Ian
    Roslin, Tomas
    Soininen, Janne
    Thuiller, Wilfried
    Vanhatalo, Jarno
    Warton, David
    White, Matt
    Zimmermann, Niklaus E.
    Gravel, Dominique
    Ovaskainen, Otso
    ECOLOGICAL MONOGRAPHS, 2019, 89 (03)
  • [23] Novel methods to correct for observer and sampling bias in presence-only species distribution models
    Chauvier, Yohann
    Zimmermann, Niklaus E.
    Poggiato, Giovanni
    Bystrova, Daria
    Brun, Philipp
    Thuiller, Wilfried
    GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2021, 30 (11): : 2312 - 2325
  • [24] Spatial sampling bias in the Neotoma paleoecological archives affects species paleo-distribution models
    Inman, Richard
    Franklin, Janet
    Esque, Todd
    Nussear, Kenneth
    QUATERNARY SCIENCE REVIEWS, 2018, 198 : 115 - 125
  • [25] Sampling bias correction in species distribution models by quasi-linear Poisson point process
    Komori, Osamu
    Eguchi, Shinto
    Saigusa, Yusuke
    Kusumoto, Buntarou
    Kubota, Yasuhiro
    ECOLOGICAL INFORMATICS, 2020, 55
  • [26] Effects of sample size on the performance of species distribution models
    Wisz, M. S.
    Hijmans, R. J.
    Li, J.
    Peterson, A. T.
    Graham, C. H.
    Guisan, A.
    DIVERSITY AND DISTRIBUTIONS, 2008, 14 (05) : 763 - 773
  • [27] Controlling the effects of sampling bias in biodiversity models
    Oliveira, Ubirajara
    Soares-Filho, Britaldo
    Nunes, Felipe
    JOURNAL OF BIOGEOGRAPHY, 2024, 51 (09) : 1755 - 1766
  • [28] Performance tradeoffs in target-group bias correction for species distribution models
    Ranc, Nathan
    Santini, Luca
    Rondinini, Carlo
    Boitani, Luigi
    Poitevin, Francoise
    Angerbjorn, Anders
    Maiorano, Luigi
    ECOGRAPHY, 2017, 40 (09) : 1076 - 1087
  • [29] Incorporating model complexity and spatial sampling bias into ecological niche models of climate change risks faced by 90 California vertebrate species of concern
    Warren, Dan L.
    Wright, Amber N.
    Seifert, Stephanie N.
    Shaffer, H. Bradley
    DIVERSITY AND DISTRIBUTIONS, 2014, 20 (03) : 334 - 343
  • [30] Defining Predictive Probability Functions for Species Sampling Models
    Lee, Jaeyong
    Quintana, Fernando A.
    Mueller, Peter
    Trippa, Lorenzo
    STATISTICAL SCIENCE, 2013, 28 (02) : 209 - 222