TY - JOUR
T1 - A comprehensive evaluation of predictive performance of 33 species distribution models at species and community levels
AU - Norberg, Anna
AU - Abrego, Nerea
AU - Blanchet, F. Guillaume
AU - Adler, Frederick R.
AU - Anderson, Barbara J.
AU - Anttila, Jani
AU - Araújo, Miguel B.
AU - Dallas, Tad
AU - Dunson, David
AU - Elith, Jane
AU - Foster, Scott D.
AU - Fox, Richard
AU - Franklin, Janet
AU - Godsoe, William
AU - Guisan, Antoine
AU - O'Hara, Bob
AU - Hill, Nicole A.
AU - Holt, Robert D.
AU - Hui, Francis K.C.
AU - Husby, Magne
AU - Kålås, John Atle
AU - Lehikoinen, Aleksi
AU - Luoto, Miska
AU - Mod, Heidi K.
AU - Newell, Graeme
AU - Renner, Ian
AU - Roslin, Tomas
AU - Soininen, Janne
AU - Thuiller, Wilfried
AU - Vanhatalo, Jarno
AU - Warton, David
AU - White, Matt
AU - Zimmermann, Niklaus E.
AU - Gravel, Dominique
AU - Ovaskainen, Otso
N1 - Publisher Copyright:
© 2019 The Authors. Ecological Monographs published by Wiley Periodicals, Inc. on behalf of Ecological Society of America
PY - 2019/8
Y1 - 2019/8
N2 - A large array of species distribution model (SDM) approaches has been developed for explaining and predicting the occurrences of individual species or species assemblages. Given the wealth of existing models, it is unclear which models perform best for interpolation or extrapolation of existing data sets, particularly when one is concerned with species assemblages. We compared the predictive performance of 33 variants of 15 widely applied and recently emerged SDMs in the context of multispecies data, including both joint SDMs that model multiple species together, and stacked SDMs that model each species individually combining the predictions afterward. We offer a comprehensive evaluation of these SDM approaches by examining their performance in predicting withheld empirical validation data of different sizes representing five different taxonomic groups, and for prediction tasks related to both interpolation and extrapolation. We measure predictive performance by 12 measures of accuracy, discrimination power, calibration, and precision of predictions, for the biological levels of species occurrence, species richness, and community composition. Our results show large variation among the models in their predictive performance, especially for communities comprising many species that are rare. The results do not reveal any major trade-offs among measures of model performance; the same models performed generally well in terms of accuracy, discrimination, and calibration, and for the biological levels of individual species, species richness, and community composition. In contrast, the models that gave the most precise predictions were not well calibrated, suggesting that poorly performing models can make overconfident predictions. However, none of the models performed well for all prediction tasks. As a general strategy, we therefore propose that researchers fit a small set of models showing complementary performance, and then apply a cross-validation procedure involving separate data to establish which of these models performs best for the goal of the study.
AB - A large array of species distribution model (SDM) approaches has been developed for explaining and predicting the occurrences of individual species or species assemblages. Given the wealth of existing models, it is unclear which models perform best for interpolation or extrapolation of existing data sets, particularly when one is concerned with species assemblages. We compared the predictive performance of 33 variants of 15 widely applied and recently emerged SDMs in the context of multispecies data, including both joint SDMs that model multiple species together, and stacked SDMs that model each species individually combining the predictions afterward. We offer a comprehensive evaluation of these SDM approaches by examining their performance in predicting withheld empirical validation data of different sizes representing five different taxonomic groups, and for prediction tasks related to both interpolation and extrapolation. We measure predictive performance by 12 measures of accuracy, discrimination power, calibration, and precision of predictions, for the biological levels of species occurrence, species richness, and community composition. Our results show large variation among the models in their predictive performance, especially for communities comprising many species that are rare. The results do not reveal any major trade-offs among measures of model performance; the same models performed generally well in terms of accuracy, discrimination, and calibration, and for the biological levels of individual species, species richness, and community composition. In contrast, the models that gave the most precise predictions were not well calibrated, suggesting that poorly performing models can make overconfident predictions. However, none of the models performed well for all prediction tasks. As a general strategy, we therefore propose that researchers fit a small set of models showing complementary performance, and then apply a cross-validation procedure involving separate data to establish which of these models performs best for the goal of the study.
KW - community assembly
KW - community modeling
KW - environmental filtering
KW - joint species distribution model
KW - model performance
KW - prediction
KW - predictive power
KW - species interactions
KW - stacked species distribution model
UR - http://www.scopus.com/inward/record.url?scp=85067400878&partnerID=8YFLogxK
U2 - 10.1002/ecm.1370
DO - 10.1002/ecm.1370
M3 - Article
SN - 0012-9615
VL - 89
JO - Ecological Monographs
JF - Ecological Monographs
IS - 3
M1 - e01370
ER -