TY - JOUR
T1 - gllvm
T2 - Fast analysis of multivariate abundance data with generalized linear latent variable models in r
AU - Niku, Jenni
AU - Hui, Francis K.C.
AU - Taskinen, Sara
AU - Warton, David I.
N1 - Publisher Copyright:
© 2019 The Authors. Methods in Ecology and Evolution © 2019 British Ecological Society
PY - 2019/12/1
Y1 - 2019/12/1
N2 - There has been rapid development in tools for multivariate analysis based on fully specified statistical models or ‘joint models’. One approach attracting a lot of attention is generalized linear latent variable models (GLLVMs). However, software for fitting these models is typically slow and not practical for large datasets. The r package gllvm offers relatively fast methods to fit GLLVMs via maximum likelihood, along with tools for model checking, visualization and inference. The main advantage of the package over other implementations is speed, for example, being two orders of magnitude faster, and capable of handling thousands of response variables. These advances come from using variational approximations to simplify the likelihood expression to be maximized, automatic differentiation software for model-fitting (via the TMB package) and careful choice of initial values for parameters. Examples are used to illustrate the main features and functionality of the package, such as constrained or unconstrained ordination, including functional traits in ‘fourth corner’ models, and (if the number of environmental coefficients is not large) make inferences about environmental associations.
AB - There has been rapid development in tools for multivariate analysis based on fully specified statistical models or ‘joint models’. One approach attracting a lot of attention is generalized linear latent variable models (GLLVMs). However, software for fitting these models is typically slow and not practical for large datasets. The r package gllvm offers relatively fast methods to fit GLLVMs via maximum likelihood, along with tools for model checking, visualization and inference. The main advantage of the package over other implementations is speed, for example, being two orders of magnitude faster, and capable of handling thousands of response variables. These advances come from using variational approximations to simplify the likelihood expression to be maximized, automatic differentiation software for model-fitting (via the TMB package) and careful choice of initial values for parameters. Examples are used to illustrate the main features and functionality of the package, such as constrained or unconstrained ordination, including functional traits in ‘fourth corner’ models, and (if the number of environmental coefficients is not large) make inferences about environmental associations.
KW - abundance data
KW - generalized linear latent variable models
KW - high-dimensional data
KW - joint modelling
KW - maximum likelihood
KW - multivariate analysis
KW - ordination
KW - species interactions
UR - http://www.scopus.com/inward/record.url?scp=85074457404&partnerID=8YFLogxK
U2 - 10.1111/2041-210X.13303
DO - 10.1111/2041-210X.13303
M3 - Article
SN - 2041-210X
VL - 10
SP - 2173
EP - 2182
JO - Methods in Ecology and Evolution
JF - Methods in Ecology and Evolution
IS - 12
ER -