Generalized Linear Latent Variable Models for Multivariate Count and Biomass Data in Ecology

Jenni Niku*, David I. Warton, Francis K.C. Hui, Sara Taskinen

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    53 Citations (Scopus)

    Abstract

    In this paper we consider generalized linear latent variable models that can handle overdispersed counts and continuous but non-negative data. Such data are common in ecological studies when modelling multivariate abundances or biomass. By extending the standard generalized linear modelling framework to include latent variables, we can account for any covariation between species not accounted for by the predictors, notably species interactions and correlations driven by missing covariates. We show how estimation and inference for the considered models can be performed efficiently using the Laplace approximation method and use simulations to study the finite-sample properties of the resulting estimates. In the overdispersed count data case, the Laplace-approximated estimates perform similarly to the estimates based on variational approximation method, which is another method that provides a closed form approximation of the likelihood. In the biomass data case, we show that ignoring the correlation between taxa affects the regression estimates unfavourably. To illustrate how our methods can be used in unconstrained ordination and in making inference on environmental variables, we apply them to two ecological datasets: abundances of bacterial species in three arctic locations in Europe and abundances of coral reef species in Indonesia. Supplementary materials accompanying this paper appear on-line.

    Original languageEnglish
    Pages (from-to)498-522
    Number of pages25
    JournalJournal of Agricultural, Biological, and Environmental Statistics
    Volume22
    Issue number4
    DOIs
    Publication statusPublished - 1 Dec 2017

    Fingerprint

    Dive into the research topics of 'Generalized Linear Latent Variable Models for Multivariate Count and Biomass Data in Ecology'. Together they form a unique fingerprint.

    Cite this