TY - JOUR
T1 - Compositional data in neuroscience
T2 - If you've got it, log it!
AU - Smith, Paul F.
AU - Renner, Ross M.
AU - Haslett, Stephen J.
N1 - Publisher Copyright:
© 2016 Elsevier B.V.
PY - 2016/9/15
Y1 - 2016/9/15
N2 - Background Compositional data sum to a constant value, for example, 100%. In neuroscience, such data are common, for example, when estimating the percentage of time spent for a behavioural response in a limited choice situation or a neurochemical within brain tissue. Compositional data have a distinct structure which complicates analysis and makes inappropriate standard statistical analyses such as general linear model analyses and principal components or factor analysis (whether Q-mode or R-mode), as a result of the correlation of the components, the dependence of the pairwise covariance on which other components are included in the analysis, and the bounded nature of the data. New method This problem has been recognised in disciplines such as geology and zoology for decades, where log ratio methods have been successfully applied. The isometric log ratio (ilr) method has some particular advantages. Comparison with existing method Classical statistical methods such as t-tests, ANOVAs, and multivariate analyses are invalid when applied to compositional data. Conclusions The compositional data analysis methods developed by statisticians and used by geologists and zoologists should be considered for compositional data analysis in neuroscience.
AB - Background Compositional data sum to a constant value, for example, 100%. In neuroscience, such data are common, for example, when estimating the percentage of time spent for a behavioural response in a limited choice situation or a neurochemical within brain tissue. Compositional data have a distinct structure which complicates analysis and makes inappropriate standard statistical analyses such as general linear model analyses and principal components or factor analysis (whether Q-mode or R-mode), as a result of the correlation of the components, the dependence of the pairwise covariance on which other components are included in the analysis, and the bounded nature of the data. New method This problem has been recognised in disciplines such as geology and zoology for decades, where log ratio methods have been successfully applied. The isometric log ratio (ilr) method has some particular advantages. Comparison with existing method Classical statistical methods such as t-tests, ANOVAs, and multivariate analyses are invalid when applied to compositional data. Conclusions The compositional data analysis methods developed by statisticians and used by geologists and zoologists should be considered for compositional data analysis in neuroscience.
KW - Additive log ratio
KW - Centred log ratio
KW - Compositional data
KW - Isometric log ratio
KW - Logit transformation
KW - Statistical dependence
UR - http://www.scopus.com/inward/record.url?scp=84979502157&partnerID=8YFLogxK
U2 - 10.1016/j.jneumeth.2016.07.008
DO - 10.1016/j.jneumeth.2016.07.008
M3 - Review article
SN - 0165-0270
VL - 271
SP - 154
EP - 159
JO - Journal of Neuroscience Methods
JF - Journal of Neuroscience Methods
ER -