TY - JOUR
T1 - Inferring differential leukocyte activity from antibody microarrays using a latent variable model.
AU - Ho, Joshua W.K.
AU - Koundinya, Rajeev
AU - Caetano, Tibério S.
AU - dos Remedios, Cristobal G.
AU - Charleston, Michael A.
PY - 2008
Y1 - 2008
N2 - Recent development of cluster of differentiation (CD) antibody arrays has enabled expression levels of many leukocyte surface CD antigens to be monitored simultaneously. Such membrane-proteome surveys have provided a powerful means to detect changes in leukocyte activity in various human diseases, such as cancer and cardiovascular diseases. The challenge is to devise a computational method to infer differential leukocyte activity among multiple biological states based on antigen expression profiles. Standard DNA microarray analysis methods cannot accurately infer differential leukocyte activity because they often fail to take the cell-to-antigen relationships into account. Here we present a novel latent variable model (LVM) approach to tackle this problem. The idea is to model each cell type as a latent variable, and represent the class-to-cell and cell-to-antigen relationships as a LVM. Once the parameters of the LVM are learned from the data, differentially active leukocytes can be easily identified from the model. We describe the model formulation and assumptions which lead to an efficient expectation-maximization algorithm. Our LVM method was applied to re-analyze two cardiovascular disease datasets. We show that our results match existing biological knowledge better than other methods such as gene set enrichment analysis. Furthermore, we discuss how our approach can be extended to become a general framework for gene set analysis for DNA microarrays.
AB - Recent development of cluster of differentiation (CD) antibody arrays has enabled expression levels of many leukocyte surface CD antigens to be monitored simultaneously. Such membrane-proteome surveys have provided a powerful means to detect changes in leukocyte activity in various human diseases, such as cancer and cardiovascular diseases. The challenge is to devise a computational method to infer differential leukocyte activity among multiple biological states based on antigen expression profiles. Standard DNA microarray analysis methods cannot accurately infer differential leukocyte activity because they often fail to take the cell-to-antigen relationships into account. Here we present a novel latent variable model (LVM) approach to tackle this problem. The idea is to model each cell type as a latent variable, and represent the class-to-cell and cell-to-antigen relationships as a LVM. Once the parameters of the LVM are learned from the data, differentially active leukocytes can be easily identified from the model. We describe the model formulation and assumptions which lead to an efficient expectation-maximization algorithm. Our LVM method was applied to re-analyze two cardiovascular disease datasets. We show that our results match existing biological knowledge better than other methods such as gene set enrichment analysis. Furthermore, we discuss how our approach can be extended to become a general framework for gene set analysis for DNA microarrays.
UR - http://www.scopus.com/inward/record.url?scp=67651237076&partnerID=8YFLogxK
U2 - 10.1142/9781848163324_0011
DO - 10.1142/9781848163324_0011
M3 - Article
C2 - 19425153
AN - SCOPUS:67651237076
SN - 0919-9454
VL - 21
SP - 126
EP - 137
JO - Genome informatics. International Conference on Genome Informatics
JF - Genome informatics. International Conference on Genome Informatics
ER -