TY - JOUR
T1 - Network aggregation improves gene function prediction of grapevine gene co-expression networks
AU - Wong, Darren C.J.
N1 - Publisher Copyright:
© 2020, Springer Nature B.V.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - Key message: Aggregation across multiple networks highlights robust co-expression interactions and improves the functional connectivity of grapevine gene co-expression networks. Abstract: In recent years, the rapid accumulation of transcriptome datasets from diverse experimental conditions has enabled the widespread use of gene co-expression network (GCN) analysis in plants. In grapevine, GCN analysis has shown great promise for gene function prediction, however, measurable progress is currently lacking. Using accumulated microarray datasets from the grapevine whole-genome array (33 experiments, 1359 samples), we explored how meta-analysis through aggregation influences the functional connectivity (performance) of derived networks using guilt-by-association neighbor voting. Two annotation schemes, i.e. MapMan BIN and Pfam, at two sparsity thresholds, i.e. top 100 (stringent) and 300 (relaxed) ranked genes were evaluated. We observed that aggregating across multiple networks improves performance dramatically, with the aggregate outperforming the majority of functional terms across individual networks. Network sparsity and size (i.e. the number of samples and aggregates) were key factors influencing performance while the choice of annotation scheme had little. Systematic comparison with various state-of-the-art microarray and RNA-seq networks was also performed, however, none outperformed the aggregate microarray network despite having good predictive performance. Repeating these series of tests using a functional enrichment-based performance metric also showed remarkably consistent findings with guilt-by-association neighbor voting. To demonstrate its functionality, we explore the function and transcriptional regulation of grapevine EXPANSIN genes. We envisage that network aggregation will offer new and unique opportunities for gene function prediction in future grapevine functional genomics studies. To this end, we make the aggregate networks and associated metadata publicly available at VTC-Agg (https://sites.google.com/view/vtc-agg).
AB - Key message: Aggregation across multiple networks highlights robust co-expression interactions and improves the functional connectivity of grapevine gene co-expression networks. Abstract: In recent years, the rapid accumulation of transcriptome datasets from diverse experimental conditions has enabled the widespread use of gene co-expression network (GCN) analysis in plants. In grapevine, GCN analysis has shown great promise for gene function prediction, however, measurable progress is currently lacking. Using accumulated microarray datasets from the grapevine whole-genome array (33 experiments, 1359 samples), we explored how meta-analysis through aggregation influences the functional connectivity (performance) of derived networks using guilt-by-association neighbor voting. Two annotation schemes, i.e. MapMan BIN and Pfam, at two sparsity thresholds, i.e. top 100 (stringent) and 300 (relaxed) ranked genes were evaluated. We observed that aggregating across multiple networks improves performance dramatically, with the aggregate outperforming the majority of functional terms across individual networks. Network sparsity and size (i.e. the number of samples and aggregates) were key factors influencing performance while the choice of annotation scheme had little. Systematic comparison with various state-of-the-art microarray and RNA-seq networks was also performed, however, none outperformed the aggregate microarray network despite having good predictive performance. Repeating these series of tests using a functional enrichment-based performance metric also showed remarkably consistent findings with guilt-by-association neighbor voting. To demonstrate its functionality, we explore the function and transcriptional regulation of grapevine EXPANSIN genes. We envisage that network aggregation will offer new and unique opportunities for gene function prediction in future grapevine functional genomics studies. To this end, we make the aggregate networks and associated metadata publicly available at VTC-Agg (https://sites.google.com/view/vtc-agg).
KW - Co-expression
KW - EXPANSIN
KW - Meta-analysis
KW - Network aggregation
KW - Transcriptome
KW - Vitis vinifera
UR - http://www.scopus.com/inward/record.url?scp=85083057140&partnerID=8YFLogxK
U2 - 10.1007/s11103-020-01001-2
DO - 10.1007/s11103-020-01001-2
M3 - Article
SN - 0167-4412
VL - 103
SP - 425
EP - 441
JO - Plant Molecular Biology
JF - Plant Molecular Biology
IS - 4-5
ER -