TY - JOUR
T1 - Protein function prediction via graph kernels
AU - Borgwardt, Karsten
AU - Ong, Cheng Soon
AU - Schoenauer, Stefan
AU - Vishwanathan, S
AU - Smola, Alexander
AU - Kriegel, Hans-Peter
PY - 2005
Y1 - 2005
N2 - Motivation: Computational approaches to protein function prediction infer protein function by finding proteins with similar sequence, structure, surface clefts, chemical properties, amino acid motifs, interaction partners or phylogenetic profiles. We present a new approach that combines sequential, structural and chemical information into one graph model of proteins. We predict functional class membership of enzymes and non-enzymes using graph kernels and support vector machine classification on these protein graphs. Results: Our graph model, derivable from protein sequence and structure only, is competitive with vector models that require additional protein information, such as the size of surface pockets. If we include this extra information into our graph model, our classifier yields significantly higher accuracy levels than the vector models. Hyperkernels allow us to select and to optimally combine the most relevant node attributes in our protein graphs. We have laid the foundation for a protein function prediction system that integrates protein information from various sources efficiently and effectively.
AB - Motivation: Computational approaches to protein function prediction infer protein function by finding proteins with similar sequence, structure, surface clefts, chemical properties, amino acid motifs, interaction partners or phylogenetic profiles. We present a new approach that combines sequential, structural and chemical information into one graph model of proteins. We predict functional class membership of enzymes and non-enzymes using graph kernels and support vector machine classification on these protein graphs. Results: Our graph model, derivable from protein sequence and structure only, is competitive with vector models that require additional protein information, such as the size of surface pockets. If we include this extra information into our graph model, our classifier yields significantly higher accuracy levels than the vector models. Hyperkernels allow us to select and to optimally combine the most relevant node attributes in our protein graphs. We have laid the foundation for a protein function prediction system that integrates protein information from various sources efficiently and effectively.
UR - https://www.scopus.com/pages/publications/29144446929
U2 - 10.1093/bioinformatics/bti1007
DO - 10.1093/bioinformatics/bti1007
M3 - Article
VL - 21
SP - 147
EP - 156
JO - Bioinformatics
JF - Bioinformatics
IS - Supplement 1
ER -