ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Predicting gene ontology annotations of orphan GWAS genes using protein-protein interactions

Kuppuswamy, Usha and Ananthasubramanian, Seshan and Wang, Yanli and Balakrishnan, Narayanaswamy and Ganapathiraju, Madhavi K (2014) Predicting gene ontology annotations of orphan GWAS genes using protein-protein interactions. In: ALGORITHMS FOR MOLECULAR BIOLOGY, 9 .

alg_mol_bio_9_2014.pdf - Published Version

Download (4MB) | Preview
Official URL: http://dx.doi.org/10.1186/1748-7188-9-10


Background: The number of genome-wide association studies (GWAS) has increased rapidly in the past couple of years, resulting in the identification of genes associated with different diseases. The next step in translating these findings into biomedically useful information is to find out the mechanism of the action of these genes. However, GWAS studies often implicate genes whose functions are currently unknown; for example, MYEOV, ANKLE1, TMEM45B and ORAOV1 are found to be associated with breast cancer, but their molecular function is unknown. Results: We carried out Bayesian inference of Gene Ontology (GO) term annotations of genes by employing the directed acyclic graph structure of GO and the network of protein-protein interactions (PPIs). The approach is designed based on the fact that two proteins that interact biophysically would be in physical proximity of each other, would possess complementary molecular function, and play role in related biological processes. Predicted GO terms were ranked according to their relative association scores and the approach was evaluated quantitatively by plotting the precision versus recall values and F-scores (the harmonic mean of precision and recall) versus varying thresholds. Precisions of similar to 58% and similar to 40% for localization and functions respectively of proteins were determined at a threshold of similar to 30 (top 30 GO terms in the ranked list). Comparison with function prediction based on semantic similarity among nodes in an ontology and incorporation of those similarities in a k nearest neighbor classifier confirmed that our results compared favorably. Conclusions: This approach was applied to predict the cellular component and molecular function GO terms of all human proteins that have interacting partners possessing at least one known GO annotation. The list of predictions is available at http://severus.dbmi.pitt.edu/engo/GOPRED.html. We present the algorithm, evaluations and the results of the computational predictions, especially for genes identified in GWAS studies to be associated with diseases, which are of translational interest.

Item Type: Journal Article
Additional Information: Copyright for this article belongs to the BIOMED CENTRAL LTD, ENGLAND
Keywords: GWAS; GO annotation; Prediction; Function; Protein-protein interactions; Precision-recall curves
Department/Centre: Division of Interdisciplinary Sciences > Supercomputer Education & Research Centre
Date Deposited: 13 Jun 2014 06:12
Last Modified: 13 Jun 2014 06:12
URI: http://eprints.iisc.ac.in/id/eprint/49128

Actions (login required)

View Item View Item