ePrints@IISc

Use of a structural alphabet to find compatible folds for amino acid sequences

Mahajan, Swapnil and de Brevern, Alexandre G and Sanejouand, Yves-Henri and Srinivasan, Narayanaswamy and Offmann, Bernard (2015) Use of a structural alphabet to find compatible folds for amino acid sequences. In: PROTEIN SCIENCE, 24 (1). pp. 145-153.

pro_sci_24-1_145_2015.pdf - Published Version
Restricted to Registered users only

Official URL: http://dx.doi.org/10.1002/pro.2581


The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino acid sequence of a protein to fit known protein 3D structures using a structural alphabet known as Protein Blocks (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. They are used to encode 3D protein structures into 1D PB sequences and to capture sequence-to-structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods, nor does it explicitly incorporate the hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., fewer than 5% false positives), global and local threading showed sensitivities of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in the PDB: 38 compatible templates were identified by our approach, and 66% of these hits yielded correctly predicted structures. This method scales up well and offers promising perspectives for structural annotation at the genomic level. It has been implemented in the form of a web server that is freely available at http://www.bo-protscience.fr/forsa.
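
The scoring idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact form of the occurrence matrices or of the Z-score, so the sketch assumes one log-odds row per PB over single residues, ungapped global threading of equal-length sequences, and a Z-score computed against shuffled versions of the query. The function names (build_log_odds, thread_score, z_score), the pseudocount, and the number of shuffles are all hypothetical. The 16 PBs are conventionally labelled a to p.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PB_LETTERS = "abcdefghijklmnop"  # conventional labels of the 16 Protein Blocks


def build_log_odds(pairs, pseudocount=1.0):
    """Build one amino acid log-odds row per PB from aligned training pairs.

    pairs: iterable of (aa_sequence, pb_sequence) strings of equal length.
    Assumption: single-residue occurrences with a flat pseudocount; the
    published matrices may be defined differently.
    """
    counts = {pb: {aa: pseudocount for aa in AMINO_ACIDS} for pb in PB_LETTERS}
    background = {aa: pseudocount for aa in AMINO_ACIDS}
    for aa_seq, pb_seq in pairs:
        for aa, pb in zip(aa_seq, pb_seq):
            counts[pb][aa] += 1
            background[aa] += 1
    bg_total = sum(background.values())
    log_odds = {}
    for pb, row in counts.items():
        row_total = sum(row.values())
        log_odds[pb] = {
            aa: math.log((row[aa] / row_total) / (background[aa] / bg_total))
            for aa in AMINO_ACIDS
        }
    return log_odds


def thread_score(aa_seq, pb_seq, log_odds):
    """Ungapped global threading: sum the log-odds of each residue given its PB."""
    if len(aa_seq) != len(pb_seq):
        raise ValueError("this sketch only handles equal-length sequences")
    return sum(log_odds[pb][aa] for aa, pb in zip(aa_seq, pb_seq))


def z_score(aa_seq, pb_seq, log_odds, n_shuffles=200, seed=0):
    """Z-score of the raw score against scores of shuffled query sequences."""
    rng = random.Random(seed)
    raw = thread_score(aa_seq, pb_seq, log_odds)
    residues = list(aa_seq)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(residues)  # keeps composition, destroys residue order
        null_scores.append(thread_score("".join(residues), pb_seq, log_odds))
    mean = sum(null_scores) / len(null_scores)
    variance = sum((s - mean) ** 2 for s in null_scores) / len(null_scores)
    return (raw - mean) / math.sqrt(variance) if variance > 0 else 0.0


# Toy training data (invented for illustration): A co-occurs with PB 'a', G with PB 'b'.
training = [("AAAAGGGG", "aaaabbbb")] * 5
matrix = build_log_odds(training)
print(thread_score("AAGG", "aabb", matrix))      # compatible threading
print(thread_score("GGAA", "aabb", matrix))      # incompatible threading
print(z_score("AAAAGGGG", "aaaabbbb", matrix))
```

In this toy setting a query is called compatible with a fold when its Z-score exceeds a chosen cutoff; the abstract reports a cutoff tuned to 95% specificity, but its numerical value is not stated there, so none is assumed here.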

Item Type: Journal Article
Additional Information: Copyright for this article belongs to WILEY-BLACKWELL, 111 RIVER ST, HOBOKEN 07030-5774, NJ USA
Keywords: protein structures; structural alphabet; fold recognition; protein domains; threading; sequence-structure relationship; structural annotation; protein blocks
Department/Centre: Division of Biological Sciences > Molecular Biophysics Unit
Depositing User: Id for Latest eprints
Date Deposited: 06 Feb 2015 14:47
Last Modified: 06 Feb 2015 14:47
URI: http://eprints.iisc.ac.in/id/eprint/50776
