Jones home>

The PSIPRED Protein Structure Prediction Server

David Jones, Tim Nugent, Anna Lobley, Daniel Buchan & Kevin Bryson
Past contributors: Liam McGuffin
Bryson home>
Description

PSIPRED
Server
Help Page


PSIPRED
Server
History


PSIPRED
Fair Usage
Policy


PSIPRED
Commercial
Enquiries
The PSIPRED protein structure prediction server allows you to submit a protein sequence, perform a prediction of your choice and receive the results of the prediction via e-mail. You may select one of three prediction methods to apply to your sequence: PSIPRED - a highly accurate method for protein secondary structure prediction, MEMSAT - our widely used transmembrane topology prediction method and GenTHREADER - a sequence profile based fold recognition method. More...


CLICK HERE TO ACCESS THE SERVER

For queries regarding PSIPRED: psipred@cs.ucl.ac.uk


News:
  • May 2009: pGenTHREADER and pDomTHREADER now running (mGenTHREADER now discontinued).
  • Mar 2008: New versions of PSIPRED (2.6) and mGenTHREADER (8.1) running. Major revision of fold library (all PDB chains with < 90% sequence identity now included).
  • Feb 2008: New server hardware and (m)GenTHREADER software updates under test.
  • May 2006: New version of MEMSAT (3.0) running. Minor cleanup of (m)GenTHREADER fold library.


Please cite the following references:

The PSIPRED server
  • Bryson K, McGuffin LJ, Marsden RL, Ward JJ, Sodhi JS. & Jones DT. (2005) Protein structure prediction servers at University College London. Nucl. Acids Res. 33(Web Server issue): W36-38.
The PSIPRED secondary structure prediction method
  • Jones DT. (1999) Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292: 195-202.
GenTHREADER
  • McGuffin LJ & Jones DT. (2003) Improvement of the GenTHREADER method for genomic fold recognition. Bioinformatics, 19: 874-881.
  • Jones DT. (1999) GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J. Mol. Biol. 287: 797-815.
MEMSAT
  • Jones DT. (2007) Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics. 23: 538-544.
  • Jones DT, Taylor WR, Thornton JM. (1994) A Model Recognition Approach to the Prediction of All-Helical Membrane Protein Structure and Topology. Biochem. 33: 3038-3049.
MEMSAT-SVM
  • Nugent T, Jones DT. (2009) Transmembrane protein topology prediction using support vector machines. BMC Bioinformatics. 10: 159
pGenTHREADER / pDomTHREADER
  • Lobley A, Sadowski MI & Jones DT. (2009) pGenTHREADER and pDomTHREADER: New methods For Improved Protein Fold Recognition and Superfamily Discrimination. Bioinformatics. 25: 1761-1767.


Currently loaded data banks:

Sequences: Filtered UNIREF90 (updated weekly)
Fold library: PDB90 Weekly updates


Overview of prediction methods:

Predict Secondary Structure (PSIPRED)

PSIPRED is a  simple and accurate secondary structure prediction method, incorporating two feed-forward neural networks which perform an analysis on output obtained from PSI-BLAST (Position Specific Iterated - BLAST). Using a very stringent cross validation method to evaluate the method's performance, PSIPRED 2.6 achieves an average Q3 score of 80.7%.

Predictions produced by PSIPRED were also submitted to the CASP4 evaluation and assessed during the CASP4 meeting, which took place in December 2000 at Asilomar. PSIPRED 2.0 achieved an average Q3 score of 80.6% across all 40 submitted target domains with no obvious sequence similarity to structures present in PDB, which ranked PSIPRED top out of 20 evaluated methods (an earlier version of PSIPRED was also ranked top in CASP3 held in 1998).

It is important to realise, however, that due to the small sample sizes, the results from CASP are not statistically significant, although they do give a rough guide as to the current "state of the art". For a more reliable evaluation, the EVA web site at Columbia University provides a continuous evaluation. Also see the EVA servlet to visualize a breakdown of specific types of errors made by PSIPRED and other secondary structure prediction methods. NOTE that at the time of writing, the EVA site is no longer being updated.

Downloads:

The PSIPRED V2.6 software can be downloaded from HERE.

Please note that you should read the license terms given in the README file if you wish to incorporate PSIPRED in another program or Web server.

Older releases of PSIPRED can be downloaded here HERE.

Predict Transmembrane Topology (MEMSAT)

MEMSAT V3 is the latest version of the widely used all-helical membrane protein prediction method MEMSAT. The method was benchmarked on a test set of transmembrane proteins of known topology. From sequence data MEMSAT was estimated to have an accuracy of over 78% at predicting the structure of all-helical transmembrane proteins and the location of their constituent helical elements within a membrane.

Academic users can download MEMSAT3 code here.

Predict Transmembrane Topology with SVM (MEMSAT-SVM)

MEMSAT-SVM is a support vector machine-based (SVM) TM protein topology predictor that integrates both signal peptide and re-entrant helix prediction, benchmarked with full cross-validation on a novel data set of 131 sequences with known crystal structures. The method achieves topology prediction accuracy of 89%, while signal peptides and re-entrant helices are predicted with 93% and 44% accuracy respectively.

Users can download the MEMSATSVM source code from http://bioinf.cs.ucl.ac.uk/memsat/memsat-svm/

Fold Recognition (GenTHREADER)

GenTHREADER is a fast and relatively powerful fold recognition method, which can be applied to either whole, translated genomic sequences (proteomes) as in the case of the GTD or individual protein sequences as in the case of the PSIPRED server. It is not as sensitive at pGenTHREADER but is much faster.

Fold Recognition (pGenTHREADER)

This method is now our recommended method for fold recognition and identification of distant homologues. Essentially it is the based on the original GenTHREADER method, but makes use of profile-profile alignments and predicted secondary structure (using PSIPRED) as inputs. This increases both the sensitivity of the method and enhances the accuracy of alignments, but also makes it much slower than the normal GenTHREADER method as PSI-BLAST needs to be run on the target sequence before the search can begin.

Domain Recognition (pDomTHREADER)

pDomTHREADER is an accurate and sensitive superfamily discrimination, combining information from both sequence and structure to produce highly accurate domain alignments. The method employs the same underlying threading algorithm as pGenTHREADER, however it aligns sequences to a domain-based template library rather than a chain-based template library. The use of smaller regions of structure for templates means that different features of the alignments are required for optimal scoring. The final prediction score results from an SVM trained on a combination of 5 different feature inputs; template coverage, alignment score, template length, solvation and pairwise potentials.

Compared with other superfamily discrimination methods using Hidden Markov Models and PSI-BLAST profile alignments, we found that pDomTHREADER provided higher coverage on the CATH S35 superfamilies. Additionally, pDomTHREADER produced more accurate alignments that can be used to better predict domain boundaries. For more information regarding the method, please consult the reference above.

Please note that the pDomTHREADER method is tuned for performance in fine superfamily discrimination, for fold recognition problems or structural annotation of very distant sequences, pGenTHREADER should be used.




McGuffin LJ, Bryson K, Jones, D.T. (2000) The PSIPRED protein structure prediction server. Bioinformatics. 16: 404-405.
UCL home | UCL Bioinformatics Group | Bloomsbury Centre for Bioinformatics | Jones home | Bryson home |