These notes are taken from the presentation slides for CSE 648, November 14, 2000, by Vinhthuy Phan (phan@cs.sunysb.edu).




Threading





NCBI CASP3 Assessment


Counts of "correct models" and "overpredictions":


  Easy Medium Hard  
Team T54 T79 T46 T53 T63 T71 T81 T83 T43 T44 T59 T67 T80 sum
003     +                     1/0
005 + - - + +   + + - - - -   5/6
009 - - + + - - - - - - - - - 2/11
017     + -       +       - - 2/3
019   - - - - - + +     - - - 2/8
028 - + - - -   - + -   - - - 2/9
035 - + -   - - - - -   -     1/8
061     + - -   - - - - - - - 1/9
066 -   -   + -     -   -     1/5
074     +       -   - - - - - 1/6
076 - + - -     -   - - -   - 1/8
084 - +     -       - - -     1/5
156   +     -       - -   -   1/4
162       +   -         -     1/2
166 +   -     - +   -   - -   2/5
176   - - + -   - + -         2/5
212 + + + +   + +             6/0
217   + -     - +   -   - -   2/5
273   -       - + -     - - - 1/6

 


"Team Bryant"





Measurements

Percent of the structure predicted: 43.97.
Largest set of residues from the Model that can fit under DISTANCE cutoff :
        13   percent of residues   ( 15 residues )   fits under   1A   with local RMSD   0.50
        28   percent of residues   ( 33 residues )   fits under   3A   with local RMSD   1.88
        37   percent of residues   ( 43 residues )   fits under   6A   with local RMSD   3.04

Detailed analysis of the model.
Each residue in the prediction is assigned to the largest set of residues containing that residue and deviating from the target by no more than a specified CA distance cutoff.
For a given residue the distance cutoff under which the set of 5, 10 and 50 percent of the modeled structure can be fitted is used for graphical presentation (three blue lines).

Target (top row): helices - PURPLE , strands - GREEN
Model (second row): helices - PURPLE , strands - GREEN
Target-Model GDT(5,10,50): BLUE
Bad model or residues not predicted: BLACK
Percent of the structure predicted: 43.97




  • Longest Continuous Segments (LCS)

    • For a given residue the size of the longest segment that can be fitted under a specified CA RMSD cutoff of 1A, 2A, and 5A.

    • LCS measures the longest segment in the model that is close to a segment in the template. "Closeness" means RMSD of 1A, 2A, or 5A.

Target (top row): helices - PURPLE , strands - GREEN
Model (second row): helices - PURPLE , strands - GREEN
LCS(1,2,5): BLUE
Bad model or residues not predicted: BLACK
Percent of the structure predicted: 43.97.
Longest Continuous Segments under RMSD cutoff :   1A   2A   5A   :   14   14   18   residues



The Gibbs-Sampling Threader



Getting target/probe sequences:

     gicA <- READ.FASTA("test.tfa")




Defining a Threading Core





Calculating Alignments and Examining Results




Setting Up a Data Search