Revised paper, 5/28/2010
Supplementary Data
The following sequences were used in benchmarking tests for the paper,
"On Lattice Protein Structure Prediction Revisited", submitted by
Ivan Dotu, Manuel Cebrian, Pascal Van Hentenryck, Peter Clote to
IEEE Transactions on Computational Biology and Bioinformatics.
HP-sequences used in benchmarking tests
Harvard Instances
-
HPHHPPHHHHPHHHPPHHPPHPHHHPHPHHPPHHPPPHPPPPPPPPHH
-
HHHHPHHPHHHHHPPHPPHHPPHPPPPPPHPPHPPPHPPHHPPHHHPH
-
PHPHHPHHHHHHPPHPHPPHPHHPHPHPPPHPPHHPPHHPPHPHPPHP
-
PHPHHPPHPHHHPPHHPHHPPPHHHHHPPHPHHPHPHPPPPHPPHPHP
-
PPHPPPHPHHHHPPHHHHPHHPHHHPPHPHPHPPHPPPPPPHHPHHPH
-
HHHPPPHHPHPHHPHHPHHPHPPPPPPPHPHPPHPPPHPPHHHHHHPH
-
PHPPPPHPHHHPHPHHHHPHHPHHPPPHPHPPPHHHPPHHPPHHPPPH
-
PHHPHHHPHHHHPPHHHPPPPPPHPHHPPHHPHPPPHHPHPHPHHPPP
-
PHPHPPPPHPHPHPPHPHHHHHHPPHHHPHPPHPHHPPHPHHHPPPPH
-
PHHPPPPPPHHPPPHHHPHPPHPHHPPHPPHPPHHPPHHHHHHHPPHH
Instances S
-
HHHHPHHHHHHPPHHPHHHHHHHHPHHPHHHHHHHHHHPPHHPPPPPHHPHHHHHHHHPHPPHHPPPHHHHHHHHPHHHHHHPPHHHHHHHPHPPHHHHHHHHHPPHHPPPHHHHHHHPHHPHHHHHHHPPHHHH
-
HHPPHPHHHHHHHHHHPHPPPPHHHPPPHHHHHPPHHHHHPPHHHHPPHHHHPPHHHHHHPHHHHPPPHHPPPHHHHHHHHPHPPHHHPPPHHHHHPPHHHHHHHPPPHHPPHHHHHPPPHHHHHHHHHPHPPHHHHHHHPPPHHHPPHHP
-
HHHPPPHHPHHPPPPPHHHHHHHHPHPPHHPHHPHHHHHPPPHHHHHHHHHPPHPHPPHPHPPHHHPHPPHPHPPPHHHHHHPHHHHPPPHHHPPPPHHPPPHHHPPHHHHPHHHHHPPHHHHHHPPPHHHHHHPPPHPPHHHHPHHHHHHHPPHHPPHHH
-
HHPPHPHHHHHHHPPHPHPPHPHPPPPHHHPPPHHPHPHHPPHHHHHPPHHHHPPHHHHPPHHHHHHPHHHHPPPHHPPPHHHHHHHHPHPPHHHPPPHHHHHPPHHHHHHPHPPPHHPPHHHPHHPPPHPHHHHHHHPHPPHPPHHHHHHPHPPPHHHPPHHP
Instances R
-
PPPHPHHPHHPPPHPHPPPPHPHHPPHPHHHHHPPHHPPHHHHHHPPHPPHHPPHPHPHHHHHPHHPHHHPPPHHHPHHPPHPHPPHPPPHPPHPPHPPHHHPHHHPHPPHPHHPHHHHPHPHHHPHHHPPPPPPHHHHHHPPPPPPPPHHHPPHPHPPPHPHPHPHHPPHHPPPPHHHHHHPPPHHPPPPPHPPPHHPP
-
HPHHPPHPPPPPHHPHPHPHHPPHPPPPHHHHHHPPPHPPHHHPPHPPPPHHPPHHHPHPHHHPPHPHHPPHPHHPPPPHHPPHPPHHHHPPPPPHHHPPPPHPPPPPPHPPHHPHHHHPHHHHHHHHPPHHPPPHPHHHPHHHHHPHHPHHHPHPHHPPPPHPHHPHHHPHPPPPHPPPPPPHPHHHHHPHHPPPHPPH
-
HPHHHPHHPHPHPPPHHHHHPHPHPHHHHPPPHHPPPPPPHHPPPPHPHHHPPPPHPPPHHPHHPPPHPPHPPPHHHHPHHPHPPPPHHPPPHHPPHPPPHPPHHHPHHHPHPPHPHHHHPPHHPPPPHHHPHHPPHPPHHHHPPHPHPPHPHPPPPPHPHPHHHHHHHPHPHHHHHHPHHPPPPHPPPPHPPPHHHPHH
Instances F90
-
PPHHHPPPHHPPPPHHPHHHHHHPHPHPHHPHHHHHPHHHPHPHHHHPHHPPPPHHHPHPHPPHHHPHHPHPHPPHHHPPPPHHPPHPPP
-
PHHPPHPHHPHHHPHHHPPHHHHHHPPHPHPPPPHHHPHPPHHHHPHHHHPHHHPHHPPPPPHHPPPPHPHPHPHPHHPPHHHPPPHHHP
-
HPHPHHHPHHHHPHHHPPPHPPPHPPPPHHHPPHPPPPHHHPPPPPPPPHPHHPHHHHPHHHPHPHHPPHHHHHPHHPPHHPHHHHHHPH
-
PHHHPPHPPHPHPPPPHPPPHPHPPHPHHPHPPPHHHPHHHPPHHHPPHPPPPHPHHHPPHHPPHHHPPHHHHHHPHHHHHHHPHHHHPH
-
PPPHPHHHHHHHPPPHPPHHHHHPHHPPHHPPHHHHPHPHPHHPPHHPPPPHPPPHHHPHPHHHHHHHPHHPHPPHHPPPHHHPHPPHPP
Instances F160
-
HHPPHHHHHPHHHPPPHHHPPHHHPHPPHHHHHPPPHHHPPPHPHHPPPPPHHPPHHPHHPHPHHPPPPPHHHPPPPHPHHHPPHPPPHHHPHHHHPPHHPHPHHHHPHHHHPPHHPHHPHHPHHHPHPPHPHHPHPHHPHHHPHHPPHPPPHPPPPPPPHHHPHHHHHPHHHHHPPHPP
-
PHHPHPPPHPPHHPHHHPHPHHPHHHPHHHPPPHHPPHPHPHHPHHHHPPHHPHPHHHHHPHHPPPPHPHPHPPHHHHPHHHHPHHHHHPPHPHHHPPPHPHPPHHPPPHHPHPHPPPPPHPHHPHHHPHPPPPHHPHHHHHPPPHHHHHHHHPHHPPPPPHPPPHPPHPPPHHPHHHHH
-
HHHPHPPHHPPPHPPPHPHPHPPHHHHPPHHHHHHPHPHHPPPPPHPPHHPHHHHHHHHHHHPPHPPHPPHHHHHHHHPPPPHPPHHHHHPPHHHPPHHPPHHHHHPPPHHHHHHPHHHPPPHHPPHPPPHPPHPPPHPPPPHHHPPHHPHPPHHHPHHPPHHPHHPHPHPHPHPHPHHP
Predictions for Harvard instances
Will's H-core threading
-
harvard1.pdb
-
harvard2.pdb
-
harvard3.pdb
-
harvard4.pdb
-
harvard5.pdb
-
harvard6.pdb
-
harvard7.pdb
-
harvard8.pdb
-
harvard9.pdb
-
harvard10.pdb
Tabu plus LNS
-
harvard1.pdb
-
harvard2.pdb
-
harvard3.pdb
-
harvard4.pdb
-
harvard5.pdb
-
harvard6.pdb
-
harvard7.pdb
-
harvard8.pdb
-
harvard9.pdb
-
harvard10.pdb
Miscellaneous
Benchmarks for Will's H core threading program
Small examples for use of Will's code:
HPstruct -help
HPstruct -seq=HPPPHPHHHPPHHHHPHPHHPHPH -lat=FCC -maxSol=1 -dbPath=/cluster/data/CPSP_CoreDB
HPstruct -seq=HPPPHPHHHPPHHHHPHPHHPHPH -lat=FCC -allBest -dbPath=/cluster/data/CPSP_CoreDB
HPconvert -help
HPconvert -m=a2p -str=BDRUBLRDRDBLFLFLLURUBDBLBRFRBDFLBLRDFRRDFUFLRDBLBLFUBLLUFDRUFLFDBRFRFULDBURUFRBRBDFDLUFDBDBLFL -lat=FCC
Exectubles,libraries and include files are in
/usr/local/bin
/usr/local/lib
/usr/local/include
Randomized Data
These sequences were created by using the Altschul-Erikson
dinucleotide shuffle algorithm to create sequences having the
same diresidues (not simply the same expected diresidue
frequency, which latter can easily be constructed by a first-order
Markov chain).
Programs to randomize HP-sequences using Altschul-Erikson
dinucleotide shuffle
-
analyze.py: miscellaneous
program to massage and analyze output of cluster runs of Will's program.
-
README
-
dataRandomized.txt
-
run.py: Program to launch torque cluster
computation with bound of 30 minutes of Will's program.