![]() |
|
||||||||||||||||||||||||||||||||||||
| |
|||||||||||||||||||||||||||||||||||||
|
|||||||||||||||||||||||||||||||||||||
|
Submit this job using the PBS 'qsub' command. Example:
| qsub -l nodes=1 rosettaRun |
See here for more information about PBS.
Many of the Rosetta++ methods can generate a large number of decoys or intermediate files. In these cases, the method can be broken into a large number of independent jobs using swarm. The script /usr/local/bin/rosetta_swarm_setup has been created to automate this breakup and submission to swarm. The script accepts Rosetta commands and creates a series of swarm commands based on the series code and nstruct values. For more information, type 'rosetta_swarm_setup' at the Biowulf prompt.
Because of the huge number of files created by Rosetta, it is very easy to reach your quota limit before the run completes (typically greater than 10,000 decoys). In these cases, it is recommended to use the -scorefilter, -smart_scorefilter and -output_pdb_gz options. See here in the options section for more information about these and other options.
RosettaAbinitio is used to generate a set of model structures from an amino acid sequence. This is done in two steps:
A better way of generating decoys is with rosetta_swarm_setup -- HIGHLY RECOMMENDED!
Files you will need in your running directory:
|
|
Generate 1000 decoy structures from a fasta file:
Do a cluster analysis on concatenated silentfiles (combined.out):
Extract a structure file for decoy number 50 from the silentfile combined.out:
By default, PDB files are written as C-alpha models. To generate full-atom models from C-alpha models, use RosettaDesign:
|
RosettaDesign can be run in several different modes. They include:
Example runs can be found in /usr/local/rosetta/RosettaDesign.
Files you will need in your running directory:
|
Redesign using a resfile, output 3 structures each with a name that begins with test1:
Just repack a protein with extra chi1 rotamers:
Move the backbone and design. In this case the 3 standard arguments to rosetta must be used. these are:
Also,
To design with a flexible backbone the starting structure must have ideal bond lengths and angles. Use the following command to idealize a structure:
|
An excellent tutorial on using RosettaDock is available here.
Most of the scripts for docking are available in /usr/local/rosetta/rosetta_scripts/docking. A better way of generating decoys is with rosetta_swarm_setup -- HIGHLY RECOMMENDED!
Files you will need in your running directory:
|
|
Prepack structure prior to full atom run:
Generate 1000 decoys using full atom docking run:
Cluster the resulting decoys by RMSD:
|
For now, see the RosettaCommons page here.
Rosetta++ builds and refines protein structures on the basis of fragment libraries. The target sequence is broken into 3- and 9-amino acid segments. A library of fragments that represent the range of accessible local structures for all short segments of the target sequence are selected from a a database of known protein structures. Segments are matched with structural fragments on the basis of sequence profiles using PSIBLAST against a non-redundant Fasta sequence database (NCBI nr) and secondary structure predictions (PSIPRED, SAM-T02, JUFO, and PHD). Compact structures are then assembled by randomly combining these fragments, using a Monte Carlo simulated annealing search.
In the simplest case, generating a fragment library on Biowulf requires only a target sequence, and is executed by typing the following command:
| rosetta_make_fragments abcde.fasta |
The FASTA file name must either have a prefix of five characters plus the 'fasta' suffix, or the -id option must be given with a four character sequence/one character chain identifier (e.g., 1fld_).
The target sequence will be subjected to two rounds of PSIBLAST, and a secondary structure prediction will be calculated using PSIPRED and JUFO. For better profiling, secondary structure predictions from SAM-T02, and PHD can be included.
SAM predictions can be done in conjunction with PHD by using the script /usr/local/rosetta/rosetta_fragments/SAM-PHD.pl as a batch or interactive qsub job:
|
JUFO and PHD predictions must be done on their respective online servers and the results given in seperate files with the following naming format:
| abcde.rdb - SAM-T02 | abcde.jufo_ss - JUFO | abcde.phd - PHD |
PLEASE NOTE: The secondary structure prediction files must be written in a strict format. The formats of the secondary structure predictions are as follows:
abcde.phd (PHD prediction, FORTRAN format = (9x,a3,2x,a60) or (9x,a3,2x,i60)):
|
abcde.rdb (SAM-T02, DSSP 3-value prediction in rdb format, FORTRAN format = (i8,1x,a1,3(1x,f8.3))):
|
In addition to functionalities associated with particular modules, the Rosetta binary can be used to 'score' structure according to the Rosetta energy function using the '-score' command line option. A typical use to score a single structure would use the command line:
| rosetta -score -s [pdb_file] -scorefile [score_file] |
Where [pdb_file] is the structure to be scored and [score_file] is an output file where scoring terms will be written. To score a list of structures, use
| rosetta -score -l [list_file] -scorefile [score_file] |
where [list_file] contains a list of pdb files, one per line.
The '-score' mode can also be used to add sidechains to a structure by specifying that a structure should be output:
| rosetta -score -s [pdb_file] -scorefile [score_file] -nstruct 1 -fa_output |
where "-nstruct 1" indicates that a structure should be output and "-fa_output" indicates that the structure should have fullatom sidechain coordinates. If you want to keep any available sidechain coordinates on the input pdb file, add "-fa_input" to the command line.
cat_silent.pl: concatenate silentfiles
changeChain.pl: change the chain id of a PDB
compose_score_silent.py: generate a silentfile from a set of PDBs
createLoop.pl: create a dummy structure from a sequence of amino acids
createTemplate.pl: create a homology model template from a FASTA file and a homologous structure
make_coords_file.py: generate .coords format from native pdb (for input to cluster_info_silent.out, see below)
molecule.exe: generate JUFO file and rename ligand atoms (with addhydrogens.inp, mdl2rosetta.inp, and pdb2mdl.inp)
pdb_fasta.pl: generate a FASTA from a PDB
pdb2tag.pl: rename a PDB or set of PDBs to their tag names (as shown in the silentfile)
reconstruct_PDB_by_index: generate PDBs from abinitio-format silentfile
renumberPDBandchains.pl: renumber the residues of a PDB sequentially, starting at 1
renumberPDBatoms.pl: renumber the atoms of a PDB sequentially, starting at 1
silentDock2pdb.pl: generate PDBs from a docking silentfile
TMalign: aligns structures based on CA-CA distances
VMD: X-Windows molecular graphics viewer
getColumn.pl: display silentfile and scorefile columns
gnuplot: graphically display data
histogram.pl: generate a quick histogram from STDIN data
cluster.pl: automatic clustering of an abinitio-format silentfile
cluster_info_silent.out: fully configurable silentfile clustering
cluster_pdbs.pl: cluster a set of PDBs
cluster_variation.pl: find per-residue variation within a cluster
make_color_trees.py: make a dendrogram of the clusters
make_new_plot.py: make a contacts plot