back to Table of Contents
14. Scanning mutagenesis
14.1 Overview
EGAD can be used for scanning mutagenesis on a template structure. The
EGAD energy function is able to predict the relative affinity of
>1500 mutants from over 30 proteins to within 1kcal/mol (Pokala and Handel 2005). A
brief
description of the thermodynamic cycle used for mutant stability
prediction is shown in Figure 14.1.1. Figure 14.1.2 and Figure 14.1.3
show some of these data.
The scanning mutagenesis functionality is used with the same format as
designs, except that "VARIABLE_POSITIONS"
is replaced with "SCANNING_POSITIONS" in
the inputfile. For example (examples/scanning_mutagenesis/scan_mut/gb1.scan_mut.input):
START
TARGET ../../template_structures/gb1.pdb
FORCEFIELD ../../energy_function/forcefield
LOOKUP_TABLE_DIRECTORY ../../lookup_tables/gb1.lookup_table
OUTPUT_PREFIX gb1.scan_mut
SCANNING_POSITIONS
43 W, E, K.
44 A, E.
45 Q, R, S.
46 all
47 polar
END
Instead of running a design, however, the residues specified for
each position are substituted, one by one, their rotamers optimized,
and their energies calculated. An important caveat (compared to the
other jobtypes) is that no other residues will be allowed to move. The
user must specify the positions whose rotamers are allowed to move by
listing them in the SCANNING_POSITIONS
section without an amino acid type; these will be allowed to move, with
the template amino-acid identity.
The resulting output_prefix.scanning_mutagenesis
file lists the following:
seq_pos wt_res mutant_res ∆∆Gfolded
∆clashes ∆hbond_unsat ∆∆Gtotal coresurfint
For example, examples/scanning_mutagenesis/scan_mut/gb1.scan_mut.scanning_mutagenesis
file has:
seq_pos wt_res mutant_res
ddG_folded clashes unsat_hbond ddG_total coresurfint
...
43 W W 0.000000 0 0 0.000000 s
44 T T 0.000000 0 0 0.000000 s
45 Y Y 0.000000 0 0 0.000000 s
46 D A 0.304587 0 0 1.522936 s
46 D D 0.000000 0 0 0.000000 s
46 D E -0.571090 0 0 -2.855454 s
46 D F -0.508344 0 1 12.172872 s
...
Asp (D) 46
(s = surface position) -> Ala (A) has a ∆∆Gfolded = 0.304587 kcal/mol and a ∆∆Gtotal
= 1.522936 (meanings
of energy values discussed in output pdb file section). It does not
form or relieve a clash, nor does it have any affect on the number of
unsatisfied hydrogen-bonding atoms.
As discussed previously, predicted ∆∆Gs are unreliable if
the difference if |∆clashes| > 2 (Pokala
and Handel 2005).
Bu default, the files for the individual point mutants are deleted
upon completion. To keep these files, include:
CLEAN_UP_FLAG 0 # default 1 removes intermediate
files
in the inputfile.
By default, the coordinates of the point mutants are not saved to the
pdb files, since the master only requires the energies and rotamers
listed in the header of the output pdb file; writing the coordinates
can take a significant amount of time. If the coordinates are to be
kept, include:
OUTPUT_COORD_FLAG 1 # for scanning mutagenesis, default 0 does not save
coordinates
in the inputfile
Note that since the non-structured reference states for cysteine
have not been calculated, the energies for positions that contain
cysteine in the template sequence are unreliable.
Scanning mutagenesis is readily parallelized:
EGAD.exe gb1.scan_mut.input parallel (parallel
options here or in inputfile)
will parallelize the lookup table generation, as well as the
rotamer optimization of the point-mutants.
14.2 alanine_scan and saturation_mutagenesis
The following is a shortcut for performing an alanine scan at all
movable positions:
START
...
END
SCANNING_POSITIONS
alanine_scan
END
Saturation mutagenesis (allow every amino acid at all positions) may be
defined in a similar manner:
START
...
END
SCANNING_POSITIONS
saturation_mutagenesis
END
For examples of these jobs, see examples/ala_scan/gb1.ala_scan.input,
examples/sat_mut/gb1.sat_mut.input, and
their outputs for alanine scanning and saturation mutagenesis,
respectively. For both of these jobs, for each mutant, rotamers at all
other positions are allowed to move.
back to Table of Contents