Protein stability

Reading material:   The DeepView manual (also available as a PDF file here) and the SPDV FAQ.

Goals: to gain a better understanding of how proteins are held together, and why some proteins are more stable than others.


Enzymes are used in a variety of industrial processes, many of them under extreme conditions such as high temperature or high pH. Common household items may also contain stabilized enzymes or ones engineered for special purposes. Examples of these are proteases, lipases and cellulases in washing powder, and amylase in flour.

Structural biology and protein-engineering experiments have provided an understanding of the basic protein stabilizing forces, which include hydrophobic and electrostatic interactions and entropic effects. Furthermore, studies of proteins from organisms living in extreme surroundings, such as high temperature, high salt etc., have given insights into why some proteins are more stable than others.

In this exercise, you will analyze mesophilic (PDB: 1st3) and thermophilic (PDB: 1thm) subtilisin. You will analyze the enzyme using different methods and also suggest and perform (in silico) mutations that you propose would stabilize the mesophilic form. (Remember, we wouldn’t want to change anything that affects the overall structure or activity of the protein, so knowing something about that is important, too! A brief overview can be found at: http://en.wikipedia.org/wiki/Subtilisin.

You need to be familiar with some of the basic commands, and know how to do mutations, in Swiss-PdbViewer (DeepView).


1. Conformational entropy

The entropy of the system is very important for the stability of a protein. By decreasing the conformational entropy of the main chain in the unfolded state, increased stability can be achieved.


- Glycine and proline

Two amino acids have significantly different conformational properties than the others: Gly and Pro. By changing a Gly residue into an Ala (or any other amino acid) increased stability might be achieved. The same effect could be obtained by replacing a non-proline residue with a Pro. However, the mutations should not destroy the ability to form secondary structure elements, which, for example, means that Pro should preferably be introduced in turns or loops, i.e. places where the structure is more likely to accommodate this sort of change.

Q1: You should now analyze mesophilic subtilisin (1st3) and see if you can find any suitable position to remove a Gly residue or introduce a Pro residue.

Don’t forget to investigate the surroundings of the suggested mutations – will the introduced amino acid residues be happy here?


- Disulfide bridges

Disulfide linkages are also an effective way to reduce the conformational entropy of the unfolded protein. When designing such linkages you need to consider the geometry of the residues involved. Ideally, the C-alpha/C-alpha distance should be 6-8 Å, the side chains should point towards each other (or be able to do so, in a favorable side-chain conformation), and the distance between the side-chain gamma atoms (if they exist) should be less than 6 Å. You might find this site helpful here: http://cptweb.cpt.wayne.edu/DbD2/

Q2: Find a suitable place to create a disulfide linkage in mesophilic subtilisin (1st3).

 

- B-factors

Clues about which are the most flexible regions of the protein (and any other uncertainties) can be obtained from the temperature factors (the B-factors) for the atoms involved. The B-factors are given in the coordinate file (i.e. the pdb file). You can take a look at the actual coordinate file by clicking on the small document symbol on the left-hand side of the toolbar. The B-factors are in the very right-hand column (the last number). A B-factor of 30 Å2, for example, means a root-mean-square error in the atomic coordinate of 0.6 Å, while one of 80 Å2 means a coordinate error of 1 Å (more than half the length of a C-C bond). In the program you can color the protein according to the B-factors. Lower B-factors are blue and higher are green, and higher still are red. Click on Color and select “B-factor”. 

Q3: Investigate the B-factors of subtilisin (1st3) and see if these can lead you to residues that could be good targets for stabilization. Suggest mutations that could improve the stability.


2. Electrostatic forces

- Stabilizing helices

Alpha helices are very common structure elements of proteins, and helix ”caps” are important determinants of protein stability. In addition, the introduction of a negatively-charged side chain at the N-terminal end of a helix will create favorable interactions with the partial positive charge that exists there because of the aligned carbonyl groups of the helix’s main chain (see lecture notes on protein structure). Similar effects have been reported for the removal of negatively-charged amino acid residues near the C-terminal end of alpha helices.

Q4: Can you find any suitable positions for introducing a change that might stabilize an alpha-helix in mesophilic subtilisin (1st3)?

- Improving electrostatic contacts

An optimized network of electrostatic contacts can contribute substantially to the stability of a protein.

Q5: Investigate the subtilisin (1st3) structure and suggest two mutations that could improve the stability by introducing new electrostatic interactions. This could be in terms of improved hydrogen bonding networks or new salt bridges.

 

3. Holes and hydrophobic contacts

The core of globular proteins is mostly hydrophobic in nature. If there are any buried charges, they have to be counteracted by other factors such as opposite charges (salt bridges). In general, thermostable enzymes have fewer internal cavities than their mesophilic counterparts. Removal of such cavities can stabilize protein structures.

We will now take a look at the hydrophobic core of subtilisin (1st3) and analyze the packing of the core. Then we will try to find ways of improving that packing by filling in cavities and pockets.

We will use the web server CASTp found at: http://sts.bioe.uic.edu/castp/calculation.php This program identifies protein pockets and cavities based on a specified pdb file. The calculation uses a probe with a radius of 1.4 Å, which is the size of a water molecule, to search for holes in the structure. If this one isn't working, remember, backups are http://voronoi.hanyang.ac.kr/betacavityweb/ and DoGSiteScorer.

Enter 1st3 in the window marked “Type the 4 letter pdb ID:”, and click the “Submit” button. Jmol is preselected for visualization. The calculation might take a few moments. Be patient! The pockets are shown in different colours. On the left-hand side, you will find a list of the pockets. Here you get information on their volume and area. Select a pocket in the list to identify its position and to see which residues are lining it.

Q6: Select one of the pockets, and suggest a suitable mutation that could stabilize the protein by filling the pocket.


4. Comparisons with a thermophilic protein

Comparison of sequences as well as structures of thermostable proteins can give us clues about changes that could increase the stability. Therefore, load the 1thm file and superimpose it on the mesophilic 1st3 structure.

In addition to comparing the structures, you should also investigate the sequence alignments. Use the sequence alignment tools in the Fit menu of Swiss-PdbViewer and open the Alignment window to see the result. Put the cursor on the letters in the sequence align window; this will allow you to see the position of the amino acid residues in the structure.

  Q7: Compare the two proteins (amino acid sequences, 3D structures, B-factors) and see if you can find any differences that might be related to the higher stability of the thermophilic variant.



If you still have time and want to learn more about protein stability, here are a couple of other things you could investigate:

Extra exercise A. Conformational preferences

- Analysis of the protein structure: Ramachandran plot

The Ramachandran plot presents the backbone torsion (phi and psi) angles for residues in a protein structure (see lecture notes), and can help identify regions that have strained conformations. These places might be targets for genetic engineering aimed at relaxing those regions and so stabilizing the protein.

It is very easy to make use of the Swiss-PdbViewer program for the analysis. First, select "all atoms”, then choose "Ramachandran" under the "Wind" menu to calculate the plot. This plot shows how well the phi and psi angles in the structure conform to the most common (favorable) pairs of values. Right-handed alpha helices and beta sheets, for example, have very different phi/psi combinations, and so appear in two separate regions of a Ramachandran plot. Put the pointer on the symbols in the diagram and see where residues are located. Residues falling outside of the "allowed" regions are called outliers. These could actually be wrong in the structure (we’ll talk about that later). However, in most structures, there are residues that are “true” outliers, in the sense that their peptide bond angles are strained in the structure. So, one reasonable idea for a stabilizing mutation could be to relax strained parts of the structure. This often means that the residue is mutated to glycine, which can have almost any values for phi and psi. (Be careful, though, as some Ramachandran outliers are needed for reasons of structure or function.)

Or, you can make plots in PyMol with the Dynoplot plugin (https://pymolwiki.org/index.php/DynoPlot). First, go to the plugin manager and click on the "installed plugins" to check whether you already have it installed. If you don't , click on "install new plugin" and paste https://raw.githubusercontent.com/Pymol-Scripts/Pymol-script-repo/master/plugins/dynoplot.py in the URL box and click fetch. Once dynoplot is installed, you can write 'rama all' in the command line to visualize the plot, or select residues to view. If you encounter problems, and have Python installed in your system, PyRAMA is another (although less interactive) option.


Otherrwise, the sever at https://zlab.umassmed.edu/bu/rama/ coiuld be useful. Turn on Gly and Pro, and labels for outliers.


- Analysis of the protein sequence/structure: secondary structure propensities

Investigate the 1st3 structure to find out if the secondary structure propensities might be improved. Secondary structure propensities are found in this file.


Extra exercise B. Literature search

Because subtilisin is a protease frequently used in detergents, there are many structures of this enzyme in the PDB. How many entries in the database can you find? What kind of mutants are there, and with which purposes were they created? Do you find any examples of mutations made to stabilize the protein?



Last modified: S. Mowbray, 6 September, 2022.