Protein Modifications
Proteolytic Cleavage of Proteins
Glycoproteins: Synthesis and Clinical Consequences
Acylation
Methylation
Phosphorylation
Sulfation
Prenylation
Vitamin C-Dependent Modifications
Vitamin K-Dependent Modifications
Selenoproteins
Ubiquitin and Targeted Protein Degradation
Protein Modification by SUMO
Return to The Medical Biochemistry Page
Secreted and Membrane-Associated Proteins
Proteins that are membrane bound or are destined for excretion are synthesized by ribosomes associated with the membranes of the endoplasmic reticulum (ER). The ER associated with ribosomes is termed rough ER (RER). This class of proteins all contain an N-terminus termed a signal sequence or signal peptide. The signal peptide is usually 13-36 predominantly hydrophobic residues. The signal peptide is recognized by a multi-protein complex termed the signal recognition particle (SRP). This signal peptide is removed following passage through the endoplasmic reticulum membrane. The removal of the signal peptide is catalyzed by signal peptidase. Proteins that contain a signal peptide are called preproteins to distinguish them from proproteins. However, some proteins that are destined for secretion are also further proteolyzed following secretion and, therefore contain pro sequences. This class of proteins is termed preproproteins.

Mechanism of synthesis of membrane bound or secreted proteins. Ribosomes engage the ER membrane through interaction of the signal recognition particle, SRP in the ribosome with the SRP receptor in the ER membrane. As the protein is synthesized the signal sequence is passed through the ER membrane into the lumen of the ER. After sufficient synthesis the signal peptide is removed by the action of signal peptidase. Synthesis will continue and if the protein is secreted it will end up completely in the lumen of the ER. If the protein is membrane associated a stop transfer motif in the protein will stop the transfer of the protein through the ER membrane. This will become the membrane spanning domain of the protein.
back to the top
Proteolytic Cleavage
Most proteins undergo proteolytic cleavage following translation. The simplest form of this is the removal of the initiation methionine. Many proteins are synthesized as inactive precursors that are activated under proper physiological conditions by limited proteolysis. Pancreatic enzymes and enzymes involved in clotting are examples of the latter. Inactive precursor proteins that are activated by removal of polypeptides are termed proproteins.
A complex example of post-translational processing of a preproprotein is the cleavage of prepro-opiomelanocortin (POMC) synthesized in the pituitary (see the Peptide Hormones page for discussion of POMC). This preproprotein undergoes complex cleavages, the pathway of which differs depending upon the cellular location of POMC synthesis.
Another is example of a preproprotein is insulin. Since insulin is secreted from the pancreas it has a prepeptide. Following cleavage of the 24 amino acid signal peptide the protein folds into proinsulin. Proinsulin is further cleaved yielding active insulin which is composed of two peptide chains linked togehter through disulfide bonds.
Still other proteins (of the enzyme class) are synthesized as inactive precursors called zymogens. Zymogens are activated by proteolytic cleavage such as is the situation for several proteins of the blood clotting cascade.
back to the topAcylation
Many proteins are modified at their N-termini following synthesis. In most cases the initiator methionine is hydrolyzed and an acetyl group is added to the new N-terminal amino acid. Acetyl-CoA is the acetyl donor for these reactions. Some proteins have the 14 carbon myristoyl group added to their N-termini. The donor for this modification is myristoyl-CoA. This latter modification allows association of the modified protein with membranes. The catalytic subunit of cyclicAMP-dependent protein kinase (PKA) is myristoylated.
back to the topMethylation
Post-translational methylation of proteins occurs on nitrogens and oxygens. The activated methyl donor is S-adenosylmethionine (SAM). The most common methylations are on the ε-amine of lysine residues. Methylation of lysine residues in histones in DNA is an important regulator of chromatin structure and consequently of transcriptional activity. Lysine methylation was originally thought to be a permanent covalent mark, providing long-term signaling, including the histone-dependent mechanism for transcriptional memory. However, recent evidence has shown that lysine methylation, similar to other covalent modifications, can be transient and dynamically regulated by an opposing de-methylation activity. Recent findings indicate that methylation of lysine residues affects gene expression not only at the level of chromatin, but also by modifying transcription factors.
Additional nitrogen methylations are found on the imidazole ring of histidine, the guanidino moiety of arginine and the R-group amides of glutamate and aspartate. Methylation of the oxygen of the R-group carboxylates of gutamate and aspartate also takes place and forms methyl esters. Proteins can also be methylated on the thiol R-group of cysteine.
As indicated below, many proteins are modified at their C-terminus by prenylation near a cysteine residue in the consensus CAAX. Following the prenylation reaction the protein is cleaved at the peptide bond of the cysteine and the carboxylate residue is methylated by a prenylated protein methyltransferase. One such protein that undergoes this type of modification is the proto-oncogene RAS.
back to the topPhosphorylation
Post-translational phosphorylation is one of the most common protein modifications that occurs in animal cells. The vast majority of phosphorylations occur as a mechanism to regulate the biological activity of a protein and as such are transient. In other words a phosphate (or more than one in many cases) is added and later removed.
Physiologically relevant examples are the phosphorylations that occur in glycogen synthase and glycogen phosphorylase in hepatocytes in response to glucagon release from the pancreas. Phosphorylation of synthase inhibits its activity, whereas, the activity of phosphorylase is increased. These two events lead to increased hepatic glucose delivery to the blood.
The enzymes that phosphorylate proteins are termed kinases and those that remove phosphates are termed phosphatases. Protein kinases catalyze reactions of the following type:
ATP + protein <——> phosphoprotein + ADP
In animal cells serine, threonine and tyrosine are the amino acids subject to phosphorylation. The largest group of kinases are those that phsophorylate either serines or threonines and as such are termed serine/threonine kinases. The ratio of phosphorylation of the three different amino acids is approximately 1000/100/1 for serine/threonine/tyrosine.
Although the level of tyrosine phosphorylation is minor, the importance of phosphorylation of this amino acid is profound. As an example, the activity of numerous growth factor receptors is controlled by tyrosine phosphorylation.
back to the topSulfation
Sulfate modification of proteins occurs at tyrosine residues. As many as 1% of all tyrosine residues present in the eukaryotic proteome are modified by sulfate addition making this the most common tyrosine modification. Tyrosine sulfation is accomplished via the activity of tyrosylprotein sulfotransferases (TPST) which are membrane-associated enzymes of the trans-Golgi network. There are two known TPSTs identified as TPST-1 and TPST-2. The universal sulfate donor for these TPST enzymes is 3'-phosphoadenosyl-5'-phosphosulphate (PAPS). Addition of sulfate occurs almost exclusively on secreted and trans-membrane spanning proteins. Since sulfate is added permanently it is necessary for the biological activity and not used as a regulatory modification like that of tyrosine phosphorylation.
At least 34 human proteins have been identified that are tyrosine sulfated although the total number that are predicted is much higher. In all vertebrates a total of 310 tyrosine sulfated proteins have been identified. It is predicted that the mouse proteome is likely to contain over 2000 tyrosine sulfated proteins. The addition of sulfate to tyrosine is believed to play a role in the modulation of protein-protein interactions of secreted and membrane-bound proteins. The process of tyrosine sulfation has been shown to be critical for the processes of blood coagulation, various immune functions, intracellular trafficking, and ligand recognition by several G-protein-coupled receptors (GPCRs). Some well-known tyrosine sulfated proteins are the coagulation protein factor VIII, and the gut peptides gastrin and cholecystokinin (CCK).
back to the topPrenylation
Prenylation refers to the addition of the 15 carbon farnesyl group or the 20 carbon geranylgeranyl group to acceptor proteins, both of which are isoprenoid compounds derived from the cholesterol biosynthetic pathway. The isoprenoid groups are attached to cysteine residues at the carboxy terminus of proteins in a thioether linkage (C-S-C). A common consensus sequence at the C-terminus of prenylated proteins has been identified and is composed of CAAX, where C is cysteine, A is any aliphatic amino acid (except alanine) and X is the C-terminal amino acid. In order for the prenylation reaction to occur the three C-terminal amino acids (AAX) are first removed. Following attachment of the prenyl group the carboxylate of the cysteine is methylated in a reaction utilizing S-adenosylmethionine as the methyl donor.
In addition to numerous prenylated proteins that contain the CAAX consensus, prenylation is known to occur on proteins of the RAB family of RAS-related G-proteins. There are at least 60 proteins in this family that are prenylated at either a CC or CXC element in their C-termini. The RAB family of proteins are involved in signaling pathways that control intracellular membrane trafficking.
Some of the most important proteins whose functions depend upon prenylation are those that modulate immune responses. These include proteins involved in leukocyte motility, activation, and proliferation and endothelial cell immune functions. It is these immune modulatory roles of many prenylated proteins that are the basis for a portion of the anti-inflammatory actions of the statin class of cholesterol synthesis-inhibiting drugs due to a reduction in the synthesis of farnesylpyrophosphate and geranylpyrophosphate and thus reduced extent of inflammatory events. Other important examples of prenylated proteins include the oncogenic GTP-binding and hydrolyzing protein RAS and the γ-subunit of the visual protein transducin, both of which are farnesylated. In addition, numerous GTP-binding and hydrolyzing proteins (termed G-proteins) of signal transduction cascades have γ-subunits modified by geranylgeranylation.
back to the topVitamin C-Dependent Modifications
Modifications of proteins that depend upon vitamin C as a cofactor include proline and lysine hydroxylations and carboxy terminal amidation. The hydroxylating enzymes are identified as prolyl hydroxylase and lysyl hydroxylase. The donor of the amide for C-terminal amidation is glycine. The most important hydroxylated proteins are the collagens. Several peptide hormones such as oxytocin and vasopressin have C-terminal amidation.
back to the topVitamin K-Dependent Modifications
Vitamin K is a cofactor in the carboxylation of glutamic acid residues. The result of this type of reaction is the formation of a γ-carboxyglutamate (gamma-carboxyglutamate), referred to as a gla residue.

Structure of a gla residue
The formation of gla residues within several proteins of the blood clotting cascade is critical for their normal function. The presence of gla residues allows the protein to chelate calcium ions and thereby render an altered conformation and biological activity to the protein. The coumarin-based anticoagulants, warfarin and dicumarol function by inhibiting the carboxylation reaction.
back to the topSelenoproteins
Selenium is a trace element and is found as a component of several prokaryotic and eukaryotic enzymes that are involved in redox reactions. The selenium in these selenoproteins is incorporated as a unique amino acid, selenocysteine, during translation. A particularly important eukaryotic selenoenzyme is glutathione peroxidase. This enzyme is required during the oxidation of glutathione by hydrogen peroxide (H2O2) and organic hydroperoxides.

Structure of the selenocysteine residue
Incorporation of selenocysteine by the translational machinery occurs via an interesting and unique mechanism. The tRNA for selenocysteine is charged with serine and then enzymatically selenylated to produce the selenocysteinyl-tRNA. The anticodon of selenocysteinyl-tRNA interacts with a stop codon in the mRNA (UGA) instead of a serine codon. The selenocysteinyl-tRNA has a unique structure that is not recognized by the termination machinery and is brought into the ribosome by a dedicated specific elongation factor. An element in the 3' non-translated region (UTR) of selenoprotein mRNAs determines whether UGA is read as a stop codon or as a selenocysteine codon.
back to the topUbiquitin and Targeted Protein Degradation
Proteins are in a continual state of flux, being synthesized and degraded. In addition, when proteins become damaged they must be degraded to prevent aberrant activities of the defective proteins and/or other proteins associated with those that have been damaged. One of the major mechanisms for the destruction of cellular proteins involves a complex structure referred to as the proteosome. In eukayotic cells the proteosome is found in the cytosol and the nucleus and has a large mass such that it has a sedimentation coefficient of 26S. The 26S proteosome comprises a 20S barrel-shaped catalytic core as well as 19S regulatory complexes at both ends. Degradation of proteins in the proteosome occurs via an ATP-dependent mechanism.
Proteins that are to be degraded by the proteosome are first tagged by attachment of multimers of the 76 amino acid polypeptide ubiquitin. Many proteins involved in cell cycle regulation, control of proliferation and differentiation, programmed cell death (apoptosis), DNA repair, immune and inflammatory processes and organelle biogenesis have been discovered to undergo regulated degradation via the 26S proteosome. Of clinical significance are the recent findings that deregulation of the functions of the proteosome can contribute to the pathogenesis of various human diseases such as cancer, myeloproliferative diseases, and neurodegenerative diseases.
The degradation of proteins via the 26S proteosome involves a two-step process starting with ubiquitination of the protein followed by entry into and degradation by the proteosome complex with release of ubiquitin monomers that can be re-used to tag additional proteins. The process of ubiquitin addition to the substrate protein involves multiple ubiquitin additions such that the targeted protein is polyubiquitinated.
Attachment of ubiquitin involves a series of three enzyme activities. The first, identified as E1 (also called ubiquitin-activating enzymes), activates ubiquitin in an ATP-dependent manner such that the ubiquitin is bound to the E1 enzyme via a high-energy thiol ester. The next class of enzyme, referred to as E2 (also called ubiquitin-conjugating enzymes or ubiquitin-carrier proteins), transfers the ubiquitin via an E2 thiol ester intermediate to the substrate protein. The substrate proteins are recognizable by E2 because they are bound by the third class of enzyme called E3 or ubiquitin-protein ligases. The E3 enzymes carry out the final step in the process which is the covalent attachment of ubiquitin to the substrate protein. The ubiquitin is generally transferred to the ε-amino group of an internal lysine residue in the substrate protein. There are however, examples where the ubiquitin is attached to the N-terminal amino group in a substrate protein. Whereas, ubiquitination targets proteins for degradation in the proteosome there needs to be a mechanism to ensure that inappropriately tagged proteins, i.e. those that are not destined for degradation, can be untagged. There are a family of enzymes called isopeptidases that carry out this vital function of removing ubiquitin from from proteins to which it is mistakenly attached.

Process of ubiquitination and proteosome-mediated protein degradation
Inactivation of a critical activity such as that catalyzed by the E1 enzymes results in lethality. However, there are numerous pathological states that can be attributed, in part, to mutations in recognition motifs in ubiquitination substrates and enzymes in the ubiquitination process. Disease states associated with the ubiquitin modification system can be classified into two groups. One group results from a loss of function mutation in a ubiquitin system enzyme or target protein that results in stabilization of the protein that should normally be degraded. The other group results from gain of function mutations that result in abnormal or accelerated degradation of target proteins. The most obvious disease state that could be expected to arise as a result of defective ubiquitination processes is cancer. In fact, many malignancies are known to result from defective ubiquitin-mediated degradation of growth promoting proteins such as FOS, MYC, and SRC. Likewise, inappropriate degradation of key regulators of cell cycle progression such as the tumor suppressor p53 and p27KIP1 (CDKN1B), which is an inhibitor of cyclin-dependent kinases (CDKs which control progression through the cell cycle) is also associated with various types of cancer. In addition to cancers, defective ubiquitination is found associated with neurodegenerative diseases such as Parkinson disease, Alzheimer disease, and amyotrophic lateral sclerosis.
back to the topProtein Modification by SUMO
As discussed in the above section, many proteins are post-translationally modified via the addition of ubiquitin. Over the past several years numerous ubiquitin-like (Ubl) proteins have been identified. Like ubiquitin, Ubls are added to other proteins via post-translational reactions. However, unlike ubiquitination which targets proteins for degradation in the proteosome, Ubl modifications do not. Although there are several types of Ubls, those with the broadest range of functions and the largest number of known substrates are members of the SUMO (small ubiquitin-related modifier) proteins. There have been over 50 proteins, that function in a variety of different capacities within cells, shown to be modified by SUMOylation. Modification of proteins by SUMO addition has been shown to occur in all tissues and at all developmental stages.
There are four SUMO proteins in mammalian cells designated SUMO-1 to SUMO-4. SUMO-1 is also identified as UBL1, PIC1 [promyelocytic leukemia (PML) interacting clone 1], sentrin, GMP1 [GTPase activating protein (GAP) modifying protein 1], and Smt3c (suppressor of mif two 3 homolog 1c). SUMO-2 is also identified as sentrin 2, Smt3b, and GMP-related protein. SUMO-3 is also identified as sentrin 3 and Smt3a. SUMO-2 and SUMO-3 differ only at three N-terminal amino acid residues and are, therefore, often referred to collectively as SUMO-2/3. SUMO-2/3 share only 50% amino acid similarity to SUMO-1. SUMO-4 is based upon DNA sequence analysis and exhibits 87% amino acid similarity to SUMO-2. It is believed that the SUMO-4 gene is actually a pseudogene since it does not contain any introns. Although SUMO-4 mRNA has been detected in tissues such as spleen and lymph nodes, no protein product has been detected.
Following de novo synthesis, SUMO proteins must undergo a C-terminal cleavage processing event in order to render the proteins biologically active. The C-terminal cleavage reactions are catalyzed by a family of proteins called SENP (sentrin/SUMO-specific protease). There are at least six SENP proteins in mammals. The removal of the C-terminal amino acids reveals a di-glycine (–G–G) motif that allows the SUMO protein to subsequently conjugate to lysine (K) residues in target proteins. SUMO conjugation to target proteins requires a series of enzymatic steps that is similar in mechanism to the conjugation of ubiquitin to target proteins. SUMO proteins are initially activated via an ATP-dependent reaction catalyzed by the E1 activating complex. This complex is a heterodimer composed of SUMO-activating enzyme E1 (SAE1) and SAE2. This activating reaction forms a covalent bond between an active site cysteine in SAE2 and the C-terminal glycine of the SUMO protein. The next step involves the transfer of the SUMO protein to an active site cysteine of the protein identified as Ubc9 (ubiquitin-conjugating 9). As yet Ubc9 is the only known SUMO-conjugating enzyme in mammalian tissues. Ubc9 brings the SUMO protein to the target protein by recognizing, and binding to, the consensus SUMOylation motif in target proteins. The consensus motif for SUMOylation is the following: ΨKxD/E where Ψ is a large hydrophobic amino acid and x is any amino acid. Approximately 75% of all SUMOylated proteins contain this target motif, however, not all proteins that contain this motif are SUMOylated and some proteins are SUMOylated on lysine residues that do not lie in this motif.
As described above for ubiquitination, there are three classes of enzyme involved in the process: E1, E2, and E3. Since Ubc9 can directly conjugate SUMO proteins to their targets it was thought that no E3-like activity was required. However, SUMO E3 ligases have been identified and although they do not function directly in the enzymatic process of SUMO attachment to target they do act as a scaffold. SUMO E3 ligases bring Ubc9-SUMO complexes into contact with target proteins. The mammalian SUMO E3 proteins are members of the PIAS [protein inhibitor of activated STAT (signal transducer and activator of transcription)] family of proteins. There are currently five members of the mammalian PIAS protein family.
Whereas, ubiquitination of a protein results in is destruction in the proteosome, SUMOylation is a dynamic process and once attached the SUMO residue can be removed. Removal of SUMO from a target protein is accomplished by the same SENP enzymes that are required for the activation step of SUMO processing. SENP1 and SENP2 have broad specificity for SUMO-1 and SUMO-2/3 and are involved in their processing and deconjugation. SENP3 and SENP5 exhibit a preference for SUMO-2/3. Although SENP6 and SENP7 exhibit the same preference for SUMO-2/3 they are only minimally involved in deconjugation reactions. The primary functions of SENP6 and SENP7 are in editing the length of poly-SUMO-2/3 chains on target proteins. Therefore, it seems clear that SENP1 and SENP2 are responsible for maturation of SUMO proteins and deconjugation of SUMO-1 and SUMO-2/3 conjugated targets. SENP3 and SENP5 function in the removal of monomeric SUMO-2/3 from target proteins. SENP6 and SENP7 function as editors of SUMO-2/3 chains in target proteins.
The exact functional consequences of SUMOylation of a particular target protein is difficult to predict. However, modification of target proteins by SUMO addition is likely to lead to at least three non-mutually exclusive effects. The attachment of SUMO can result in the masking of a site in the target protein that is required for binding or interaction with a substrate protein. The addition of SUMO may alternatively result in the formation of an attachment site allowing for the recruitment of proteins that can now interact with the SUMOylated protein. The third consequence could be that the conformation of the SUMOylated protein is altered such that activity is regulated in some way.
Mammalian Substrates for SUMO
The following Table is not intended to represent a complete list of all known SUMO target proteins it is just a representative list.
| Protein Symbol & Name | Protein Function | Role of SUMOylation |
| Androgen receptor | transcriptional activation | reduces the transcriptional activation activity of the receptor |
|
GLUT1 glucose transporter 1 |
glucose transport | exact consequence not fully defined but GLUT1 protein levels are down-regulated by Ubc9 |
|
GLUT4 glucose transporter 4 |
glucose transport | exact consequence not fully defined but GLUT4 protein levels are up-regulated by Ubc9 |
|
HIPK2 homeodomain-interacting protein kinase 2 |
transcriptional co-repression | mediates the localization of HIPK2 nuclear bodies, also called promyelocytic leukemia (PML) bodies |
|
IκBα inhibitory κBα |
inhibitor of NF-κB (nuclear factor κB) signal transduction | inhibits ubiquitination of IκBα thereby blocking NF-κB activity |
|
Mdm2 originally isolated from mouse tumorigenic cell line 3T3DM |
E3 ubiquitin ligase for tumor suppressor p53 | inhibits ubiquitylation of Mdm2 resulting in activation of the E3 function of Mdm2 |
| p53 | tumor suppressor, is a transcription factor activated in response to DNA damage | activates p53 transactivation leading to increased apoptosis |
|
PML promyelocytic leukemia |
tumor suppressor | allows for the formation of nuclear bodies and the recruitment p53 |
|
Topo I topoisomerase I |
topoisomerase involved in DNA replication and repair | exact consequence not fully defined but SUMOylation is induced after DNA damage with camptothecin |
|
Topo II topoisomerase II |
topoisomerase involved in DNA replication and repair | exact consequence not fully defined but SUMOylation is induced after DNA damage with teniposide |
back to the top
Return to The Medical Biochemistry Page
Michael W King, PhD | © 1996–2012 themedicalbiochemistrypage.org, LLC | info @ themedicalbiochemistrypage.org
Last modified: March 10, 2012
![]() |
||||||
|
