User:Jeremiah C Hagler/Protein 1

From Proteopedia

< User:Jeremiah C Hagler(Difference between revisions)
Jump to: navigation, search
Current revision (20:41, 24 August 2020) (edit) (undo)
(Introduction to Computer-Aided Protein Visualization Lab)
 
(9 intermediate revisions not shown.)
Line 1: Line 1:
== Introduction to Computer-Aided Protein Visualization Lab ==
== Introduction to Computer-Aided Protein Visualization Lab ==
<StructureSection load='1pgb' size='340' side='right' caption='This simple protein, B1 Immunoglobulin-binding domain of Streptococcal protein G, shows secondary structures nicely. The alpha helix is red, beta sheet in yellow.' scene='71/713432/Protein_secondary_structure/1'>
<StructureSection load='1pgb' size='340' side='right' caption='This simple protein, B1 Immunoglobulin-binding domain of Streptococcal protein G, shows secondary structures nicely. The alpha helix is red, beta sheet in yellow.' scene='71/713432/Protein_secondary_structure/1'>
 +
== Computer-Aided Protein Visualization Lab ==
== Computer-Aided Protein Visualization Lab ==
Knowing the three-dimensional structure of a protein can be a very powerful tool for biologists. Much can be learned about enzyme function, interaction of molecules in your immune system, the appearance of the surface of viruses, and the interaction of ligands and receptors.
Knowing the three-dimensional structure of a protein can be a very powerful tool for biologists. Much can be learned about enzyme function, interaction of molecules in your immune system, the appearance of the surface of viruses, and the interaction of ligands and receptors.
Line 6: Line 7:
== First some background: (make sure that you understand the underlined words)==
== First some background: (make sure that you understand the underlined words)==
-
Proteins are synthesized on ribosomes by linking together many amino acids into a long chain. If you could observe a protein as it is made, it would look like a string of pearls (amino acids) feeding out the end of the ribosome as it floats in the cytoplasm of the cell [https://www.dnalc.org/view/15501-Translation-RNA-to-protein-3D-animation-with-basic-narration.html Video of Translation (DNALC)]. This structure is called the primary structure (or 1° structure) and refers to the sequence of amino acids of the protein. After protein synthesis has started, two choices are possible:
+
Proteins are synthesized on ribosomes by linking together many amino acids into a long chain. If you could observe a protein as it is made, it would look like a string of pearls (amino acids) feeding out the end of the ribosome as it floats in the cytoplasm of the cell ([https://www.dnalc.org/view/15501-Translation-RNA-to-protein-3D-animation-with-basic-narration.html Video of Translation (DNALC)]). This structure is called the primary structure (or 1° structure) and refers to the sequence of amino acids of the protein. After protein synthesis has started, two choices are possible:
(1) if the protein is destined to be secreted or to reside in an organelle of the secretory pathway, the first twenty or so amino acids will comprise a signal sequence. These act to direct the ribosome to the endoplasmic reticulum (ER) where the protein will be fed through a channel in the membrane into the interior of the ER. Once inside the ER, the protein will fold and receive sugar modifications called glycosylations.
(1) if the protein is destined to be secreted or to reside in an organelle of the secretory pathway, the first twenty or so amino acids will comprise a signal sequence. These act to direct the ribosome to the endoplasmic reticulum (ER) where the protein will be fed through a channel in the membrane into the interior of the ER. Once inside the ER, the protein will fold and receive sugar modifications called glycosylations.
Line 16: Line 17:
The most important rule about protein structure is that it is determined by the primary sequence of the protein. Protein folding is a complicated multi-step process. The first step results in the secondary structure (or 2o structure) of the protein. Secondary structures come in two flavors: alpha helices and beta sheets (or beta-pleated sheets). Alpha helices are spiral staircase structures (see structure 1 below), and beta-pleated sheets are flat regions where the amino acids run back and forth next to each other in long ribbons (see structure 2 below). These two structures form spontaneously based on the shape/hydrophobicity/charges of the amino acids and are held together by hydrogen bonds. The protein will now look like a string of pearls with twists or zig-zags at intervals along its length.
The most important rule about protein structure is that it is determined by the primary sequence of the protein. Protein folding is a complicated multi-step process. The first step results in the secondary structure (or 2o structure) of the protein. Secondary structures come in two flavors: alpha helices and beta sheets (or beta-pleated sheets). Alpha helices are spiral staircase structures (see structure 1 below), and beta-pleated sheets are flat regions where the amino acids run back and forth next to each other in long ribbons (see structure 2 below). These two structures form spontaneously based on the shape/hydrophobicity/charges of the amino acids and are held together by hydrogen bonds. The protein will now look like a string of pearls with twists or zig-zags at intervals along its length.
-
1. <StructureSection load='1pgb' size='340' side='right' caption='Here is an alpha helix. The protein backbone is red, the amino acid side chains yellow' scene='71/713432/Protein_secondary_structure/3'>
+
1. <scene name='71/713432/Protein_secondary_structure/3'>Click to see alpha helix</scene>
<br>
<br>
2. <scene name='71/713432/Protein_secondary_structure_bs/2'>Click to see beta sheet</scene>
2. <scene name='71/713432/Protein_secondary_structure_bs/2'>Click to see beta sheet</scene>
Line 22: Line 23:
<br>
<br>
The second step of protein folding results in the tertiary structure (or 3° structure). Tertiary structure gives the protein an overall three-dimensional structure. The tertiary structure of a protein is determined by a combination of factors including hydrogen bonds, ionic bonds (between positively and negatively charged amino acids), covalent disulfide bonds (between cysteine residues), and Van der Waals interactions. Tertiary structure can also be affected by repulsive forces between similarly charged amino acids, as well as hydrophobic and hydrophilic interactions with a solvent (commonly water). At a distance many proteins form what look to be large globs at this point, and it is only upon more careful and close up inspection that one can see the true uniqueness of the shape.
The second step of protein folding results in the tertiary structure (or 3° structure). Tertiary structure gives the protein an overall three-dimensional structure. The tertiary structure of a protein is determined by a combination of factors including hydrogen bonds, ionic bonds (between positively and negatively charged amino acids), covalent disulfide bonds (between cysteine residues), and Van der Waals interactions. Tertiary structure can also be affected by repulsive forces between similarly charged amino acids, as well as hydrophobic and hydrophilic interactions with a solvent (commonly water). At a distance many proteins form what look to be large globs at this point, and it is only upon more careful and close up inspection that one can see the true uniqueness of the shape.
-
 
+
<br>
-
+
[[Image:Intramolecular forces in tertiary structures.png]]
Figure 3: The various intramolecular interactions that help determine teriary structure
Figure 3: The various intramolecular interactions that help determine teriary structure
Line 31: Line 32:
Parts of the secondary and tertiary structures of a protein are usually arranged to form domains, functional units associated with a particular structure. For example, a pair of alpha helices situated side by side might form a binding site, or a particular folding pattern might form the active site of an enzyme, where it binds to its substrate, or the site at which it binds to a coenzyme such as NAD+. The structure of the domain (though not necessarily the exact amino acid sequence) is frequently preserved in different proteins from the same organism that have a similar function (to move phosphate groups, for instance). Domains are also conserved in proteins from different species that have the same function (such as hemoglobins for oxygen transport or cytochromes in the electron transfer system of mitochondria). Variations in the amino acid sequences in similar domains (or in the nucleotide sequences or genes that code for the proteins) give important clues about evolutionary relationships between organisms. Individual domains are sometimes found (but not always, a fact that makes this a very controversial topic) contained within single exons of eukaryotic genes. In other words, a single exon might represent all of the protein coding sequence required to generate a functional domain within the context of the whole protein structure. This finding has implications for the evolution of eukaryotic genes, since it implies that new proteins can be generated by simply duplicating preexisting protein domain encoding exons and recombining them into new combinations (a process known as exon-shuffling). Thus, a vast variety of proteins with new functions can be generated from preexisting genes, allowing great evolutionary flexibility. Looking at the genes of many eukaryotic organisms shows that this is exactly what appears to happen.
Parts of the secondary and tertiary structures of a protein are usually arranged to form domains, functional units associated with a particular structure. For example, a pair of alpha helices situated side by side might form a binding site, or a particular folding pattern might form the active site of an enzyme, where it binds to its substrate, or the site at which it binds to a coenzyme such as NAD+. The structure of the domain (though not necessarily the exact amino acid sequence) is frequently preserved in different proteins from the same organism that have a similar function (to move phosphate groups, for instance). Domains are also conserved in proteins from different species that have the same function (such as hemoglobins for oxygen transport or cytochromes in the electron transfer system of mitochondria). Variations in the amino acid sequences in similar domains (or in the nucleotide sequences or genes that code for the proteins) give important clues about evolutionary relationships between organisms. Individual domains are sometimes found (but not always, a fact that makes this a very controversial topic) contained within single exons of eukaryotic genes. In other words, a single exon might represent all of the protein coding sequence required to generate a functional domain within the context of the whole protein structure. This finding has implications for the evolution of eukaryotic genes, since it implies that new proteins can be generated by simply duplicating preexisting protein domain encoding exons and recombining them into new combinations (a process known as exon-shuffling). Thus, a vast variety of proteins with new functions can be generated from preexisting genes, allowing great evolutionary flexibility. Looking at the genes of many eukaryotic organisms shows that this is exactly what appears to happen.
-
A good example of all of these principles can be found in immunoglobulins (see figure 4). They are protein molecules that form one of the main lines of defense against foreign organism invasion of the body and are part of the humoral immune response (this is the branch of the immune system that is activated when you are given a vaccine). They are made up of four subunits: two identical heavy chains and two identical light chains. Immunoglobulins are divided into several domains, including the 2 variable domains on the tips of the “Y” arms and are involved in binding specific antigens, and the constant domains which make up the rest of the molecule. The constant domains serve to determine the type of antibody (IgG, IgM, IgA, etc) the molecule represents and to mediate the response of the immune system to the antibody tagged antigen. Each of these domains is defined by it’s own exon within the immunoglobulin gene structure.
+
A good example of all of these principles can be found in immunoglobulins (see figure 4). They are protein molecules that form one of the main lines of defense against foreign organism invasion of the body and are part of the humoral immune response (this is the branch of the immune system that is activated when you are given a vaccine). They are made up of four subunits: two identical heavy chains and two identical light chains. Each is synthesized as an individual protein and then later complexed into the complex secondary, tertiary and quatenary structure you see below. Immunoglobulins are divided into several domains, including the 2 variable domains on the tips of the “Y” arms and are involved in binding specific antigens, and the constant domains which make up the rest of the molecule. The constant domains serve to determine the type of antibody (IgG, IgM, IgA, etc) the molecule represents and to mediate the response of the immune system to the antibody tagged antigen. Each of these domains is defined by it’s own exon within the immunoglobulin gene structure.
-
+
Figure 4: Immunoglobulins. Three views of the immunoglobulin complex IgG.
-
Figure 4: Immunoglobulins. Three views of the immunoglobulin complex IgG. On top is a schematic of an immunoglobulin complex showing the two light chains, the two heavy chains and the variable and constant domains. In the middle is a space filling model based on an X-ray crystal structure. This clearly shows the 4 subunits that make up antibodies, and how they are divided into functional (and structural) domains. On the bottom is the same structure represented in a ribbon diagram that shows secondary structure. The flat ribbons show beta-sheets while the cylindrical barrels represent alpha-helices. The interaction of secondary structure to form tertiary structure, and the interaction of these structures to form quaternary structure is apparent.
+
<br>
-
+
a. [[Image:Antibody schematic.png|300px|]]
-
The final step of protein folding results in quarternary structure (or 4° structure). This step is only taken in proteins that are made of multiple subunits; meaning that strands of proteins - coded for on separate mRNAs and synthesized independently - come together to form a single functional molecule. Many proteins have multiple subunits; for example, immunoglobulins are made up of four subunits (figure 4).
+
<br>
 +
This is a schematic of an immunoglobulin complex showing the two light chains, the two heavy chains and the variable and constant domains.
 +
<br>
 +
b. [[Image:Antibody spacefilling diagram.jpg|300px|]]
 +
<br>
 +
This is a space filling model based on an X-ray crystal structure. This clearly shows the 4 subunits that make up antibodies, and how they are divided into functional (and structural) domains.
 +
<br>
 +
c. <scene name='71/713432/Antibody/1'>3-Dimensional structure of IgG antibody</scene>This is the same structure represented in a ribbon diagram that shows secondary structure. The flat ribbons show beta-sheets while the cylindrical barrels represent alpha-helices. The two heavy chains are in blue, the two light chains in green. The variable domains are the section of the structure where the light chains interact with the heavy chain. This is where antibodies bind to antigen. The constant domain consists of the region where the two heavy chains interact with each other. The interaction of secondary structure to form tertiary structure, and the interaction of these structures to form quaternary structure is apparent. The final step of protein folding results in quarternary structure (or 4° structure). This step is only taken in proteins that are made of multiple subunits; meaning that strands of proteins - coded for on separate mRNAs and synthesized independently - come together to form a single functional molecule. Many proteins have multiple subunits; for example, immunoglobulins are made up of four subunits (figure 4).
-
== Relevance ==
+
== Determining the 3-Dimensional Structure of a Protein ==
 +
Scientists can use several techniques to observe the folding of a protein.
-
== Structural highlights ==
+
(1) The most commonly used technique is called x-ray crystallography. This technique requires the scientist to form crystals of the protein of interest - very similar to how you can form sugar crystals by dangling a string in a super-saturated sucrose solution! The crystal is then bombarded with x-rays, and the diffraction pattern of the x-rays is recorded on film. By analyzing the diffraction pattern, the spacing of atoms in the protein can be determined. Rosalind Franklin also used this technique on DNA crystals; her diffraction pictures were in turn used by James Watson and Francis Crick to determine the double-helix shape of DNA.
-
This is a sample scene created with SAT to <scene name="/12/3456/Sample/1">color</scene> by Group, and another to make <scene name="/12/3456/Sample/2">a transparent representation</scene> of the protein. You can make your own scenes on SAT starting from scratch or loading and editing one of these sample scenes.
+
(2) A second technique used is NMR, or Nuclear Magnetic Resonance, (also called MRI in medicine). In this technique proteins are placed in a magnetic field. The resonance frequency of the field can be varied. Different atoms in different chemical environments will absorb maximally at different frequencies. By viewing a spectrum of absorbance vs. resonance frequency, it is possible to specify the identity of atoms and their location with the protein. This technique is particularly useful where it can detect movement in molecules as proteins fold and/or as they bind with other molecules.
 +
 
 +
(3) A third technique that has been developed is the use of computers to simulate protein folding strategies. Originally programs were developed to allow scientists to predict the structural effect of a relatively small change in a protein sequence. The computer will look at the three-dimensional structure, as determined by x-ray crystallography or NMR, of a closely related protein (a homologue from another species or a slight variant from the same species) and predict what the effect of the amino acid changes would be. This process is done by having the computer determine the "lowest energy configuration" of the protein - or simply put, which folding of the protein puts the least stress on the molecule. It looks to make sure that two amino acids will not be pushing into each other, that two similarly charged amino acids will not be opposing each other, that hydrogen bonds and disulfide bonds are formed where they can be, etc.
 +
 
 +
New programs aim to predict the three-dimensional structure of proteins from scratch - where no known homologue has ever been studied. This technique is quite powerful because forming crystals of many proteins is hard, if not impossible. Instead, these programs start at the same point that protein folding starts in the cell. They take the primary sequence of the protein and look for the correct sequences of amino acids to form alpha helices and beta-pleated sheets. Once these are in place, the program searches through for tertiary structures that obey the "lowest energy configuration" rules.
 +
 
 +
== Viewing the 3D structure of a protein ==
 +
This now brings us back to where we started at the top of page 1 - once we know the structure, how can we look at it? There are many computer programs in existence to visualize proteins in three-dimensions. For the drug design studies, powerful computers and programs are necessary to analyze the energetics of drug fit. (Simulating structures and their interactions is a powerful “weeding out” tool in deciding which drugs to test in laboratory studies, making the development of new drugs more efficient and less costly.)
 +
 
 +
For this lab you will be working with a very commonly used computer modeling program known as Jsmol to look at "pdb" files of proteins with a known structure (using the methods outlined above). Jsmol is an open-source JavaScript viewer for chemical structures in 3D: http://wiki.jmol.org/index.php/JSmol. We are using the website proteopedia.org as a wiki host site to house and access these exercises.
 +
 
 +
 
 +
== Preliminary Questions ==
 +
Answer the next few questions to review your organic molecule knowledge and to get up to speed quickly.
 +
<br>
 +
1. What are the four major organic molecule groups?
 +
<br>
 +
2. How many of the other three groups can you find in association with proteins? Where in a cell do these interactions occur?
 +
<br>
 +
3. Draw the structure of an amino acid. Identify the position of the alpha carbon (called ca in the MAGE program). Identify the location of the side chain.
 +
<br>
 +
4. What is an enzyme?
 +
<br>
 +
5. Draw a schematic figure to show the progression of a protein from simple primary structure to the most complex tertiary structure.
 +
<br>
 +
Make sure you answer all questions in the lab. Questions marked by an * are more complicated and may require outside reading and thinking time to answer completely.
 +
<br>
 +
== Using Jmol through proteopedia.org ==
 +
A VERY BRIEF GUIDE TO USING Jmol
 +
 
 +
Logon to your computer, then navigate to the following website:
 +
 
 +
All of the structures hosted these sites initially are viewable in the small window on the right side of the page. However, if you click on "popup" in the lower left of that window, a larger window containing the structure will become visible. This window can be resized to any size by dragging the window edges to desirable dimensions.
 +
<br>
 +
TURNING THE PROTEIN: Once you have the molecule displayed on the black screen, try rotating the molecule by placing the pointer on the screen, holding down the mouse button and dragging the mouse. Move the mouse slowly so that you can see the molecule turn. If you place the pointer in the center of the field and drag left/right, the protein will rotate around the y-axis. If you place the pointer in the center of the field and drag up/down, the protein will rotate around the x-axis. If you place the pointer at the top of the field and drag left/right, the protein will rotate around the z-axis.
 +
 +
TO VIEW SELECTED GROUPS (carbon backbones or side chains of amino acids): Make sure the protein is not spinning (click on "toggle spin" in the bottom left corner of the image window to stop spinning). Then, simply move your cursor to any portion of the molecule you want to identify. After a second or two, a small window should popup containing a chain of information--a three letter code (amino acid) followed by a number (the position of the amino acid in the protein primary structure), a chain designation (indicating which protein chain/subunit you are pointing at) and then the element you are pointing at (C,H,N,O,P,S,Zn,Mg,Ca,etc).
 +
 
 +
== Here are the structures you will want to examine ==
 +
1.[http://proteopedia.org/wiki/index.php/User:Jeremiah_C_Hagler/Sandbox_2 Protein 1: MHC Class I]
 +
<br>
 +
2.[http://proteopedia.org/wiki/index.php/User:Jeremiah_C_Hagler/Sandbox_3 Protein 3: HIV Reverse Transcriptase]
 +
<br>
 +
3.[http://proteopedia.org/wiki/index.php/User:Jeremiah_C_Hagler/Sandbox_1 Protein 2: Alkaline Phosphatase]
 +
<br>
-
</StructureSection>
 
-
== References ==
 
<references/>
<references/>

Current revision

Introduction to Computer-Aided Protein Visualization Lab

This simple protein, B1 Immunoglobulin-binding domain of Streptococcal protein G, shows secondary structures nicely. The alpha helix is red, beta sheet in yellow.

Drag the structure with the mouse to rotate

Proteopedia Page Contributors and Editors (what is this?)

Jeremiah C Hagler

Personal tools