Sandbox2O14

From Proteopedia

(Difference between revisions)
Jump to: navigation, search
Current revision (17:19, 28 April 2025) (edit) (undo)
 
(17 intermediate revisions not shown.)
Line 3: Line 3:
(NOTE TO ALL EDITORS: This page is part of a final project for a biochemistry lab at Elizabethtown College. Please do not edit this.)
(NOTE TO ALL EDITORS: This page is part of a final project for a biochemistry lab at Elizabethtown College. Please do not edit this.)
-
2O14 is a monomeric protein complex that originates from the bacterial species Bacillus subtilis and has a mass of 41.79 kDa.
+
2O14 is a monomeric protein complex that originates from the bacterial species Bacillus subtilis and has a mass of 41.79 kDa. Using InterPro this protein was assumed to be like Rhamnogalacturan acetylesterase, which falls under the SGNH Hydrolase Superfamily with structural and sequential similarities to lipases, esterases, along with similar functional domains to galactose-binding. With the research performed with current protein databases available, the evidence suggests that this protein removes acetyl groups from the backbone proteins by hydrolyzing the acetyl esters. It accomplishes this most likely through the catalytic triad of serine-histidine-aspartic acid.
-
It is a member of the SGNH Hydrolase Superfamily with structural and sequential similarities to esterases and lipases. Current evidence suggests it causes the hydrolysis of esters and/or acetyl groups on lipids/lipid-like molecules via a serine protease-like active site.
+
<StructureSection load='2O14' size='400' side='right' caption='3D Representation of 2O14’s structure.' scene=''>
-
 
+
-
<StructureSection load='4Q7Q' size='400' side='right' caption='3D Representation of 4Q7Q's structure' scene=''>
+
== Overview ==
== Overview ==
-
4Q7Q exists as a homodimer quaternary structure.F Analyzing primary and quaternary structures of 4Q7Q with SPRITE and Chimera revealed two chains identical in both shape and sequence. Each chain is 266 residues long, and the entire complex has a molecular weight of approximately 58.5 kDa.F
+
Through the 2024-2025 Spring Semester at Elizabethtown College, analysis of protein 2O14 was performed to determine its enzymatic function and protein class. Determining the function of unknown proteins is beneficial to help understand biological processes. For 2O14 in particular, this protein was found in ''Bacillus subtilis'' from soil and the gastoinestrial tract of ruminants and humans. Understanding its enzymatic function will pinpoint exactly how this organism interacts with the environment and humans.
-
4Q7Q proteins originate from bacterial species.G,H InterPro search results show how nearly every enzyme with similar sequencing to 4Q7Q is found in various bacteria, with a notable exception to eukaryotes.D Additionally, the PDB entry for 4Q7Q notes how it potentially can be found in Chitinophaga pinensis, a gram-negative bacterial species which can degrade chitin.G,H
+
2O14 exists as monomeric protein complex. Using SPRITE and Chimera revealed one chain that is very long and does not replicate the amino acid chain. This was further confirmed by looking at the X-ray structure of protein 2O14 and it is stating that there is only one unique protein chain. The chain is 366 residues long and the entire complex has a molecular weight of approximately 41.79kDa.
 +
 
 +
2O14 proteins originated from the bacterial species bacillus subtills. InterPro results showed how almost all similar enzymes came from bacterial species like the bacillus subtills. It was also found to not be in any viruses, but a small fraction could be found in archaea, eukaryote and other small organisms.
== Family and Superfamily ==
== Family and Superfamily ==
 +
The data found showed that 2O14 is like a Rhamnogalacturan_acetylesterase that falls under the SGNH Hydrolase protein super family. Using a SPRITE analysis, we found that the lower RMSD values match was with 1bwp_c01 a Platelet Act Factor with a value of 0.60 and matched with aspargine, aspartic acid and histidine residues, for the right-hand superposition. Looking at the left-hand superposition, the lowest RMSD value was 0.62 with 1xny_c01, propionyl-COA carboxylase complex B, and its matching residues were 1 alanine and two glycine groups.
-
Research shows that 4Q7Q is a member of the SGNH Hydrolase protein super family. BLAST and InterPro both suggested 4Q7Q’s inclusion in this family, and the known conserved residues seen from SPRITE analysis—Serine, Glycine, Asparagine, and Histidine—line up with those observed throughout this family.D,E Notably, this superfamily is also referred to as the GDSL Hydrolase superfamily.D,E
+
Using the data gotten from the DALI software, it was found that protein 1k7c or rhamnogalacturonan acetylesterase had the most matching amino acids, 98, creating a z score pf 25.1 create a 25.1% match to protein 2O14. Furthermore, using DALI stacked Pfam data graphics showed that protein 2O14 was a potential member of GDSL-like Lipase/Acylhyydrolase family. This would agree with data gather from InterPro that stated that 2O14 was most likely resembling a rhamnogalacturonan acetylesterase.
-
4Q7Q’s inclusion in this family also supports its SPRITE-derived hypothetical functionality. Rhamnogalacturonan Acetylesterase—the enzyme with one of the best SPRITE-based alignment relative to 4Q7Q—is a member of this family.F Proteins in this family are also known for containing a “unique hydrogen bond network that [stabilizes]” the active site.F
+
When using BLAST, the group was able to not only look at 2O14 amino acid sequence but also its taxonomy which showed that the protein was seen in the Bacillota superfamily which is a phylum of bacteria that have gram positive cell walls. Furthermore, for this phylum of bacteria it is very common for them to contain multiple hydrolases including SGNN hydrolases like 2O14.
-
Regarding what protein family 4Q7Q belongs to, DALI results suggest it is a part of a sub-family of the greater GDSL/SGNH superfamily. A PDB90% DALI search labels 4Q7Q as a part of the “Lipolytic Protein G-D-S-L Family,” which refers to enzymes that hydrolyze lipid substrates.I
 
== Sequence Analysis ==
== Sequence Analysis ==
-
The primary sequence of 4Q7Q shows several conserved sequences between it and esterase-like proteins. A sequence of GDSI—similar to the GDSL sequence seen from its family and superfamily—can be seen between 4Q7Q and enzymes like Isoamyl Acetate-Hydrolyzing Esterase. Other noteworthy conserved sequences between esterases and 4Q7Q include GxND and DGxH.
+
Analysis of the primary structure of 2O14 began in SPRITE. Right-handed superposition analysis related two possible enzymes to 2O14, 1BWP and 1PP4. RMSD values for 1BWP and 1PP4 were 0.92 and 0.93 respectively and both proteins matched five residues with 2O14. 1BWP is defined as “Platelet-activating factor acetyl hydrolase,” and 1PP4 is defined as “Rhamnogalacturonan acetyl esterase.”
 +
The primary structure of 2O14 exhibits its active site through the specific residues of Ser171, Gly209, Asn241, Asp339, and His342. This active site spans residues 170 to 350 with this region defined as “Rhamnogalacturonan acetyl esterase like” and falling into the SGNH hydrolase superfamily through BLAST analysis of the primary structure.
 +
Analysis of the primary sequence alone showed promising signs of 2O14 acting as a hydrolase due to its relation to RGAE through its active site and the SGNH hydrolase superfamily.
-
These enzymes also share similar secondary structures. Segments of alpha-helixes and beta-sheet strands appear and remain nearly entirely conserved throughout esterase analysis. A few conserved coils appear, but these sections do not appear as often as the other two secondary structures.
+
== Structural Analysis ==
-
Similar conserved sequences could be found between 4Q7Q and lipases. The GDSI, GxND, and DGxH sequences can be seen from lipases like 7BXD.? The same secondary structure segments can also be located in the lipases analyzed.
+
2O14 contains several similarities secondary structures to proteins 1K7C and 1PP4. 2O14 exhibits a GDSL 2 sequence motif, a variant of the GDSL motif exhibited in both 1K7C and 1PP4. Structural analysis using the Dali software showed both proteins superimposed over 2O14 with z-scores of 25.1 and 24.8 respectively. These two proteins are defined as rhamnogalacturonan acetyl esterases.
 +
After obtaining data from SPRITE, 2O14 was superimposed onto 1PP4 giving an angstroms difference of 0.736Å. This mostly came in the alpha helix region both proteins where the proposed active site is located. Through all analyses, the beta sheet region of 2O14 had no hits relating to its structure or sequence.
 +
Analysis of the structure further emphasized the similarities to RGAE and the proposed function of 2O14 being a hydrolase of some kind, either a lipase or esterase.
-
== Structural Analysis ==
+
== Substrates ==
-
Right-hand SPRITE analysis revealed 4Q7Q exhibited residues like those seen from enzymes operating with acetyl-like substrates. Specifically, residues Ser. 30, Gly. 69, Asn. 97, Asp. 251, and His. 254 on the A and B chains of 4Q7Q line up with similarly positioned residues on esterases like Platelet-Activating Factor Acetylhyd (PAFA), which exhibited an RMSD of 0.25 angstroms when compared to 4Q7Q.
+
SwissDock analysis was done on specific substrates including 4-Nitrophenyl alpha-glucoside, 2-acetamido-2-deoxy-beta-D-glucopyranose, and p-nitrophenyl phosphate. Both PNPG and GlcNac exhibited high binding affinities to 2O14. This would be due 2O14’s proposed enzymatic function.
-
Other proteins with similar motifs of note are Thioesterase I and Rhamnogalacturonan Acetylesterase, with RMSD values of 0.46 and 0.61, respectively. These alignments focus on the same active site as PAFA did, suggesting the acetyl-like substrates 4Q7Q focuses on are similar to esters.
+
== Theoretical Functionality and Proposed Bodily Purpose ==
-
PFAM graphics from DALI revealed significant structural equivalence between 4Q7Q, a lipase-like protein, Rhamnogalacturonan Acetylesterase, and Sialate O-acetylesterase.
+
During our research and experiments the group had used SwissDock to potentially bind to protein 2O14. The determined smile code for the protein was C1=CC(=CC=C1[N+](=O)[O-])O[C@@H]2[C@@H]([C@H]([C@@H]([C@H](O2)CO)O)O)O, and found relatively low calculated affinity values ranging from -6.133 to -3.387. The SwissDock predicted four binding sites it believed molecules could interact with the protein at. One inside the protein itself as shown in Figure 1. Other spaces that were seen to have good binding affinity were outside of the protein embedded in the side, Figure 2, And then finally on the side in Figure 3. The two more likely cases of binding are figures 2 and 3, however, figure 1 had the lowest binding affinity.
-
== Substrates ==
+
Each of the binding affinities correlated with some kind of saccharide, each saccharide was found to interact with acetyl esterases. Furthermore, all the prior data had shown that protein 2O14 was most like an acetyl esterase being most closely related to Rhamnogalacturan acetylesterase, the group decided to experiment on the protein using PNPP to see whether their protein interacted with it.
-
SwissDock analysis showed a preference for larger molecules, specifically fatty acids. Lactide, Ethyl Butyrate, and Triethylene Glycol exhibited noticeably weak binding affinities to the theorized active site of 4Q7Q. These ligands may be ill-suited to act as substrates for 4Q7Q as they are remarkably polar, and lipids—one of the potential categories of substrates for 4Q7Q—are mostly non-polar.
+
After performing all the necessary steps to extract the protein, it was found that most of the protein came out in the first elution, elution 1. Then using UV-Vis, 10µL of protein solution was added to a cuvette with a 10:1 ratio of PNPP and PNPP buffer solution. This showed an instantly yellow solution, so it was believed that protein was in the elution. We did the same test again with elution 2, to confirm that the protein was in elution 1, and had similar results. The group then attempted to adjust the pH to be more like dirt and soil pH and got completely opposite results, where the absorbance of the first two tests was around 0.95 and the pH test was around 0.09. With these completely opposite results, it was unclear if elution 1 had the protein. Figures 4, 5, and 6 show the explained experiments above in order.
-
Despite this, these ligands show noticeable hydrophobic interactions with the active site. This implies 4Q7Q uses hydrophobic regions to help guide substrates into the right orientation for enzymatic processes. This also further supports the possibility that 4Q7Q primarily operates with hydrophobic lipid-based substrates. This also explains why Methyl Acetate exhibited a relatively weaker affinity for 4Q7Q, as its smaller structure prevented hydrophobic interactions.
+
== Protein Extraction ==
-
== Theoretical Functionality and Proposed Bodily Purpose ==
+
The protein was grown inside of ''E.coli'' with a His-tag for resistance to the antibiotic, Ampicillin. Once the protein was grown, the protein solution was centrifuged and sonicated multiple times to extract the expressed protein. For purification, the protein solution was ran through a Ni-NTA column gathering approximately 5 mL of protein elutions in total.
-
Highlight the data that helped you come to your conclusion here including any relevant figures. Make sure include potential substrates and binding sites.
+
== Figure 4 ==
 +
[[Image:1_30mg1stSMALL.jpg]]
 +
Protein test using 10µL of protein from elution and 30mg of PNPP and 3mL of PNPP solution. Measuring absorbance at 405M.
 +
 +
== Figure 5 ==
 +
[[Image:2_30mg2ndelutSMALL.jpg]]
 +
 +
Protein test using 10µL of protein from elution 2 and 30mg of PNPP and 3mL of PNPP solution. Measuring absorbance at 405M.
 +
 +
== Figure 6 ==
 +
[[Image:3_ph560SMALL.jpg]]
 +
 +
Protein test using 10µL of protein from elution 1 and 30mg of PNPP and 3mL of PNPP solution, changing the pH to about 5.60. Measuring absorbance at 405M.
 +
 +
== Figure 7 ==
 +
[[Image:ProteinGel.jpg]]
 +
 +
Image of SDS-PAGE gel that proves that protein was present in the sample.
 +
 +
The following week two more experiments were run, one with a 1:1 ratio of PNPP and PNPP solution with 10µL of protein solution for elution 1. The first was just with the PNPP mixture and showed no positive results, with absorbance values bouncing back and forth between values in the range of 0.15 to 0.096. After this test, a drop of 3.0M HCL was added to the solution to increase acidity slightly, which harbored almost identical results to the previous test. After, an experiment with PNPA was performed to see if a change in substrate would cause the protein to react. Figures 8, 9, and 10 show the results of these reactions.
 +
 +
== Figure 8 ==
 +
[[Image:4_3mgElut1SMALL.jpg]]
 +
 +
Protein test using 10µL of protein from elution 1 and 3mg of PNPP and 3mL of PNPP solution. Measuring absorbance at 405M.
 +
 +
== Figure 9 ==
 +
[[Image:5_3mg1HClSMALL.jpg]]
 +
 +
Protein test using 10µL of protein from elution 1 and 3mg of PNPP and 3mL of PNPP solution and 1 drop of HCL with a pH of about 10.50. Measuring absorbance at 405M.
 +
 +
== Figure 10 ==
 +
[[Image:6_10mg10mLPNPASMALL.png]]
 +
 +
Protein test using 10µL of protein from elution 1 and 10mg of PNPA and 10mL of PNPP solution and 1 drop of HCL with a pH of about 10.50. Measuring absorbance at 405M.
 +
 +
The following week two more reactions were run to prove that the protein function was as an acetyl esterase. The second to last experiment was run with a ratio for 5:1 PNPP and PNPP solution which resulted in no absorbance change throughout the experiment. Then as a final used PNPA with a 5:1 ratio solution, and during this experiment 5µL of protein solution was added to the cuvette solution being measured. This was done to see if more protein concentration was needed to perform the reaction, and after 30 minutes and 155µL of protein solution, no absorbance change was observed. Figures 11 and 12 show these reactions.
 +
 +
== Figure 11 ==
 +
[[Image:7_15mgPNPPSMALL.png]]
 +
 +
Protein test using 10µL of protein from elution 1 and 15mg of PNPA and 3mL of PNPP solution. Measuring absorbance at 405M.
 +
 +
== Figure 12 ==
 +
[[Image:8_PNPAaddproteinSMALL.jpg]]
 +
 +
Protein test with increasing protein solution of 5µL every minute from elution 1 with 15mg of PNPA and 3mL PNPP solution. The pH of the solution is 8.35 and it was measuring absorbance at 405M.
 +
 +
This data shows two possible case results. One is that the protein sample might have been out of a cold environment for too long which would cause the protein to denature and not operate as predicted. Another potential case is that the predicted function of the protein was incorrect.
</StructureSection>
</StructureSection>
 +
 +
== Figure 1 ==
 +
[[Image:Firstswissdock.png]]
 +
 +
Lowest Calculated affinity image from SwissDock of -6.133 kcal/mol, with the ligand 4-nitrophenyl alpha-glucoside.
 +
 +
== Figure 2 ==
 +
[[Image:OutsideActiveSite.png]]
 +
 +
The second lowest calculated binding affinity value, -5.629 kcal/mol, predicted bonding site. Believed that the protein’s binding site is in the center between the beta sheet group on top and the alpha helices on the bottom with the ligand 2-acetamido-2-deoxy-beta-D-glucopyranose.
 +
 +
== Figure 3 ==
 +
[[Image:Smallotherprotein.png]]
 +
 +
Image shows a similar protein, 1PP4, having a different bonding site on the outside of the protein with a binding affinity of -4.429 kcal/mole with the ligand 2-acetamido-2-deoxy-beta-D-glucopyranose. This is interesting due to the similarities of 2O14 and 1PP4, but emphasizes the differences in their active sites
== References ==
== References ==
-
A) 1WAB. Protein Database, 1997. https://www.rcsb.org/structure/1WAB
+
A) Mølgaard, A.; Kauppinen, S.; Larsen, S. Rhamnogalacturonan Acetylesterase Elucidates the Structure and Function of a New Family of Hydrolases. Structure 2000, 8 (4), 373–383. https://doi.org/10.1016/S0969-2126(00)00118-0.
-
B) Ho, Y. S.; Sewnson, L.; Derewenda, U.; Serre, L.; Wei, Y.; Dauter, Z.; Hattori, M.; Adachi, T.; Aoki, J.; Arai, H.; Inoue, K.; Derewenda, Z. S. Brain acetylhydrolase that inactivates platelet-activating factor is a G-protein-like trimer. Nature, 1997, 385, 89-93. https://www.nature.com/articles/385089a0 https://www.nature.com/articles/385089a0
+
B) Schoch, C. L.; Ciufo, S.; Domrachev, M.; Hotton, C. L.; Kannan, S.; Khovanskaya, R.; Leipe, D.; Mcveigh, R.; O’Neill, K.; Robbertse, B.; Sharma, S.; Soussov, V.; Sullivan, J. P.; Sun, L.; Turner, S.; Karsch-Mizrachi, I. NCBI Taxonomy: A Comprehensive Update on Curation, Resources and Tools. Database 2020, 2020, baaa062. https://doi.org/10.1093/database/baaa062.
C) Miesfeld, R. L.; McEvoy, M. M. Biochemistry, 2nd ed.; W. W. Norton & Company, 2021.
C) Miesfeld, R. L.; McEvoy, M. M. Biochemistry, 2nd ed.; W. W. Norton & Company, 2021.
D) SGNH hydrolase superfamily. InterPro, 2017. https://www.ebi.ac.uk/interpro/entry/InterPro/IPR036514/
D) SGNH hydrolase superfamily. InterPro, 2017. https://www.ebi.ac.uk/interpro/entry/InterPro/IPR036514/
-
E) Molgaard, A.; Kauppinen, S.; Larsen, S. Rhamnogalacturonan acetylesterase elucidates the structure and function of a new family of hydrolases. Struct., 2000, 8(4), 373-383. https://www.sciencedirect.com/science/article/pii/S0969212600001180?via%3Dihub
+
E) Bugnon M, Röhrig UF, GOullieux M, Perez MAS, Daina A, Michielin O, Zoete V. SwissDock 2024: major enhancements for small-molecule docking with Attracting Cavities and AutoDock Vina. Nucleic Acids Res. 2024
-
F) 4Q7Q. Protein Database, 2014. https://www.rcsb.org/structure/4Q7Q
+
F) Eberhardt J, Santos-Martins D, Tillack AF, Forli S.. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model., 2021
-
G) Rio, T. G. D.; et al. Complete genome sequence of Chitinophaga pinensis type strain (UQM 2034). Stand. Genomic. Sci., 2010, 2(1), 87-95. https://pmc.ncbi.nlm.nih.gov/articles/PMC3035255/
+
-
H) Akoh, C. C.; Lee, G.; Liaw, Y.; Huang, T.; Shaw, J. GDSL family of serine esterases/lipases. Prog. Lipid Res., 2004, 43(6), 534-552. https://pubmed.ncbi.nlm.nih.gov/15522763/
+
-
I) 7BXD. Protein Database, 2021. https://www.rcsb.org/structure/7BXD
+
-
J) Madej,T.; Lanczycki, C. J.; Zhang, D.; Thiessen, P. A.; Geer, R. C.; Marchler-Bauer, A.; Bryant, S. H. MMDB and VAST+: tracking structural similarities between macromolecular complexes. Nucleic Acids Res., 2014, 42(Database), D297-303. https://doi.org/10.1093/nar/gkt1208
+
<references/>
<references/>

Current revision

2O14 Structure and Proposed Functionality

(NOTE TO ALL EDITORS: This page is part of a final project for a biochemistry lab at Elizabethtown College. Please do not edit this.)

2O14 is a monomeric protein complex that originates from the bacterial species Bacillus subtilis and has a mass of 41.79 kDa. Using InterPro this protein was assumed to be like Rhamnogalacturan acetylesterase, which falls under the SGNH Hydrolase Superfamily with structural and sequential similarities to lipases, esterases, along with similar functional domains to galactose-binding. With the research performed with current protein databases available, the evidence suggests that this protein removes acetyl groups from the backbone proteins by hydrolyzing the acetyl esters. It accomplishes this most likely through the catalytic triad of serine-histidine-aspartic acid.

3D Representation of 2O14’s structure.

Drag the structure with the mouse to rotate

Contents

Figure 1

Image:Firstswissdock.png

Lowest Calculated affinity image from SwissDock of -6.133 kcal/mol, with the ligand 4-nitrophenyl alpha-glucoside.

Figure 2

Image:OutsideActiveSite.png

The second lowest calculated binding affinity value, -5.629 kcal/mol, predicted bonding site. Believed that the protein’s binding site is in the center between the beta sheet group on top and the alpha helices on the bottom with the ligand 2-acetamido-2-deoxy-beta-D-glucopyranose.

Figure 3

Image:Smallotherprotein.png

Image shows a similar protein, 1PP4, having a different bonding site on the outside of the protein with a binding affinity of -4.429 kcal/mole with the ligand 2-acetamido-2-deoxy-beta-D-glucopyranose. This is interesting due to the similarities of 2O14 and 1PP4, but emphasizes the differences in their active sites

References

A) Mølgaard, A.; Kauppinen, S.; Larsen, S. Rhamnogalacturonan Acetylesterase Elucidates the Structure and Function of a New Family of Hydrolases. Structure 2000, 8 (4), 373–383. https://doi.org/10.1016/S0969-2126(00)00118-0. B) Schoch, C. L.; Ciufo, S.; Domrachev, M.; Hotton, C. L.; Kannan, S.; Khovanskaya, R.; Leipe, D.; Mcveigh, R.; O’Neill, K.; Robbertse, B.; Sharma, S.; Soussov, V.; Sullivan, J. P.; Sun, L.; Turner, S.; Karsch-Mizrachi, I. NCBI Taxonomy: A Comprehensive Update on Curation, Resources and Tools. Database 2020, 2020, baaa062. https://doi.org/10.1093/database/baaa062. C) Miesfeld, R. L.; McEvoy, M. M. Biochemistry, 2nd ed.; W. W. Norton & Company, 2021. D) SGNH hydrolase superfamily. InterPro, 2017. https://www.ebi.ac.uk/interpro/entry/InterPro/IPR036514/ E) Bugnon M, Röhrig UF, GOullieux M, Perez MAS, Daina A, Michielin O, Zoete V. SwissDock 2024: major enhancements for small-molecule docking with Attracting Cavities and AutoDock Vina. Nucleic Acids Res. 2024 F) Eberhardt J, Santos-Martins D, Tillack AF, Forli S.. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model., 2021


Personal tools