Sandbox 326

From Proteopedia

(Difference between revisions)
Jump to: navigation, search
Current revision (16:41, 28 April 2025) (edit) (undo)
 
(10 intermediate revisions not shown.)
Line 2: Line 2:
-
3B7F is a currently unknown protein in terms of its function. Based on current structural analysis, it consists of one unique chain with a mass of 45.04 kDa and an atom count of 3,216. Based on previous studies, 3B7F is assumed to be a glycosyl hydrolase, however, the function is still not entirely known.[1] Through the following procedures and data collection, the goal of this research was to analyze the sequence and structure of 3B7F in order to better understand its enzymatic function.
+
3B7F is a currently unknown protein in terms of its function. Based on current structural analysis, it consists of one unique chain with a mass of 43.34 kDa and an atom count of 3,216. The mass was found by multiply the 394 amino acids of the protein by the mass of each amino acid (~0.110 kDa). Based on previous studies, 3B7F is assumed to be a glycosyl hydrolase, however, the function is still not entirely known.[1] Through the following procedures and data collection, the goal of this research was to analyze the sequence and structure of 3B7F in order to better understand its enzymatic function.
'''Research Question:''' What is the function of the 3B7F protein, and how can this be determined through both computational and wet lab techniques?
'''Research Question:''' What is the function of the 3B7F protein, and how can this be determined through both computational and wet lab techniques?
-
'''Relevance:''' The goal of this research is to determine the function of the 3BF7 protein in order to evaluate whether it can degrade xyloglucan or other types of carbohydrates and glycoconjugates in plants. Knowing this will allow future researchers to be able to better understand xyloglucan/carbohydrate glycoconjugate degradation in plants, and allow for known pathways to be expanded upon. Through this experimentation, we can also learn more about the ways in which both computational bioinformatics and wet lab techniques can aid in determining the function of a protein with a known structure.
+
'''Relevance:''' The goal of this research is to determine the function of the 3BF7 protein in order to evaluate whether it can degrade xyloglucan or other types of carbohydrates and glycoconjugates in plants. Knowing this will allow future researchers to be able to better understand xyloglucan/carbohydrate/glycoconjugate degradation in plants, and allow for known pathways to be expanded upon. Through this experimentation, we can also learn more about the ways in which both computational bioinformatics and wet lab techniques can aid in determining the function of a protein with a known structure.
'''Hypothesis:''' 3B7F is a xyloglucanase, a type of glycosyl hydrolase that acts to degrade xyloglucan in plant cell walls. It presents optimal activity in fairly acidic conditions and demonstrates potentially satisfactory binding with PNP phosphate and lysine p nitroanilide.
'''Hypothesis:''' 3B7F is a xyloglucanase, a type of glycosyl hydrolase that acts to degrade xyloglucan in plant cell walls. It presents optimal activity in fairly acidic conditions and demonstrates potentially satisfactory binding with PNP phosphate and lysine p nitroanilide.
-
<StructureSection load='3B7F' size='400' side='right' caption='Structure of 3B7F' scene=''>
+
<StructureSection load='3B7F' size='400' side='right' caption='Secondary Structure of 3B7F' scene=''>
== Methods ==
== Methods ==
-
'''SPRITE:
+
'''SPRITE:'''
-
'''
+
 
The PBD ID for the protein of interest (3B7F) was entered and 2-residue hits excluded. Once the search was complete, "List of Hits" results were obtained. "Full details" result alignments also analyzed. Hits by each side of protein viewed by clicking "Arranged by sites" function. Alignments with an RMSD below 2.0 Angstroms were reviewed.
The PBD ID for the protein of interest (3B7F) was entered and 2-residue hits excluded. Once the search was complete, "List of Hits" results were obtained. "Full details" result alignments also analyzed. Hits by each side of protein viewed by clicking "Arranged by sites" function. Alignments with an RMSD below 2.0 Angstroms were reviewed.
Line 75: Line 75:
Based on the findings through SPRITE and Chimera, 1XNY_C00 had the lowest RMSD value at 1.875 angstroms. Therefore, it is hypothesized that 3B7F was a carboxylase.
Based on the findings through SPRITE and Chimera, 1XNY_C00 had the lowest RMSD value at 1.875 angstroms. Therefore, it is hypothesized that 3B7F was a carboxylase.
-
A yellow and green objects
 
-
AI-generated content may be incorrect
+
[[Image:SPRITE.jpg]] [[Image:Chimera.jpg]]
-
Dali gave mainly xyloglucanase matches and no matches were carboxylases, the hypothesis that 3B7F was a carboxylase was proven to be wrong. It is not hypothesized that 3B7F is a xyloglucanase. This was due to the fact that 1XNY did not show up as a match in Dali.
+
A 183 GLY matches A 207 GLY/
 +
B 419 GLY matches A 154 GLY/
 +
B 420 ALA matches A 153 ALA
-
Xyloglucanases break down xyloglucan, a hemicellulose in plant cell walls. The substrate is xyloglucan and water for the (endo-)beta-1,4-xyloglucanases, and a common cofactor is calcium.
+
'''Figure 1.''' SPRITE and Chimera binding of 3B7F (yellow) with 1XNY_C00 (green).
-
The BLAST search shows glycosyl hydrolases having similar sequences to 3B7F.
+
Dali provided mainly xyloglucanase matches and no matches were carboxylases. Therefore, the initial hypothesis that 3B7F was a carboxylase was no longer supported. Based on the Dali results, it is now hypothesized that 3B7F is a xyloglucanase. [2]
-
== Using InterPro to Predict Protein Function ==
+
'''Table 1.''' Top Matches on Dali (Z: greater than 4.0 indicates structural significance; RMSD: root mean square deviation (distance in Angstroms between superimposed molecules); LALI: length of alignment; NRES: number of residues in hit structure; %ID: similarity between hit sequence and query sequence).
 +
[[Image:TableDali.jpg]]
-
Research shows that 4Q7Q is a member of the SGNH Hydrolase protein super family. BLAST and InterPro both suggested 4Q7Q’s inclusion in this family, and the known conserved residues seen from SPRITE analysis—Serine, Glycine, Asparagine, and Histidine—line up with those observed throughout this family.D,E Notably, this superfamily is also referred to as the GDSL Hydrolase superfamily.D,E
+
[[Image:BLAST.jpg]]
 +
'''Figure 2.''' Superimposition of 6P2N:A (orange) and 3B7F (green).
-
4Q7Q’s inclusion in this family also supports its SPRITE-derived hypothetical functionality. Rhamnogalacturonan Acetylesterase—the enzyme with one of the best SPRITE-based alignment relative to 4Q7Q—is a member of this family.F Proteins in this family are also known for containing a “unique hydrogen bond network that [stabilizes]” the active site.F
 
-
Regarding what protein family 4Q7Q belongs to, DALI results suggest it is a part of a sub-family of the greater GDSL/SGNH superfamily. A PDB90% DALI search labels 4Q7Q as a part of the “Lipolytic Protein G-D-S-L Family,which refers to enzymes that hydrolyze lipid substrates.I
+
Xyloglucanases break down xyloglucan, a hemicellulose in plant cell walls. The substrate is xyloglucan and water for the (endo-)beta-1,4-xyloglucanases, and a common cofactor is calcium. [3]
 +
 
 +
The BLAST search shows glycosyl hydrolases having similar sequences to 3B7F. The xyloglucanases/glycosyl hydrolases from the BLAST search come from Cupriavidus, a type of gram-negative bacteria. The E-values were 0.0 for most of the results, signifying high matches. The scores were high as well, with percent identification in the 90 percentage range. The query coverage was also 100% for most of the results. Based on the results, the protein family of 3B7F is likely sialidase, which breaks down sialic acid into carbohydrates. Sialidases are a family of proteins that include glycosyl hydrolases and xyloglucanases, which further supports the hypothesis that 3B7F is a xyloglucanase. [4]
 +
 
 +
== Using InterPro to Predict Protein Function ==
 +
 
 +
The family with the closest similarities to 3B7F is xyloglucanase, which matches the BLAST and Dali findings. These enzymes act on xyloglucan, a plant wall polysaccharide, by breaking the glucosidic bonds of unbranched glucose residues. Most of the proteins in this family bind to cell wall structures or participate in photosystem II, which correlates with the expected activity of 3B7F, since it would be acting on plant cell walls. This family consists mainly of bacteria, as well as eukaryotes and archaea. This also matches the BLAST data because it suggested that the sequence was commonly found in bacteria. The proteomes are a lot of bacteria and marine life. The most closely matched structures are different xyloglucanases and oligoxyloglucan reducing end-specific cellobiohydralases, which is the superfamily that the protein belongs to. The main pathways are also structural degradation pathways, which also aligns with the results we have gained so far. [5]
== Molecular Docking with SwissDock ==
== Molecular Docking with SwissDock ==
-
The primary sequence of 4Q7Q shows several conserved sequences between it and esterase-like proteins. A sequence of GDSI—similar to the GDSL sequence seen from its family and superfamily—can be seen between 4Q7Q and enzymes like Isoamyl Acetate-Hydrolyzing Esterase. Other noteworthy conserved sequences between esterases and 4Q7Q include GxND and DGxH.
+
Based on the hydrolase ligands that we analyzed, PNP phosphate seems to be the best ligand for 3B7F, with the highest binding affinity of -8.189 kcal/mol, and the lowest binding affinity of -6.146 kcal/mol. This is a very strong binding affinity, so this substrate was used for protein assays. Lysine p nitroanilide also has its highest binding affinity at -7.286 kcal/mol and the lowest binding affinity of -6.551 kcal/mol, indicating that it is another suitable substrate for 3B7F. [6]
-
These enzymes also share similar secondary structures. Segments of alpha-helixes and beta-sheet strands appear and remain nearly entirely conserved throughout esterase analysis. A few conserved coils appear, but these sections do not appear as often as the other two secondary structures.
+
[[Image:SwissDock1.jpg]]
 +
'''Figure 3.''' 3B7F Binding with PNPP at -8.189 kcal/mol.
-
Similar conserved sequences could be found between 4Q7Q and lipases. The GDSI, GxND, and DGxH sequences can be seen from lipases like 7BXD.? The same secondary structure segments can also be located in the lipases analyzed.
+
== Protein Concentration ==
-
== Protein Purification ==
+
Based on the Bradford Assay, elution 4 had the most protein (4.118 mg/mL), while elution 3 had the second most amount of protein (3.261 mg/mL). The rest of the elutions had protein concentrations around 2 mg/mL. The R² value of the standard curve was 0.9857, indicating fairly accurate results.
-
Right-hand SPRITE analysis revealed 4Q7Q exhibited residues like those seen from enzymes operating with acetyl-like substrates. Specifically, residues Ser. 30, Gly. 69, Asn. 97, Asp. 251, and His. 254 on the A and B chains of 4Q7Q line up with similarly positioned residues on esterases like Platelet-Activating Factor Acetylhyd (PAFA), which exhibited an RMSD of 0.25 angstroms when compared to 4Q7Q.
+
[[Image:StandardCurve.jpg]]
 +
'''Figure 4.''' Standard Curve at 595 nm for 3B7F Elutions.
-
Other proteins with similar motifs of note are Thioesterase I and Rhamnogalacturonan Acetylesterase, with RMSD values of 0.46 and 0.61, respectively. These alignments focus on the same active site as PAFA did, suggesting the acetyl-like substrates 4Q7Q focuses on are similar to esters.
+
'''Table 2.''' Final Protein Concentration of Each Elution
 +
[[Image:FinalConc.jpg]]
-
PFAM graphics from DALI revealed significant structural equivalence between 4Q7Q, a lipase-like protein, Rhamnogalacturonan Acetylesterase, and Sialate O-acetylesterase.
+
== Protein Analysis ==
-
== Protein Concentration ==
+
Possibly due to errors during purification, no bands were seen on the SDS-PAGE gel for 3B7F.
-
SwissDock analysis showed a preference for larger molecules, specifically fatty acids. Lactide, Ethyl Butyrate, and Triethylene Glycol exhibited noticeably weak binding affinities to the theorized active site of 4Q7Q. These ligands may be ill-suited to act as substrates for 4Q7Q as they are remarkably polar, and lipids—one of the potential categories of substrates for 4Q7Q—are mostly non-polar.
+
Several conditions were analyzed through enzyme activity assays. Initially, 50 uL of 3B7F elution 4 with 1 mg/mL PNPP demonstrated some linearity between 0-600 seconds. However, this was not able to be replicated. Both 25 uL and 75 uL of 3B7F elution 4 were also used with PNPP, and no linearity was found. Due to the enzyme’s optimal conditions, it was also cooled down in -20°C for five minutes, however, no linearity was seen. The assay was also perfomed with elution 1 to identify if any protein was present after the wash. Again, no linearity was shown. PNPA was also used as a substrate with elution 4, and pHs of 1.5 and 5.0 were also used during separate trials in order to identify the optimal conditions of the enzyme. [7] However, these assays still showed no linearity. It was determined that there was an error during purification, leading to unidentifiable concentrations of 3B7F for enzymatic study.
-
Despite this, these ligands show noticeable hydrophobic interactions with the active site. This implies 4Q7Q uses hydrophobic regions to help guide substrates into the right orientation for enzymatic processes. This also further supports the possibility that 4Q7Q primarily operates with hydrophobic lipid-based substrates. This also explains why Methyl Acetate exhibited a relatively weaker affinity for 4Q7Q, as its smaller structure prevented hydrophobic interactions.
+
[[Image:Assay.jpg]]
 +
'''Figure 5.''' 50 uL of 3B7F Elution 4 with 1 mg/mL PNPP.
-
== Protein Analysis ==
+
[[Image:Assay2.jpg]]
 +
Slope = 7 x 10⁻⁵ Abs/sec
 +
C = 7 x 10⁻⁵/18000*1
 +
= 3.89 x 10⁻⁹ M/sec
-
Highlight the data that helped you come to your conclusion here including any relevant figures. Make sure include potential substrates and binding sites.
+
'''Figure 6.''' Zoom in with linear trendline of Figure 5 and enzyme activity calculations.
 +
 
 +
== Conclusion ==
 +
Based on the results, no enzymatic function of 3B7F can be appropriately determined. However, based on the computational results, it is likely that 3B7F has xyloglucanase activity. In the future, the protein should be grown again in order to retest the protein’s enzymatic activity after a proper purification. The results that were successfully gathered did show accuracy and precision. However, likely due to incorrect solution making, the protein was not able to be properly purified, and therefore, was not present in the elutions. Interestingly enough, the Bradford Assay did show protein concentrations, but they likely were not for the desired protein (3B7F), which may be the reason for no protein present at the expected mass of approximately 43 kDa. If 3B7F is confirmed as a xyloglucanase, it can possibly be used for drug targeting in the colon through xyloglucan coatings. [8]
</StructureSection>
</StructureSection>
Line 122: Line 140:
== References ==
== References ==
-
A) 1WAB. Protein Database, 1997. https://www.rcsb.org/structure/1WAB
+
[1] RCSB PDB - 3B7F: Crystal structure of a putative glycosyl hydrolase with bnr repeats (reut_b4987) from ralstonia eutropha jmp134 at 2.20 A resolution. Rcsb.org. https://www.rcsb.org/structure/3b7f (accessed 2025-03-09).
-
B) Ho, Y. S.; Sewnson, L.; Derewenda, U.; Serre, L.; Wei, Y.; Dauter, Z.; Hattori, M.; Adachi, T.; Aoki, J.; Arai, H.; Inoue, K.; Derewenda, Z. S. Brain acetylhydrolase that inactivates platelet-activating factor is a G-protein-like trimer. Nature, 1997, 385, 89-93. https://www.nature.com/articles/385089a0 https://www.nature.com/articles/385089a0
+
 
-
C) Miesfeld, R. L.; McEvoy, M. M. Biochemistry, 2nd ed.; W. W. Norton & Company, 2021.
+
[2] Dali server. ekhidna2.biocenter.helsinki.fi. http://ekhidna2.biocenter.helsinki.fi/dali/.
-
D) SGNH hydrolase superfamily. InterPro, 2017. https://www.ebi.ac.uk/interpro/entry/InterPro/IPR036514/
+
 
-
E) Molgaard, A.; Kauppinen, S.; Larsen, S. Rhamnogalacturonan acetylesterase elucidates the structure and function of a new family of hydrolases. Struct., 2000, 8(4), 373-383. https://www.sciencedirect.com/science/article/pii/S0969212600001180?via%3Dihub
+
[3] Glycoside hydrolases - CAZypedia. www.cazypedia.org. https://www.cazypedia.org/index.php/Glycoside_hydrolases.
-
F) 4Q7Q. Protein Database, 2014. https://www.rcsb.org/structure/4Q7Q
+
 
-
G) Rio, T. G. D.; et al. Complete genome sequence of Chitinophaga pinensis type strain (UQM 2034). Stand. Genomic. Sci., 2010, 2(1), 87-95. https://pmc.ncbi.nlm.nih.gov/articles/PMC3035255/
+
[4] NCBI. BLAST: Basic Local Alignment Search Tool. Nih.gov. https://blast.ncbi.nlm.nih.gov/Blast.cgi.
-
H) Akoh, C. C.; Lee, G.; Liaw, Y.; Huang, T.; Shaw, J. GDSL family of serine esterases/lipases. Prog. Lipid Res., 2004, 43(6), 534-552. https://pubmed.ncbi.nlm.nih.gov/15522763/
+
 
-
I) 7BXD. Protein Database, 2021. https://www.rcsb.org/structure/7BXD
+
[5] InterPro EMBL-EBI. InterPro protein sequence analysis & classification < InterPro < EMBL-EBI. Ebi.ac.uk. https://www.ebi.ac.uk/interpro/.
-
J) Madej,T.; Lanczycki, C. J.; Zhang, D.; Thiessen, P. A.; Geer, R. C.; Marchler-Bauer, A.; Bryant, S. H. MMDB and VAST+: tracking structural similarities between macromolecular complexes. Nucleic Acids Res., 2014, 42(Database), D297-303. https://doi.org/10.1093/nar/gkt1208
+
 
 +
[6] SwissDock. www.swissdock.ch. https://www.swissdock.ch/.
 +
 
 +
[7] Shi, H.; Guo, J.; Yan, X.; Cui, G.; Tan, Z.; Zhu, X.; Zhou, J.; He, S.; Wang, T.; Li, X. Characterization of a Xyloglucananse in Biodegradation of Woody Plant Xyloglucan from Caldicellulosiruptor Kronotskyensis. BioResources 2021, 17 (1), 673–681. https://doi.org/10.15376/biores.17.1.673-681.
 +
 
 +
[8] Doggwiler, V.; Lanz, M.; Paredes, V.; Lipps, G.; Imanidis, G. Tablet Formulation with Dual Control Concept for Efficient Colonic Drug Delivery. International Journal of Pharmaceutics 2023, 631, 122499. https://doi.org/10.1016/j.ijpharm.2022.122499.
<references/>
<references/>

Current revision

Characterization and Preliminary Functionality of 3B7F


3B7F is a currently unknown protein in terms of its function. Based on current structural analysis, it consists of one unique chain with a mass of 43.34 kDa and an atom count of 3,216. The mass was found by multiply the 394 amino acids of the protein by the mass of each amino acid (~0.110 kDa). Based on previous studies, 3B7F is assumed to be a glycosyl hydrolase, however, the function is still not entirely known.[1] Through the following procedures and data collection, the goal of this research was to analyze the sequence and structure of 3B7F in order to better understand its enzymatic function.

Research Question: What is the function of the 3B7F protein, and how can this be determined through both computational and wet lab techniques?

Relevance: The goal of this research is to determine the function of the 3BF7 protein in order to evaluate whether it can degrade xyloglucan or other types of carbohydrates and glycoconjugates in plants. Knowing this will allow future researchers to be able to better understand xyloglucan/carbohydrate/glycoconjugate degradation in plants, and allow for known pathways to be expanded upon. Through this experimentation, we can also learn more about the ways in which both computational bioinformatics and wet lab techniques can aid in determining the function of a protein with a known structure.

Hypothesis: 3B7F is a xyloglucanase, a type of glycosyl hydrolase that acts to degrade xyloglucan in plant cell walls. It presents optimal activity in fairly acidic conditions and demonstrates potentially satisfactory binding with PNP phosphate and lysine p nitroanilide.

Secondary Structure of 3B7F

Drag the structure with the mouse to rotate

References

[1] RCSB PDB - 3B7F: Crystal structure of a putative glycosyl hydrolase with bnr repeats (reut_b4987) from ralstonia eutropha jmp134 at 2.20 A resolution. Rcsb.org. https://www.rcsb.org/structure/3b7f (accessed 2025-03-09).

[2] Dali server. ekhidna2.biocenter.helsinki.fi. http://ekhidna2.biocenter.helsinki.fi/dali/.

[3] Glycoside hydrolases - CAZypedia. www.cazypedia.org. https://www.cazypedia.org/index.php/Glycoside_hydrolases.

[4] NCBI. BLAST: Basic Local Alignment Search Tool. Nih.gov. https://blast.ncbi.nlm.nih.gov/Blast.cgi.

[5] InterPro EMBL-EBI. InterPro protein sequence analysis & classification < InterPro < EMBL-EBI. Ebi.ac.uk. https://www.ebi.ac.uk/interpro/.

[6] SwissDock. www.swissdock.ch. https://www.swissdock.ch/.

[7] Shi, H.; Guo, J.; Yan, X.; Cui, G.; Tan, Z.; Zhu, X.; Zhou, J.; He, S.; Wang, T.; Li, X. Characterization of a Xyloglucananse in Biodegradation of Woody Plant Xyloglucan from Caldicellulosiruptor Kronotskyensis. BioResources 2021, 17 (1), 673–681. https://doi.org/10.15376/biores.17.1.673-681.

[8] Doggwiler, V.; Lanz, M.; Paredes, V.; Lipps, G.; Imanidis, G. Tablet Formulation with Dual Control Concept for Efficient Colonic Drug Delivery. International Journal of Pharmaceutics 2023, 631, 122499. https://doi.org/10.1016/j.ijpharm.2022.122499.


Personal tools