Sandbox 326

From Proteopedia

(Difference between revisions)
Jump to: navigation, search
Current revision (16:41, 28 April 2025) (edit) (undo)
 
(11 intermediate revisions not shown.)
Line 2: Line 2:
-
3B7F is a currently unknown protein in terms of its function. Based on current structural analysis, it consists of one unique chain with a mass of 45.04 kDa and an atom count of 3,216. Based on previous studies, 3B7F is assumed to be a glycosyl hydrolase, however, the function is still not entirely known.[1] Through the following procedures and data collection, the goal of this research was to analyze the sequence and structure of 3B7F in order to better understand its enzymatic function.
+
3B7F is a currently unknown protein in terms of its function. Based on current structural analysis, it consists of one unique chain with a mass of 43.34 kDa and an atom count of 3,216. The mass was found by multiply the 394 amino acids of the protein by the mass of each amino acid (~0.110 kDa). Based on previous studies, 3B7F is assumed to be a glycosyl hydrolase, however, the function is still not entirely known.[1] Through the following procedures and data collection, the goal of this research was to analyze the sequence and structure of 3B7F in order to better understand its enzymatic function.
-
<StructureSection load='3B7F' size='400' side='right' caption='Structure of 3B7F' scene=''>
+
'''Research Question:''' What is the function of the 3B7F protein, and how can this be determined through both computational and wet lab techniques?
-
== Methods ==
+
'''Relevance:''' The goal of this research is to determine the function of the 3BF7 protein in order to evaluate whether it can degrade xyloglucan or other types of carbohydrates and glycoconjugates in plants. Knowing this will allow future researchers to be able to better understand xyloglucan/carbohydrate/glycoconjugate degradation in plants, and allow for known pathways to be expanded upon. Through this experimentation, we can also learn more about the ways in which both computational bioinformatics and wet lab techniques can aid in determining the function of a protein with a known structure.
-
SPRITE:
+
'''Hypothesis:''' 3B7F is a xyloglucanase, a type of glycosyl hydrolase that acts to degrade xyloglucan in plant cell walls. It presents optimal activity in fairly acidic conditions and demonstrates potentially satisfactory binding with PNP phosphate and lysine p nitroanilide.
-
Enter the PBD ID for the protein of interest and exclude 2-residue hits. Once the search is complete, browse "List of Hits" results obtained. View the "Full details" result alignments. It is also possible to view the alignment between the protein of interest and the matched protein. View hits by each side of protein by viewing "Arranged by sites" function. Review alignments with an RMSD below 2.0 Angstroms and determine whether results are consistent with established function of the protein. Capture images of alignments that match proteins of interest the best and record RMSD values.
+
<StructureSection load='3B7F' size='400' side='right' caption='Secondary Structure of 3B7F' scene=''>
-
Chimera:
+
== Methods ==
-
Enter PBD ID of protein of interest and click fetch in order to show the structure of the protein. Using the results from SPRITE, load the protein of known function into Chimera using its PBD ID. Hide everything but the subunit of interest, and then align the active site motif (type in "match... followed by any atoms of interest). An RMSD value will be shown, and use that to determine the quality of alignment (anything below 2.0 Angstroms is considered high quality). To better visualize the alignment, delete "match" and replace it with "sel". Make it so that the structures align in a similar way that they did on SPRITE. Name and save the file for future use.
+
'''SPRITE:'''
-
Dali:
+
The PBD ID for the protein of interest (3B7F) was entered and 2-residue hits excluded. Once the search was complete, "List of Hits" results were obtained. "Full details" result alignments also analyzed. Hits by each side of protein viewed by clicking "Arranged by sites" function. Alignments with an RMSD below 2.0 Angstroms were reviewed.
-
Open the website and click on PDB search lab, enter the four-letter PDB of the assigned protein (3B7F). Next, submit the structure with the chain identifier (never use one with DNA, pick one representing the sequence the group is interested in). Enter a meaningful description in the job name field, enter email address, submit the job and wait for a link to be emailed. Once the link is emailed, download the results.
+
'''Chimera:
 +
'''
 +
PBD ID of protein of interest entered and “fetch” was selected in order to show the structure of the protein. Based on the results from SPRITE, the protein of known function was loaded into Chimera using its PBD ID. Everything but the subunit of interest was hidden, and then the active site motif was aligned. The RMSD value was shown and used to determine the quality of alignment (anything below 2.0 Angstroms was considered high quality). To better visualize the alignment, "match" was deleted and replaced with "sel".
-
BLAST:
+
'''Dali:
 +
'''
 +
PDB search tab was selected, and the four-letter PDB of the assigned protein (3B7F) was entered. The structure with the chain identifier was submitted. The job was viewed after completion.
-
Go to RCSB webpage and search for the protein by PDB ID, explore the page for the protein. Select "FASTA sequence: and look at the protein sequence and copy it. Go into the NCBI BLAST search page and paste the protein sequence into the sequence field. Using the sequence that shows up, try and identify proteins similar to the query protein.
+
'''BLAST:
 +
'''
 +
The PDB ID was searched for in the RCSB webpage. "FASTA sequence” was selected and the protein sequence was copied into the NCBI BLAST search page. Proteins similar to the query protein were identified based on sequence.
-
InterPro:
+
'''InterPro:
 +
'''
 +
An InterPro search was performed for the sequence of 3B7F. Protein superfamily identification and domains were reviewed. Related proteins in the domain organization were identified. The structures were also analyzed.
-
Perform an InterPro search for the sequence of 3B7F. Conifer protein superfamily identification and domains found. Identify related proteins in the domain organization. Think about what these related proteins have in common with the protein of interest, and what the function of the domains are. Repeat the InterPro search using "View a Structure" link on the main page. Explore the links that InterPro provides and compare these findings with the seqience search.
+
'''SwissDock:
 +
'''
 +
The "Docking with AutoDock Vina" tab was selected on SwissDock. A ligand by submitted by using SMILES string (found through the PubChem database). "Prepare ligand" was clicked, and a target (can use PDB ID) was submitted. The search space was defined, and x, y, and z coordinates for the center of the space being searched were chosen. The parameters were checked and docking was started. Results were analyzed.
-
Molecular Docking with SwissDock:
+
'''Buffers and Solutions:
 +
'''
 +
General steps for Buffers and solutions included adding 80% of total DiH2O to a container, weighing and adding chemicals, adjusting pH as needed, and adding DiH2O to the total volume.
-
Molecular Docking with SwissDock:
+
'''Hand Casting Polyacrylamide Gels:
 +
'''
 +
Made a 4% stacking gel at a pH of 6.8, and a 10% resolving gel at a pH of 8.8.
-
Select "docking with AutoDock Vina" tab. Submit a ligand by using SMILES string (found through the PubChem database). Click "prepare ligand", and submit a target (can use PDB ID). Define the search space, choose x, y, and z coordinates for the center of the space being searched. Check the parameters and start docking. To view the results: click on "here" link provided in the email. A window with an image of the protein will show up. Below the structure, a table with ranked sets of models and energies will show up. Can overlap multiple ligand poses, and a .zip file can be exported and viewed in Chimera.
+
Resolving and stacking gel solutions were prepared without APS or TEMED. A comb was placed into the assembled gel sandwich with a marker. A mark was placed on the glass plate 1 cm below the teeth of the comb, and the comb was removed. The APS and TEMED were added to the resolving gel, and the solution was poured to the mark. The gel was allowed to polymerize for 45-60 minutes. APS and TEMED were added to the stacking solution and poured above the resolving gel. The comb was placed in the cassette and tilted so that the teeth are at a 10º angle. This prevented air from becoming trapped under the comb. The gel was allowed to polymerize for 30-45 minutes. The gels were wrapped in a wet paper towel in the 4ºC fridge for storage.
-
Making Buffers and Solutions:
+
'''Expression of Proteins from Lactose-Inducible Vectors:
 +
'''
 +
The LB Broth was made by adding 10g of tryptone, 10g of NaCl, and 5g of yeast extract together. This was added to 1000 mL of Millipore water. Five mL of this mixture was poured into an overnight culture tube. The other broth was autoclaved for 1.5 hours.
-
General steps for Buffers and solutions include adding 80% of total DiH2O to a container, weighing and adding chemicals, adjusting pH as needed, and adding DiH2O to the total volume.
+
'''Protein Purification:
 +
'''
 +
500 mL of the protein was grown rather than 1L in an attempt to speed up induction. The overnights were grown at night on 03/16/2025 for 11 hours with 50 µg/mL of kanamycin. It was inoculated at 8 am on 03/17/2025. OD600 nm was taken in the morning until 0.4-0.8 OD (an OD of 0.4 was reached around 10 am). It was then induced with 1 nM IPTG and left to grow for three hours before centrifuging. In order to purify, the samples were centrifuged at 5000 x g for 20 minutes. Ten mL of lysis buffer was added, and the pellets were resuspended with a pipette (50 µL was added to a separate centrifuge tube). The cells were sonicated 5x for 30 seconds on ice in between each sonication (50 µL of a sample was added to a new centrifuge tube). The samples were centrifuged for 20 minutes at 15,000 x g (50 µL of a sample was added to a new centrifuge tube). The protein column was set up with 500 µL Ni-NTA beads and a lysis buffer was ran through it to equilibrate it (5x column volumes). The resin was pre-washed with a binding buffer. The cell extract was applied to the resin and allowed to enter. All of the supernatant was added (50 µL of a sample was added to a new centrifuge tube). The column was washed with 5 column volumes of buffer (50 µL of a sample was added to a new centrifuge tube). The column was eluted with 8 column volumes of buffer, the fractions were collected in 1 mL volumes and stored in 5 tubes. To store the column, 5 column volumes of water, and 1-2 mL of 20% EtOH were added, and the column was capped off and stored.
-
Hand Casting Polyacrylamide Gels:
+
'''Protein Concentration:
 +
'''
 +
Seven BSA standards were prepared using the buffer that the protein was stored in (elution buffer). One mL of Bradford Reagent was added to each cuvette. Twenty µL of water was added to the zero cuvette, while 20 µL of each BSA standard or elution was added to the rest of cuvettes. The cuvettes were covered with parafilm and mixed several times by inversion. They were maintained at room temperature for 5-45 minutes. Absorbance was recorded at 595 nm with a Vernier spectrophotometer. A standard curve was constructed and used to find the concentration of the unknown protein in each elution.
-
Made a 4% stacking gel at a pH of 6.8, and a 10% resolving gel at a pH of 8.8.
+
'''SDS-PAGE:
 +
'''
 +
For sample prep, the protein samples were treated with SDS sample buffer and boiled before application. The final concentration of SDS sample buffer loaded onto the gel was 1x.
-
Prepare the resolving and stacking gel solutions without APS or TEMED. Place a comb into the assembled gel sandwich with a marker, place a mark on the glass plate 1 cm below the teeth of the comb, and remove the comb. Add the APS and TEMED to the resolving gel and pour the solution to the mark. Using a Pasteur pipet overlay the monomer solution with water-saturated n-butanol. Allow the gel to polymerize for 45-60 minutes. Once done, pour off the overlay solution and rinse the top of the gel with diH2O. Dry the area above the separating gel with filter paper before pouring the stacking gel. Place the comb in the cassette and tilt so that the teeth are at a 10º angle. This prevents air from becoming trapped under the comb. Add APS and TEMED to the stacking solution and pour the solution down the spacer. Pour until all of the teeth are covered by the solution. Realign the comb in the sandwich and add monomer to fill the cassette completely. Allow the gel to polymerize for 30-45 minutes. Store the gels wrapped in a wet paper towel in the 4ºC fridge.
+
The set up was loaded with 1x running buffer. The ladder and samples were then loaded into the lanes (20 µL of samples loaded). The protective cover and cables were attached and connected to the power supply. The gel was run at a constant voltage of 120V. The gel was run until the sample line was about 1 cm from the bottom of the gel.
-
Expression of Proteins from Lactose-Inducible Vectors:
+
In order to stain the gel, it was removed from the gel sandwich and gently added to a shallow plastic container. InstantBlue stain was added to cover the gel, and it was agitated overnight. The stain was removed and the destaining solution added. The gel incubated for 30 minutes with a paper towel. It was then rinsed with water and captured on the gel imager.
-
First, make the LB Broth by adding 10g of tryptone, 10g of NaCl, and 5g of yeast extract. Add all of it into 1000 mL of millipore water. Move 5 mL of this mixture into an overnight culture tube. Autoclave it for 1.5 hours.
+
'''Protein Activity Assay:
 +
'''
 +
Three mL of buffer and 3 mg of substrate (PNPP or PNPA) were added to a cuvette. It was then placed in the spectrophotometer, and the instrument was zeroed at 405 nm. The desired amount of the protein of interest (25-75 uL) was added, and the solution was mixed quickly. The absorbance at 405 nm was read over time. Measurements were taken every minute until a change was observed, and then measurements were taken every 30 seconds.
-
Protein Purification:
+
== Structural Alignment Through SPRITE, Chimera, Dali, and BLAST ==
-
500 mL of the protein was grown rather than 1L in an attempt to speed up induction. The overnights were grown at night on 03/16/2025 for 11 hours with 50 µg/mL of kanamycin. It was inoculated at 8 am on 03/17/2025. OD600 nm was taken in the morning until 0.4-0.8 OD (an of of 0.4 was reached around 10 am). It was then induced with 1 nM IPTG, and left to grow for three hours before centrifuging. In order to purify, the samples were centrifuged at 5000 x g for 20 minutes. 10 mL of lysis buffer was added and the pellets were resuspended with a pipette (50 µL was added to a seperate centrifuge tube). The cells were sonicated 5x for 30 seconds on ice in between each sonication (50 µL of a sample was added to a new centrifuge tube). The samples were centrifuged for 20 minutes at 15,000 x g (50 µL of a sample was added to a new centrifuge tube). The protein column was set up with 500 µL Ni-NTA beads and a lysis buffer was ran through it to equilibrate it (5x column volumes). The resin was pre-washed with a binding buffer. The cell extract was applied to the resin and allowed to enter. ALL of the supernatant was added (50 µL of a sample was added to a new centrifuge tube). The column was washed with 5 column volumes of buffer (50 µL of a sample was added to a new centrifuge tube). The column was eluted with 8 column volumes of buffer, the fractions were collected in 1 mL volumes and stored in 5 tubes. To store the column, 5 column volumes of water, and 1-2 mL of 20% EtOH was added and the column was capped off and stored.
+
Based on the findings through SPRITE and Chimera, 1XNY_C00 had the lowest RMSD value at 1.875 angstroms. Therefore, it is hypothesized that 3B7F was a carboxylase.
-
Protein Concentration:
+
[[Image:SPRITE.jpg]] [[Image:Chimera.jpg]]
 +
A 183 GLY matches A 207 GLY/
 +
B 419 GLY matches A 154 GLY/
 +
B 420 ALA matches A 153 ALA
-
Obtain cuvettes for
+
'''Figure 1.''' SPRITE and Chimera binding of 3B7F (yellow) with 1XNY_C00 (green).
-
SDS-PAGE:
+
Dali provided mainly xyloglucanase matches and no matches were carboxylases. Therefore, the initial hypothesis that 3B7F was a carboxylase was no longer supported. Based on the Dali results, it is now hypothesized that 3B7F is a xyloglucanase. [2]
-
Protein Activity Assay:
+
'''Table 1.''' Top Matches on Dali (Z: greater than 4.0 indicates structural significance; RMSD: root mean square deviation (distance in Angstroms between superimposed molecules); LALI: length of alignment; NRES: number of residues in hit structure; %ID: similarity between hit sequence and query sequence).
 +
[[Image:TableDali.jpg]]
-
== Structural Alignment Through SPRITE, Chimera, Dali, and BLAST ==
+
[[Image:BLAST.jpg]]
 +
'''Figure 2.''' Superimposition of 6P2N:A (orange) and 3B7F (green).
-
Based on the findings through SPRITE and Chimera, 1XNY_C00 had the lowest RMSD value at 1.875 angstroms. Therefore, it is hypothesized that 3B7F was a carboxylase.
 
- 
-
Dali gave mainly xyloglucanase matches and no matches were carboxylases, the hypothesis that 3B7F was a carboxylase was proven to be wrong. It is not hypothesized that 3B7F is a xyloglucanase. This was due to the fact that 1XNY did not show up as a match in Dali.
 
-
Xyloglucanases break down xyloglucan, a hemicellulose in plant cell walls. The substrate is xyloglucan and water for the (endo-)beta-1,4-xyloglucanases, and a common cofactor is calcium.
+
Xyloglucanases break down xyloglucan, a hemicellulose in plant cell walls. The substrate is xyloglucan and water for the (endo-)beta-1,4-xyloglucanases, and a common cofactor is calcium. [3]
-
The BLAST search shows glycosyl hydrolases having similar sequences to 3B7F.
+
The BLAST search shows glycosyl hydrolases having similar sequences to 3B7F. The xyloglucanases/glycosyl hydrolases from the BLAST search come from Cupriavidus, a type of gram-negative bacteria. The E-values were 0.0 for most of the results, signifying high matches. The scores were high as well, with percent identification in the 90 percentage range. The query coverage was also 100% for most of the results. Based on the results, the protein family of 3B7F is likely sialidase, which breaks down sialic acid into carbohydrates. Sialidases are a family of proteins that include glycosyl hydrolases and xyloglucanases, which further supports the hypothesis that 3B7F is a xyloglucanase. [4]
== Using InterPro to Predict Protein Function ==
== Using InterPro to Predict Protein Function ==
-
Research shows that 4Q7Q is a member of the SGNH Hydrolase protein super family. BLAST and InterPro both suggested 4Q7Q’s inclusion in this family, and the known conserved residues seen from SPRITE analysis—Serine, Glycine, Asparagine, and Histidine—line up with those observed throughout this family.D,E Notably, this superfamily is also referred to as the GDSL Hydrolase superfamily.D,E
+
The family with the closest similarities to 3B7F is xyloglucanase, which matches the BLAST and Dali findings. These enzymes act on xyloglucan, a plant wall polysaccharide, by breaking the glucosidic bonds of unbranched glucose residues. Most of the proteins in this family bind to cell wall structures or participate in photosystem II, which correlates with the expected activity of 3B7F, since it would be acting on plant cell walls. This family consists mainly of bacteria, as well as eukaryotes and archaea. This also matches the BLAST data because it suggested that the sequence was commonly found in bacteria. The proteomes are a lot of bacteria and marine life. The most closely matched structures are different xyloglucanases and oligoxyloglucan reducing end-specific cellobiohydralases, which is the superfamily that the protein belongs to. The main pathways are also structural degradation pathways, which also aligns with the results we have gained so far. [5]
-
 
+
-
4Q7Q’s inclusion in this family also supports its SPRITE-derived hypothetical functionality. Rhamnogalacturonan Acetylesterase—the enzyme with one of the best SPRITE-based alignment relative to 4Q7Q—is a member of this family.F Proteins in this family are also known for containing a “unique hydrogen bond network that [stabilizes]” the active site.F
+
-
 
+
-
Regarding what protein family 4Q7Q belongs to, DALI results suggest it is a part of a sub-family of the greater GDSL/SGNH superfamily. A PDB90% DALI search labels 4Q7Q as a part of the “Lipolytic Protein G-D-S-L Family,which refers to enzymes that hydrolyze lipid substrates.I
+
== Molecular Docking with SwissDock ==
== Molecular Docking with SwissDock ==
-
The primary sequence of 4Q7Q shows several conserved sequences between it and esterase-like proteins. A sequence of GDSI—similar to the GDSL sequence seen from its family and superfamily—can be seen between 4Q7Q and enzymes like Isoamyl Acetate-Hydrolyzing Esterase. Other noteworthy conserved sequences between esterases and 4Q7Q include GxND and DGxH.
+
Based on the hydrolase ligands that we analyzed, PNP phosphate seems to be the best ligand for 3B7F, with the highest binding affinity of -8.189 kcal/mol, and the lowest binding affinity of -6.146 kcal/mol. This is a very strong binding affinity, so this substrate was used for protein assays. Lysine p nitroanilide also has its highest binding affinity at -7.286 kcal/mol and the lowest binding affinity of -6.551 kcal/mol, indicating that it is another suitable substrate for 3B7F. [6]
-
These enzymes also share similar secondary structures. Segments of alpha-helixes and beta-sheet strands appear and remain nearly entirely conserved throughout esterase analysis. A few conserved coils appear, but these sections do not appear as often as the other two secondary structures.
+
[[Image:SwissDock1.jpg]]
 +
'''Figure 3.''' 3B7F Binding with PNPP at -8.189 kcal/mol.
-
Similar conserved sequences could be found between 4Q7Q and lipases. The GDSI, GxND, and DGxH sequences can be seen from lipases like 7BXD.? The same secondary structure segments can also be located in the lipases analyzed.
+
== Protein Concentration ==
-
== Protein Purification ==
+
Based on the Bradford Assay, elution 4 had the most protein (4.118 mg/mL), while elution 3 had the second most amount of protein (3.261 mg/mL). The rest of the elutions had protein concentrations around 2 mg/mL. The R² value of the standard curve was 0.9857, indicating fairly accurate results.
-
Right-hand SPRITE analysis revealed 4Q7Q exhibited residues like those seen from enzymes operating with acetyl-like substrates. Specifically, residues Ser. 30, Gly. 69, Asn. 97, Asp. 251, and His. 254 on the A and B chains of 4Q7Q line up with similarly positioned residues on esterases like Platelet-Activating Factor Acetylhyd (PAFA), which exhibited an RMSD of 0.25 angstroms when compared to 4Q7Q.
+
[[Image:StandardCurve.jpg]]
 +
'''Figure 4.''' Standard Curve at 595 nm for 3B7F Elutions.
-
Other proteins with similar motifs of note are Thioesterase I and Rhamnogalacturonan Acetylesterase, with RMSD values of 0.46 and 0.61, respectively. These alignments focus on the same active site as PAFA did, suggesting the acetyl-like substrates 4Q7Q focuses on are similar to esters.
+
'''Table 2.''' Final Protein Concentration of Each Elution
 +
[[Image:FinalConc.jpg]]
-
PFAM graphics from DALI revealed significant structural equivalence between 4Q7Q, a lipase-like protein, Rhamnogalacturonan Acetylesterase, and Sialate O-acetylesterase.
+
== Protein Analysis ==
-
== Protein Concentration ==
+
Possibly due to errors during purification, no bands were seen on the SDS-PAGE gel for 3B7F.
-
SwissDock analysis showed a preference for larger molecules, specifically fatty acids. Lactide, Ethyl Butyrate, and Triethylene Glycol exhibited noticeably weak binding affinities to the theorized active site of 4Q7Q. These ligands may be ill-suited to act as substrates for 4Q7Q as they are remarkably polar, and lipids—one of the potential categories of substrates for 4Q7Q—are mostly non-polar.
+
Several conditions were analyzed through enzyme activity assays. Initially, 50 uL of 3B7F elution 4 with 1 mg/mL PNPP demonstrated some linearity between 0-600 seconds. However, this was not able to be replicated. Both 25 uL and 75 uL of 3B7F elution 4 were also used with PNPP, and no linearity was found. Due to the enzyme’s optimal conditions, it was also cooled down in -20°C for five minutes, however, no linearity was seen. The assay was also perfomed with elution 1 to identify if any protein was present after the wash. Again, no linearity was shown. PNPA was also used as a substrate with elution 4, and pHs of 1.5 and 5.0 were also used during separate trials in order to identify the optimal conditions of the enzyme. [7] However, these assays still showed no linearity. It was determined that there was an error during purification, leading to unidentifiable concentrations of 3B7F for enzymatic study.
-
Despite this, these ligands show noticeable hydrophobic interactions with the active site. This implies 4Q7Q uses hydrophobic regions to help guide substrates into the right orientation for enzymatic processes. This also further supports the possibility that 4Q7Q primarily operates with hydrophobic lipid-based substrates. This also explains why Methyl Acetate exhibited a relatively weaker affinity for 4Q7Q, as its smaller structure prevented hydrophobic interactions.
+
[[Image:Assay.jpg]]
 +
'''Figure 5.''' 50 uL of 3B7F Elution 4 with 1 mg/mL PNPP.
-
== Protein Analysis ==
+
[[Image:Assay2.jpg]]
 +
Slope = 7 x 10⁻⁵ Abs/sec
 +
C = 7 x 10⁻⁵/18000*1
 +
= 3.89 x 10⁻⁹ M/sec
-
Highlight the data that helped you come to your conclusion here including any relevant figures. Make sure include potential substrates and binding sites.
+
'''Figure 6.''' Zoom in with linear trendline of Figure 5 and enzyme activity calculations.
 +
 
 +
== Conclusion ==
 +
Based on the results, no enzymatic function of 3B7F can be appropriately determined. However, based on the computational results, it is likely that 3B7F has xyloglucanase activity. In the future, the protein should be grown again in order to retest the protein’s enzymatic activity after a proper purification. The results that were successfully gathered did show accuracy and precision. However, likely due to incorrect solution making, the protein was not able to be properly purified, and therefore, was not present in the elutions. Interestingly enough, the Bradford Assay did show protein concentrations, but they likely were not for the desired protein (3B7F), which may be the reason for no protein present at the expected mass of approximately 43 kDa. If 3B7F is confirmed as a xyloglucanase, it can possibly be used for drug targeting in the colon through xyloglucan coatings. [8]
</StructureSection>
</StructureSection>
Line 108: Line 140:
== References ==
== References ==
-
A) 1WAB. Protein Database, 1997. https://www.rcsb.org/structure/1WAB
+
[1] RCSB PDB - 3B7F: Crystal structure of a putative glycosyl hydrolase with bnr repeats (reut_b4987) from ralstonia eutropha jmp134 at 2.20 A resolution. Rcsb.org. https://www.rcsb.org/structure/3b7f (accessed 2025-03-09).
-
B) Ho, Y. S.; Sewnson, L.; Derewenda, U.; Serre, L.; Wei, Y.; Dauter, Z.; Hattori, M.; Adachi, T.; Aoki, J.; Arai, H.; Inoue, K.; Derewenda, Z. S. Brain acetylhydrolase that inactivates platelet-activating factor is a G-protein-like trimer. Nature, 1997, 385, 89-93. https://www.nature.com/articles/385089a0 https://www.nature.com/articles/385089a0
+
 
-
C) Miesfeld, R. L.; McEvoy, M. M. Biochemistry, 2nd ed.; W. W. Norton & Company, 2021.
+
[2] Dali server. ekhidna2.biocenter.helsinki.fi. http://ekhidna2.biocenter.helsinki.fi/dali/.
-
D) SGNH hydrolase superfamily. InterPro, 2017. https://www.ebi.ac.uk/interpro/entry/InterPro/IPR036514/
+
 
-
E) Molgaard, A.; Kauppinen, S.; Larsen, S. Rhamnogalacturonan acetylesterase elucidates the structure and function of a new family of hydrolases. Struct., 2000, 8(4), 373-383. https://www.sciencedirect.com/science/article/pii/S0969212600001180?via%3Dihub
+
[3] Glycoside hydrolases - CAZypedia. www.cazypedia.org. https://www.cazypedia.org/index.php/Glycoside_hydrolases.
-
F) 4Q7Q. Protein Database, 2014. https://www.rcsb.org/structure/4Q7Q
+
 
-
G) Rio, T. G. D.; et al. Complete genome sequence of Chitinophaga pinensis type strain (UQM 2034). Stand. Genomic. Sci., 2010, 2(1), 87-95. https://pmc.ncbi.nlm.nih.gov/articles/PMC3035255/
+
[4] NCBI. BLAST: Basic Local Alignment Search Tool. Nih.gov. https://blast.ncbi.nlm.nih.gov/Blast.cgi.
-
H) Akoh, C. C.; Lee, G.; Liaw, Y.; Huang, T.; Shaw, J. GDSL family of serine esterases/lipases. Prog. Lipid Res., 2004, 43(6), 534-552. https://pubmed.ncbi.nlm.nih.gov/15522763/
+
 
-
I) 7BXD. Protein Database, 2021. https://www.rcsb.org/structure/7BXD
+
[5] InterPro EMBL-EBI. InterPro protein sequence analysis & classification < InterPro < EMBL-EBI. Ebi.ac.uk. https://www.ebi.ac.uk/interpro/.
-
J) Madej,T.; Lanczycki, C. J.; Zhang, D.; Thiessen, P. A.; Geer, R. C.; Marchler-Bauer, A.; Bryant, S. H. MMDB and VAST+: tracking structural similarities between macromolecular complexes. Nucleic Acids Res., 2014, 42(Database), D297-303. https://doi.org/10.1093/nar/gkt1208
+
 
 +
[6] SwissDock. www.swissdock.ch. https://www.swissdock.ch/.
 +
 
 +
[7] Shi, H.; Guo, J.; Yan, X.; Cui, G.; Tan, Z.; Zhu, X.; Zhou, J.; He, S.; Wang, T.; Li, X. Characterization of a Xyloglucananse in Biodegradation of Woody Plant Xyloglucan from Caldicellulosiruptor Kronotskyensis. BioResources 2021, 17 (1), 673–681. https://doi.org/10.15376/biores.17.1.673-681.
 +
 
 +
[8] Doggwiler, V.; Lanz, M.; Paredes, V.; Lipps, G.; Imanidis, G. Tablet Formulation with Dual Control Concept for Efficient Colonic Drug Delivery. International Journal of Pharmaceutics 2023, 631, 122499. https://doi.org/10.1016/j.ijpharm.2022.122499.
<references/>
<references/>

Current revision

Characterization and Preliminary Functionality of 3B7F


3B7F is a currently unknown protein in terms of its function. Based on current structural analysis, it consists of one unique chain with a mass of 43.34 kDa and an atom count of 3,216. The mass was found by multiply the 394 amino acids of the protein by the mass of each amino acid (~0.110 kDa). Based on previous studies, 3B7F is assumed to be a glycosyl hydrolase, however, the function is still not entirely known.[1] Through the following procedures and data collection, the goal of this research was to analyze the sequence and structure of 3B7F in order to better understand its enzymatic function.

Research Question: What is the function of the 3B7F protein, and how can this be determined through both computational and wet lab techniques?

Relevance: The goal of this research is to determine the function of the 3BF7 protein in order to evaluate whether it can degrade xyloglucan or other types of carbohydrates and glycoconjugates in plants. Knowing this will allow future researchers to be able to better understand xyloglucan/carbohydrate/glycoconjugate degradation in plants, and allow for known pathways to be expanded upon. Through this experimentation, we can also learn more about the ways in which both computational bioinformatics and wet lab techniques can aid in determining the function of a protein with a known structure.

Hypothesis: 3B7F is a xyloglucanase, a type of glycosyl hydrolase that acts to degrade xyloglucan in plant cell walls. It presents optimal activity in fairly acidic conditions and demonstrates potentially satisfactory binding with PNP phosphate and lysine p nitroanilide.

Secondary Structure of 3B7F

Drag the structure with the mouse to rotate

References

[1] RCSB PDB - 3B7F: Crystal structure of a putative glycosyl hydrolase with bnr repeats (reut_b4987) from ralstonia eutropha jmp134 at 2.20 A resolution. Rcsb.org. https://www.rcsb.org/structure/3b7f (accessed 2025-03-09).

[2] Dali server. ekhidna2.biocenter.helsinki.fi. http://ekhidna2.biocenter.helsinki.fi/dali/.

[3] Glycoside hydrolases - CAZypedia. www.cazypedia.org. https://www.cazypedia.org/index.php/Glycoside_hydrolases.

[4] NCBI. BLAST: Basic Local Alignment Search Tool. Nih.gov. https://blast.ncbi.nlm.nih.gov/Blast.cgi.

[5] InterPro EMBL-EBI. InterPro protein sequence analysis & classification < InterPro < EMBL-EBI. Ebi.ac.uk. https://www.ebi.ac.uk/interpro/.

[6] SwissDock. www.swissdock.ch. https://www.swissdock.ch/.

[7] Shi, H.; Guo, J.; Yan, X.; Cui, G.; Tan, Z.; Zhu, X.; Zhou, J.; He, S.; Wang, T.; Li, X. Characterization of a Xyloglucananse in Biodegradation of Woody Plant Xyloglucan from Caldicellulosiruptor Kronotskyensis. BioResources 2021, 17 (1), 673–681. https://doi.org/10.15376/biores.17.1.673-681.

[8] Doggwiler, V.; Lanz, M.; Paredes, V.; Lipps, G.; Imanidis, G. Tablet Formulation with Dual Control Concept for Efficient Colonic Drug Delivery. International Journal of Pharmaceutics 2023, 631, 122499. https://doi.org/10.1016/j.ijpharm.2022.122499.


Personal tools