User:Neel Bhagat/Sandbox 1
From Proteopedia
Line 2: | Line 2: | ||
=== Overview === | === Overview === | ||
- | The SRp20 protein is an alternative splicing factor found in homo sapiens as well as many other [https://en.wikipedia.org/wiki/Eukaryote eukaryotes]. It is a relatively small protein with a length of 164 amino acids and a weight of about 19kDa. In fact, it is the smallest member of the SR protein family. The protein contains two domains: a serine-arginine rich (SR) domain and a RNA-recognition domain (RRM)<ref>https://doi.org/10.1016/j.bbrc.2013.05.027</ref>. | + | The SRp20 protein is an alternative splicing factor found in homo sapiens as well as many other [https://en.wikipedia.org/wiki/Eukaryote eukaryotes]. It is a relatively small protein with a length of 164 amino acids and a weight of about 19kDa. In fact, it is the smallest member of the SR protein family. The protein contains two domains: a serine-arginine rich (SR) domain and a RNA-recognition domain (RRM)<ref>[https://doi.org/10.1016/j.bbrc.2013.05.027]</ref>. |
=== History === | === History === | ||
Splicing is one step in the process of RNA maturation that cuts out introns and joins exons together. Both the spliceosome, a complex of snRNAs (U1, U2, etc.), and splicing factors like SRp20 interact with intron consensus sequences in the pre-mRNA to regulate this process. [https://en.wikipedia.org/wiki/Alternative_splicing Alternative splicing] allows one mRNA molecule to produce numerous proteins that perform different functions in a cell by inclusion and exclusion of RNA sequences. There are two main families of splicing factors: Serine-Arginine rich (SR) proteins and heterogeneous nuclear RiboNucleoProteins ([https://en.wikipedia.org/wiki/Heterogeneous_ribonucleoprotein_particle hnRNPs]). | Splicing is one step in the process of RNA maturation that cuts out introns and joins exons together. Both the spliceosome, a complex of snRNAs (U1, U2, etc.), and splicing factors like SRp20 interact with intron consensus sequences in the pre-mRNA to regulate this process. [https://en.wikipedia.org/wiki/Alternative_splicing Alternative splicing] allows one mRNA molecule to produce numerous proteins that perform different functions in a cell by inclusion and exclusion of RNA sequences. There are two main families of splicing factors: Serine-Arginine rich (SR) proteins and heterogeneous nuclear RiboNucleoProteins ([https://en.wikipedia.org/wiki/Heterogeneous_ribonucleoprotein_particle hnRNPs]). | ||
- | The SRp20 protein belongs to the SR protein family. All SR proteins are defined by a RNA-binding domain at the N-terminus and a serine-arginine rich domain at the C-terminus<ref>[10.1101/gad.6.5.837]</ref>. The discovery of this family started in the 1900s with the [https://en.wikipedia.org/wiki/Serine/arginine-rich_splicing_factor_1 SF2] (SRp30a) protein and has since come to include twelve proteins, all of which act as splicing factors. SRp20 was first discovered in calf thymus when it was separated with several other SR proteins based on their molecular weight<ref> | + | The SRp20 protein belongs to the SR protein family. All SR proteins are defined by a RNA-binding domain at the N-terminus and a serine-arginine rich domain at the C-terminus<ref>[10.1101/gad.6.5.837]</ref>. The discovery of this family started in the 1900s with the [https://en.wikipedia.org/wiki/Serine/arginine-rich_splicing_factor_1 SF2] (SRp30a) protein and has since come to include twelve proteins, all of which act as splicing factors. SRp20 was first discovered in calf thymus when it was separated with several other SR proteins based on their molecular weight<ref>10.1101/gad.6.5.837</ref>. |
- | An identical protein, called [http://www.uniprot.org/uniprot/Q9V3V0 X16], was discovered in an earlier paper studying different genes that change expression during [https://en.wikipedia.org/wiki/B_cell B-cell] development<ref>[https://doi.org/10.1016/ | + | An identical protein, called [http://www.uniprot.org/uniprot/Q9V3V0 X16], was discovered in an earlier paper studying different genes that change expression during [https://en.wikipedia.org/wiki/B_cell B-cell] development<ref>[https://doi.org/10.1016/j.bbrc.2013.05.027]</ref>. At the time, the protein was assumed to play a role in RNA processing and cellular proliferation, a finding that was later proved to be true<ref>[https://doi.org/10.1093/nar/19.6.1273]</ref><ref>[https://doi.org/10.1016/S0168-9525(01)02626-9]</ref>. |
The SRp20 protein has been shown to play a role in cancer progression and neurological disorders, specifically through alternative splicing. For example, SRp20 has been shown to play a role in alternative splicing of the Tau protein, an integral protein in the progression of Alzheimer’s disease<ref>[https://doi.org/10.1093/nar/19.6.1273]</ref>. SRp20 has even been found to serve as a splicing factor for its own mRNA, influencing the inclusion of exon 4<ref>[https://doi.org/10.1093/nar/19.6.1273]</ref>. Another function of SRp20 is its role in export of mRNA out of the nucleus, notably [https://en.wikipedia.org/wiki/Histone_H2A H2A histone] mRNA export<ref>[10.1038/sj.emboj.7601385]</ref>. | The SRp20 protein has been shown to play a role in cancer progression and neurological disorders, specifically through alternative splicing. For example, SRp20 has been shown to play a role in alternative splicing of the Tau protein, an integral protein in the progression of Alzheimer’s disease<ref>[https://doi.org/10.1093/nar/19.6.1273]</ref>. SRp20 has even been found to serve as a splicing factor for its own mRNA, influencing the inclusion of exon 4<ref>[https://doi.org/10.1093/nar/19.6.1273]</ref>. Another function of SRp20 is its role in export of mRNA out of the nucleus, notably [https://en.wikipedia.org/wiki/Histone_H2A H2A histone] mRNA export<ref>[10.1038/sj.emboj.7601385]</ref>. | ||
<StructureSection load='2i2y' size='400' side='right' caption='SRp20 bound to RNA ligand and IgG binding domain 1 (PDB entry [[2i2y]])' scene=''> | <StructureSection load='2i2y' size='400' side='right' caption='SRp20 bound to RNA ligand and IgG binding domain 1 (PDB entry [[2i2y]])' scene=''> | ||
Line 13: | Line 13: | ||
=== Structure Determination === | === Structure Determination === | ||
- | Attempts to determine the structure of native SRp20 have been largely unsuccessful due to the low solubility of the protein. This is likely due to the hydrophobic core of the RRM and exposed hydrophobic residues for RNA recognition on the β-sheets. As a solution, researchers removed the SR domain from the C terminus, leaving only the SRp20 RRM and a small arginine rich segment at the C terminus, then fused with a soluble <scene name='78/782597/Imager0/2'>IgG binding domain</scene> of Streptococcal protein G to the N terminus of the protein, providing the first published structure of the SRp20 RRM via NMR3. However, the solution of the structure via [https://en.wikipedia.org/wiki/Nuclear_magnetic_resonance NMR], in addition to fusion with a globular tag, results in multiple possible conformations of the protein, meaning measurements such as bond angles, lengths, and substrate interactions are variable. Further, information concerning structural aspects of the SR domain are still limited to experimental data of protein function with certain mutations or deletions, and by comparison to sister proteins such as 9G8. To date, structure of the SR domain or the protein without the globular tag have not been solved, nor has a crystal structure for any part of the protein been determined<ref>[https://doi.org/10.1016/ | + | Attempts to determine the structure of native SRp20 have been largely unsuccessful due to the low solubility of the protein. This is likely due to the hydrophobic core of the RRM and exposed hydrophobic residues for RNA recognition on the β-sheets. As a solution, researchers removed the SR domain from the C terminus, leaving only the SRp20 RRM and a small arginine rich segment at the C terminus, then fused with a soluble <scene name='78/782597/Imager0/2'>IgG binding domain</scene> of Streptococcal protein G to the N terminus of the protein, providing the first published structure of the SRp20 RRM via NMR3. However, the solution of the structure via [https://en.wikipedia.org/wiki/Nuclear_magnetic_resonance NMR], in addition to fusion with a globular tag, results in multiple possible conformations of the protein, meaning measurements such as bond angles, lengths, and substrate interactions are variable. Further, information concerning structural aspects of the SR domain are still limited to experimental data of protein function with certain mutations or deletions, and by comparison to sister proteins such as 9G8. To date, structure of the SR domain or the protein without the globular tag have not been solved, nor has a crystal structure for any part of the protein been determined<ref>[https://doi.org/10.1016/j.bbrc.2013.05.027]</ref>. |
== Splicing Activity == | == Splicing Activity == | ||
Line 24: | Line 24: | ||
The SRP20 RRM (aa 1-86) a βαββαβ <scene name='78/782597/Imager1/2'>pattern</scene>, common of many other RRMs3. For substrate binding, researchers used a 4 base RNA with sequence CAUC, which matches the SRP20 recognition sequence found on corresponding H2A mRNA. The RNA bases each <scene name='78/782597/Imager2/4'>stack</scene> onto an aromatic side chain protruding from one of the SRP20 β-sheets, forming the primary interactions which allow substrate binding to the protein. In particular, C1 <scene name='78/782597/Imager3/2'>stacks</scene> on Y13 in β1, <scene name='78/782597/Imager4/2'>A2</scene> stacks on F50 in β3, and F48 of β3 sits in between the sugar rings of C1 and A2. It should also be noted that A2 adopts an irregular <scene name='78/782597/Imager5/2'>syn</scene> conformation when bound to the RRM, something that was observed only for guanine in the 2 position previously3. U3 <scene name='78/782597/Imager8/2'>Stacks</scene> onto F48 in β3, also W40 and A42 in β2, however when bound, U3 <scene name='78/782597/Imager9/2'>bulges</scene> out of line in comparison to the rest of the substrate. C4 partially stacks over <scene name='78/782597/Imager6/2'>A2</scene>, and also forms hydrogen <scene name='78/782597/Imager7/2'>bonds</scene> between the C4 amino group and A2 2’ oxygen. | The SRP20 RRM (aa 1-86) a βαββαβ <scene name='78/782597/Imager1/2'>pattern</scene>, common of many other RRMs3. For substrate binding, researchers used a 4 base RNA with sequence CAUC, which matches the SRP20 recognition sequence found on corresponding H2A mRNA. The RNA bases each <scene name='78/782597/Imager2/4'>stack</scene> onto an aromatic side chain protruding from one of the SRP20 β-sheets, forming the primary interactions which allow substrate binding to the protein. In particular, C1 <scene name='78/782597/Imager3/2'>stacks</scene> on Y13 in β1, <scene name='78/782597/Imager4/2'>A2</scene> stacks on F50 in β3, and F48 of β3 sits in between the sugar rings of C1 and A2. It should also be noted that A2 adopts an irregular <scene name='78/782597/Imager5/2'>syn</scene> conformation when bound to the RRM, something that was observed only for guanine in the 2 position previously3. U3 <scene name='78/782597/Imager8/2'>Stacks</scene> onto F48 in β3, also W40 and A42 in β2, however when bound, U3 <scene name='78/782597/Imager9/2'>bulges</scene> out of line in comparison to the rest of the substrate. C4 partially stacks over <scene name='78/782597/Imager6/2'>A2</scene>, and also forms hydrogen <scene name='78/782597/Imager7/2'>bonds</scene> between the C4 amino group and A2 2’ oxygen. | ||
While all 4 bases form a number of hydrophobic stacking interactions, alteration to the last 3 bases of substrate sequence does not significantly impact binding affinity, while C to G mutation of C1 results in a 10-fold decrease in binding affinity. This suggests that C1 interacts specifically with the protein, while positions 2-4 interact nonspecifically3. The Srp20 RRM is able to recognize C1 with high specificity primarily through 4 <scene name='78/782597/Imager10/3'>hydrogen bonds</scene>: from the C1 amino protons to Leu 80 backbone carbonyl oxygen and to Glu 79 side-chain carbonyl oxygen, from C1 N3 to Asn82 amide, and C1 O2 with Ser 81 side chain hydroxyl group. | While all 4 bases form a number of hydrophobic stacking interactions, alteration to the last 3 bases of substrate sequence does not significantly impact binding affinity, while C to G mutation of C1 results in a 10-fold decrease in binding affinity. This suggests that C1 interacts specifically with the protein, while positions 2-4 interact nonspecifically3. The Srp20 RRM is able to recognize C1 with high specificity primarily through 4 <scene name='78/782597/Imager10/3'>hydrogen bonds</scene>: from the C1 amino protons to Leu 80 backbone carbonyl oxygen and to Glu 79 side-chain carbonyl oxygen, from C1 N3 to Asn82 amide, and C1 O2 with Ser 81 side chain hydroxyl group. | ||
- | The semi specific RNA recognition is a mechanism which reduces evolutionary pressure on bound mRNA by increasing the number of possible RNA recognition sequences. As a result, tolerance for possible mutation in the RNA sequence is increased, meaning Srp20 can bind a more diverse range of substrates, or even original substrates that were mutated during replication (eg. H2A mRNA with a point mutation) thereby increasing organism survival chance by reducing the probability of physiological impact as a result of certain mutations<ref>[ | + | The semi specific RNA recognition is a mechanism which reduces evolutionary pressure on bound mRNA by increasing the number of possible RNA recognition sequences. As a result, tolerance for possible mutation in the RNA sequence is increased, meaning Srp20 can bind a more diverse range of substrates, or even original substrates that were mutated during replication (eg. H2A mRNA with a point mutation) thereby increasing organism survival chance by reducing the probability of physiological impact as a result of certain mutations<ref>[10.1038/sj.emboj.7601385]</ref>. |
=== Tip Associated Protein Binding Domain === | === Tip Associated Protein Binding Domain === | ||
[[Image:AASequence.jpg|350px|right|thumb|'''Figure 3.''' SRp20 Domain Representation. Shown are the RRM (green), the TAP binding linker (red line) and SR-rich domain (blue).]] | [[Image:AASequence.jpg|350px|right|thumb|'''Figure 3.''' SRp20 Domain Representation. Shown are the RRM (green), the TAP binding linker (red line) and SR-rich domain (blue).]] | ||
- | In addition to RNA recognition and alternative splicing functions, SRP20 has been shown to associate with Tip Associated Protein (TAP), an mRNA export factor, to promote transport of bound mRNA out of the nucleus for eventual translation3. In particular SRp20 promotes the export of H2A histone mRNA, by binding the CAUC consensus sequence on the mRNA and binding TAP. Previous experiments have shown that Srp20 binding TAP is dependent on presence of both the <scene name='78/782597/Rrmredgreen/1'>SRp20 RRM</scene> and a short arginine rich C-terminal segment after the RRM (aa 1-83 and 84-90 respectively) ('''Figure 3'''). Previous research also shows that mutation of any one of the three arginine residues present between residues 84-90 to glutamate prevents TAP binding, indicative of the importance of these arginine residues in TAP association. The structure is only solved to residue 86 so only the <scene name='78/782597/Arginine1/4'>first arginine residue</scene> is present. The same study also found that transfer of the TAP-binding motif to a non-functional REF2 RRM still allowed for TAP-binding and nuclear export of the target protein, suggesting that not only is the TAP-binding motif transferable, it does not depend on interaction with the host RRM to retain function<ref>[ | + | In addition to RNA recognition and alternative splicing functions, SRP20 has been shown to associate with Tip Associated Protein (TAP), an mRNA export factor, to promote transport of bound mRNA out of the nucleus for eventual translation3. In particular SRp20 promotes the export of H2A histone mRNA, by binding the CAUC consensus sequence on the mRNA and binding TAP. Previous experiments have shown that Srp20 binding TAP is dependent on presence of both the <scene name='78/782597/Rrmredgreen/1'>SRp20 RRM</scene> and a short arginine rich C-terminal segment after the RRM (aa 1-83 and 84-90 respectively) ('''Figure 3'''). Previous research also shows that mutation of any one of the three arginine residues present between residues 84-90 to glutamate prevents TAP binding, indicative of the importance of these arginine residues in TAP association. The structure is only solved to residue 86 so only the <scene name='78/782597/Arginine1/4'>first arginine residue</scene> is present. The same study also found that transfer of the TAP-binding motif to a non-functional REF2 RRM still allowed for TAP-binding and nuclear export of the target protein, suggesting that not only is the TAP-binding motif transferable, it does not depend on interaction with the host RRM to retain function<ref>[10.1038/sj.emboj.7601385]</ref>. |
=== SR Domain === | === SR Domain === | ||
While the RNA-protein interaction occurs at the RRMs, SR domains are typically responsible for the recruitment of other proteins that act in the splicing mechanism. The serines within the SR domain are phosphorylated by kinases within the cell to direct them to pre-mRNA sites (Figure 4). [[Image:phosphorylatedserine.png|300px|right|thumb|'''Figure 4.''' An example of one phosphorylated serine within the RS domain.]] Phosphorylation acts as recruiting tools for the SRp20 protein and other SR proteins to promote splicing. However, some SR proteins have shown that phosphorylation actually leads to a decrease in splicing, as in the [https://www.ncbi.nlm.nih.gov/pubmed/14765198 SRp38] protein undergoing heat shock. Research has recently revealed that serines are dephosphorylated as splicing continues, indicating how far along the mRNA strand is in splicing. Dephosphorylation then serves as a signal to the cell that the mRNA is ready to be exported out of the nucleus. Rephosphorylation then appears to trigger the SR protein to enter back into the nucleus. | While the RNA-protein interaction occurs at the RRMs, SR domains are typically responsible for the recruitment of other proteins that act in the splicing mechanism. The serines within the SR domain are phosphorylated by kinases within the cell to direct them to pre-mRNA sites (Figure 4). [[Image:phosphorylatedserine.png|300px|right|thumb|'''Figure 4.''' An example of one phosphorylated serine within the RS domain.]] Phosphorylation acts as recruiting tools for the SRp20 protein and other SR proteins to promote splicing. However, some SR proteins have shown that phosphorylation actually leads to a decrease in splicing, as in the [https://www.ncbi.nlm.nih.gov/pubmed/14765198 SRp38] protein undergoing heat shock. Research has recently revealed that serines are dephosphorylated as splicing continues, indicating how far along the mRNA strand is in splicing. Dephosphorylation then serves as a signal to the cell that the mRNA is ready to be exported out of the nucleus. Rephosphorylation then appears to trigger the SR protein to enter back into the nucleus. | ||
- | The SR domain also appears to modulate mRNA stability, though different SR proteins have been shown to have greater effects. Some SR proteins appears to increase the sensitivity of certain mRNA sequences to degradation; this degradation appears to be controlled by SR domain interactions with the 3’ UTR of the pre-mRNA. These effects are not present without the SR domain of the protein. The destabilizing effect of SRp20 specifically has yet to be studied<ref>[ | + | The SR domain also appears to modulate mRNA stability, though different SR proteins have been shown to have greater effects. Some SR proteins appears to increase the sensitivity of certain mRNA sequences to degradation; this degradation appears to be controlled by SR domain interactions with the 3’ UTR of the pre-mRNA. These effects are not present without the SR domain of the protein. The destabilizing effect of SRp20 specifically has yet to be studied<ref>[10.1038/sj.emboj.7601385]</ref>. |
== 9G8 and SRP20 == | == 9G8 and SRP20 == | ||
Line 48: | Line 48: | ||
=== Other Genetic Disorders === | === Other Genetic Disorders === | ||
SRp20 and other SR proteins have been shown to prevent [https://en.wikipedia.org/wiki/R-loop R-loops] from forming, 3-stranded nucleic acid structures consisting of RNA and DNA. R-loops have been known to promote mutations, recombination, and chromosome rearrangement. One proposed mechanism for R-loop prevention by SRp20 is that SRp20, being a protein involved in RNA metabolism, is a binding partner of the [https://www.uniprot.org/uniprot/P11387 TOP1] protein. Underexpression of TOP1 also promotes R-loop formation. TOP1 has kinase activity that potentially phosphorylates the SR domain of SRp20, which contributes to its function. Underexpression of TOP1 would lead to loss-of-function of SRp20, which could lead to cancer and Alzheimers as mentioned above, as well as cause R-loops to form. R-loops have been associated with disorders such as [https://www.ndss.org/about-down-syndrome/down-syndrome/ Down Syndrome]<ref>[https://doi.org/10.3389/fgene.2015.00142]</ref>. | SRp20 and other SR proteins have been shown to prevent [https://en.wikipedia.org/wiki/R-loop R-loops] from forming, 3-stranded nucleic acid structures consisting of RNA and DNA. R-loops have been known to promote mutations, recombination, and chromosome rearrangement. One proposed mechanism for R-loop prevention by SRp20 is that SRp20, being a protein involved in RNA metabolism, is a binding partner of the [https://www.uniprot.org/uniprot/P11387 TOP1] protein. Underexpression of TOP1 also promotes R-loop formation. TOP1 has kinase activity that potentially phosphorylates the SR domain of SRp20, which contributes to its function. Underexpression of TOP1 would lead to loss-of-function of SRp20, which could lead to cancer and Alzheimers as mentioned above, as well as cause R-loops to form. R-loops have been associated with disorders such as [https://www.ndss.org/about-down-syndrome/down-syndrome/ Down Syndrome]<ref>[https://doi.org/10.3389/fgene.2015.00142]</ref>. | ||
- | |||
- | |||
- | </StructureSection> | ||
Line 58: | Line 55: | ||
2. Zahler, A. M., Lane, W. S., Stolk, J. A., & Roth, M. B. 1992. “SR proteins: a conserved family of pre-mRNA splicing factors.” Genes & development; 6(5):837-847.<ref>[10.1101/gad.6.5.837]</ref> | 2. Zahler, A. M., Lane, W. S., Stolk, J. A., & Roth, M. B. 1992. “SR proteins: a conserved family of pre-mRNA splicing factors.” Genes & development; 6(5):837-847.<ref>[10.1101/gad.6.5.837]</ref> | ||
- | 3. <ref>[https://doi.org/10.1093/nar/19.6.1273]</ref> | + | 3. Ayane M, Preuss U, Köhler G, & Nielsen PJ. 1991. "A differentially expressed murine RNA encoding a protein with similarities to two types of nucleic acid binding motifs." Nucleic acids research; 19(6);1273-1278.<ref>[https://doi.org/10.1093/nar/19.6.1273]</ref> |
4. Hargous Y, Hautbergue GM, Tintaru AM, Skrisovska L, Golocanov AP, et. al. 2006. “Molecular basis of RNA recognition and TAP binding by the SR proteins SRp20 and 9G8.” The EMBO Journal; 25(21):5126-5137. <ref>[10.1038/sj.emboj.7601385]</ref> | 4. Hargous Y, Hautbergue GM, Tintaru AM, Skrisovska L, Golocanov AP, et. al. 2006. “Molecular basis of RNA recognition and TAP binding by the SR proteins SRp20 and 9G8.” The EMBO Journal; 25(21):5126-5137. <ref>[10.1038/sj.emboj.7601385]</ref> | ||
Line 64: | Line 61: | ||
5. Cáceres JF and Kornblihtt AR. 2002. “Alternative splicing: multiple control mechanisms and involvement in human disease.” TRENDS in Genetics; 18(4):186-193. <ref>[https://doi.org/10.1016/S0168-9525(01)02626-9]</ref> | 5. Cáceres JF and Kornblihtt AR. 2002. “Alternative splicing: multiple control mechanisms and involvement in human disease.” TRENDS in Genetics; 18(4):186-193. <ref>[https://doi.org/10.1016/S0168-9525(01)02626-9]</ref> | ||
- | 6. <ref>[https://doi.org/10.1016/S1097-2765(03)00089-3]</ref> | + | 6. Huang Y, Gattoni R, Stévenin J, and Steitz JA. 2003. "SR splicing factors serve as adapter proteins for TAP-dependent mRNA export." Molecular cell; 11(3):837-843.<ref>[https://doi.org/10.1016/S1097-2765(03)00089-3]</ref> |
7. Shepard, P. J., & Hertel, K. J. 2009. “The SR protein family.” Genome biology; 10(10):242.<ref>[https://doi.org/10.1186/gb-2009-10-10-242]</ref> | 7. Shepard, P. J., & Hertel, K. J. 2009. “The SR protein family.” Genome biology; 10(10):242.<ref>[https://doi.org/10.1186/gb-2009-10-10-242]</ref> | ||
- | 8. Corbo C, Orrù S, Gemei M, Di Noto R, Mirabelli P, et. al. 2012. “Protein cross-talk in CD133+ colon cancer cells indicates activation of the Wnt pathway and upregulation of SRp20 that is potentially involved in | + | 8. Corbo C, Orrù S, Gemei M, Di Noto R, Mirabelli P, et. al. 2012. “Protein cross-talk in CD133+ colon cancer cells indicates activation of the Wnt pathway and upregulation of SRp20 that is potentially involved in tumerogenicity.” Proteomics; 12(12):2045-2059. <ref>[10.1002/pmic.201100370]</ref> |
9. Jia R, Li C, McCoy JP, Deng C, and Zheng Z. 2010. “SRp20 is a proto-oncogene critical for cell proliferation and tumor induction and maintenance.” International Journal of Biological <ref>[21179588]</ref> | 9. Jia R, Li C, McCoy JP, Deng C, and Zheng Z. 2010. “SRp20 is a proto-oncogene critical for cell proliferation and tumor induction and maintenance.” International Journal of Biological <ref>[21179588]</ref> |
Revision as of 04:17, 3 April 2018
Introduction
Overview
The SRp20 protein is an alternative splicing factor found in homo sapiens as well as many other eukaryotes. It is a relatively small protein with a length of 164 amino acids and a weight of about 19kDa. In fact, it is the smallest member of the SR protein family. The protein contains two domains: a serine-arginine rich (SR) domain and a RNA-recognition domain (RRM)[1].
History
Splicing is one step in the process of RNA maturation that cuts out introns and joins exons together. Both the spliceosome, a complex of snRNAs (U1, U2, etc.), and splicing factors like SRp20 interact with intron consensus sequences in the pre-mRNA to regulate this process. Alternative splicing allows one mRNA molecule to produce numerous proteins that perform different functions in a cell by inclusion and exclusion of RNA sequences. There are two main families of splicing factors: Serine-Arginine rich (SR) proteins and heterogeneous nuclear RiboNucleoProteins (hnRNPs). The SRp20 protein belongs to the SR protein family. All SR proteins are defined by a RNA-binding domain at the N-terminus and a serine-arginine rich domain at the C-terminus[2]. The discovery of this family started in the 1900s with the SF2 (SRp30a) protein and has since come to include twelve proteins, all of which act as splicing factors. SRp20 was first discovered in calf thymus when it was separated with several other SR proteins based on their molecular weight[3]. An identical protein, called X16, was discovered in an earlier paper studying different genes that change expression during B-cell development[4]. At the time, the protein was assumed to play a role in RNA processing and cellular proliferation, a finding that was later proved to be true[5][6]. The SRp20 protein has been shown to play a role in cancer progression and neurological disorders, specifically through alternative splicing. For example, SRp20 has been shown to play a role in alternative splicing of the Tau protein, an integral protein in the progression of Alzheimer’s disease[7]. SRp20 has even been found to serve as a splicing factor for its own mRNA, influencing the inclusion of exon 4[8]. Another function of SRp20 is its role in export of mRNA out of the nucleus, notably H2A histone mRNA export[9].
|