User:Neel Bhagat/Sandbox 1

From Proteopedia

(Difference between revisions)
Jump to: navigation, search
Line 22: Line 22:
=== RNA Recognition Motif ===
=== RNA Recognition Motif ===
-
The SRP20 RRM (aa 1-86) a βαββαβ <scene name='78/782597/Imager1/2'>pattern</scene>, common of many other RRMs3. For substrate binding, researchers used a 4 base RNA with sequence CAUC, which matches the SRP20 recognition sequence found on corresponding H2A mRNA. The RNA bases each <scene name='78/782597/Imager2/4'>stack</scene> onto an aromatic side chain protruding from one of the SRP20 β-sheets, forming the primary interactions which allow substrate binding to the protein. In particular, C1 <scene name='78/782597/Imager3/2'>stacks</scene> on Y13 in β1, <scene name='78/782597/Imager4/2'>A2</scene> stacks on F50 in β3, and F48 of β3 sits in between the sugar rings of C1 and A2. It should also be noted that A2 adopts an irregular <scene name='78/782597/Imager5/2'>syn</scene> conformation when bound to the RRM, something that was observed only for guanine in the 2 position previously3. U3 <scene name='78/782597/Imager8/2'>Stacks</scene> onto F48 in β3, also W40 and A42 in β2, however when bound, U3 <scene name='78/782597/Imager9/2'>bulges</scene> out of line in comparison to the rest of the substrate. C4 partially stacks over <scene name='78/782597/Imager6/2'>A2</scene>, and also forms hydrogen <scene name='78/782597/Imager7/2'>bonds</scene> between the C4 amino group and A2 2’ oxygen.
+
The SRP20 RRM (aa 1-86) a βαββαβ <scene name='78/782597/Imager1/2'>pattern</scene>, common of many other RRMs3. For substrate binding, researchers used a 4 base RNA with sequence CAUC, which matches the SRP20 recognition sequence found on corresponding H2A mRNA. The RNA bases each <scene name='78/782597/Imager2/5'>stack</scene> onto an aromatic side chain protruding from one of the SRP20 β-sheets, forming the primary interactions which allow substrate binding to the protein. In particular, C1 <scene name='78/782597/Imager3/2'>stacks</scene> on Y13 in β1, <scene name='78/782597/Imager4/2'>A2</scene> stacks on F50 in β3, and F48 of β3 sits in between the sugar rings of C1 and A2. It should also be noted that A2 adopts an irregular <scene name='78/782597/Imager5/3'>syn</scene> conformation when bound to the RRM, something that was observed only for guanine in the 2 position previously3. U3 <scene name='78/782597/Imager8/2'>Stacks</scene> onto F48 in β3, also W40 and A42 in β2, however when bound, U3 <scene name='78/782597/Imager9/2'>bulges</scene> out of line in comparison to the rest of the substrate. C4 partially stacks over <scene name='78/782597/Imager6/2'>A2</scene>, and also forms hydrogen <scene name='78/782597/Imager7/2'>bonds</scene> between the C4 amino group and A2 2’ oxygen.
While all 4 bases form a number of hydrophobic stacking interactions, alteration to the last 3 bases of substrate sequence does not significantly impact binding affinity, while C to G mutation of C1 results in a 10-fold decrease in binding affinity. This suggests that C1 interacts specifically with the protein, while positions 2-4 interact nonspecifically3. The Srp20 RRM is able to recognize C1 with high specificity primarily through 4 <scene name='78/782597/Imager10/3'>hydrogen bonds</scene>: from the C1 amino protons to Leu 80 backbone carbonyl oxygen and to Glu 79 side-chain carbonyl oxygen, from C1 N3 to Asn82 amide, and C1 O2 with Ser 81 side chain hydroxyl group.
While all 4 bases form a number of hydrophobic stacking interactions, alteration to the last 3 bases of substrate sequence does not significantly impact binding affinity, while C to G mutation of C1 results in a 10-fold decrease in binding affinity. This suggests that C1 interacts specifically with the protein, while positions 2-4 interact nonspecifically3. The Srp20 RRM is able to recognize C1 with high specificity primarily through 4 <scene name='78/782597/Imager10/3'>hydrogen bonds</scene>: from the C1 amino protons to Leu 80 backbone carbonyl oxygen and to Glu 79 side-chain carbonyl oxygen, from C1 N3 to Asn82 amide, and C1 O2 with Ser 81 side chain hydroxyl group.
The semi specific RNA recognition is a mechanism which reduces evolutionary pressure on bound mRNA by increasing the number of possible RNA recognition sequences. As a result, tolerance for possible mutation in the RNA sequence is increased, meaning Srp20 can bind a more diverse range of substrates, or even original substrates that were mutated during replication (eg. H2A mRNA with a point mutation) thereby increasing organism survival chance by reducing the probability of physiological impact as a result of certain mutations<ref name="Hargous">PMID:17036044</ref>.
The semi specific RNA recognition is a mechanism which reduces evolutionary pressure on bound mRNA by increasing the number of possible RNA recognition sequences. As a result, tolerance for possible mutation in the RNA sequence is increased, meaning Srp20 can bind a more diverse range of substrates, or even original substrates that were mutated during replication (eg. H2A mRNA with a point mutation) thereby increasing organism survival chance by reducing the probability of physiological impact as a result of certain mutations<ref name="Hargous">PMID:17036044</ref>.

Revision as of 16:45, 6 April 2018

Introduction

Overview

The SRp20 protein is an alternative splicing factor found in homo sapiens as well as many other eukaryotes. It is a relatively small protein with a length of 164 amino acids and a weight of about 19kDa. In fact, it is the smallest member of the SR protein family. The protein contains two domains: a serine-arginine rich (SR) domain and a RNA-recognition domain (RRM)[1].

History

Splicing is one step in the process of RNA maturation that cuts out introns and joins exons together. Both the spliceosome, a complex of snRNAs (U1, U2, etc.), and splicing factors like SRp20 interact with intron consensus sequences in the pre-mRNA to regulate this process. Alternative splicing allows one mRNA molecule to produce numerous proteins that perform different functions in a cell by inclusion and exclusion of RNA sequences. There are two main families of splicing factors: Serine-Arginine rich (SR) proteins and heterogeneous nuclear RiboNucleoProteins (hnRNPs). The SRp20 protein belongs to the SR protein family. All SR proteins are defined by a RNA-binding domain at the N-terminus and a serine-arginine rich domain at the C-terminus[2]. The discovery of this family started in the 1900s with the SF2 (SRp30a) protein and has since come to include twelve proteins, all of which act as splicing factors. SRp20 was first discovered in calf thymus when it was separated with several other SR proteins based on their molecular weight[2]. An identical protein, called X16, was discovered in an earlier paper studying different genes that change expression during B-cell development[1]. At the time, the protein was assumed to play a role in RNA processing and cellular proliferation, a finding that was later proved to be true[3][4]. The SRp20 protein has been shown to play a role in cancer progression and neurological disorders, specifically through alternative splicing. For example, SRp20 has been shown to play a role in alternative splicing of the Tau protein, an integral protein in the progression of Alzheimer’s disease[1]. SRp20 has even been found to serve as a splicing factor for its own mRNA, influencing the inclusion of exon 4[1]. Another function of SRp20 is its role in export of mRNA out of the nucleus, notably H2A histone mRNA export[5].

SRp20 bound to RNA ligand and IgG binding domain 1 (PDB entry 2i2y)

Drag the structure with the mouse to rotate

Proteopedia Page Contributors and Editors (what is this?)

Neel Bhagat

Personal tools