Amino acid composition
From Proteopedia
Revision as of 23:41, 23 April 2020
The amino acid composition of a protein is the percentage of each amino acid in that protein's sequence. The percentage, sometimes called the mole percentage, is calculated as the number of occurrences of a given amino acid divided by the total number of amino acids in the protein chain or molecule, multiplied by 100.
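The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `amino_acid_composition` and the example sequence are hypothetical, and the input is assumed to be a plain string of one-letter amino acid codes.

```python
from collections import Counter


def amino_acid_composition(sequence: str) -> dict[str, float]:
    """Return the percentage of each residue in a one-letter protein sequence.

    Hypothetical helper for illustration; assumes one-letter codes and
    ignores any non-letter characters such as whitespace or gap symbols.
    """
    residues = [c for c in sequence.upper() if c.isalpha()]
    if not residues:
        raise ValueError("sequence contains no residues")
    counts = Counter(residues)
    total = len(residues)
    # Percentage = count of a given amino acid / total residues * 100
    return {aa: 100.0 * n / total for aa, n in sorted(counts.items())}


# Made-up 8-residue example sequence, not taken from any real protein.
composition = amino_acid_composition("MKKLLEAG")
for aa, pct in composition.items():
    print(f"{aa}: {pct:.1f}%")
```

In this example, Lys and Leu each occur twice in eight residues, so each accounts for 25% of the composition; the percentages over all residues sum to 100.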
The GC content of an organism's genome is the strongest genome-level determinant of amino acid composition[1].
Other, weaker influences are:
- Growth temperatures (mesophily/thermophily/hyperthermophily). Thermophiles have more glutamic acid (with a reduction in glutamine), and more lysine and arginine[1]. This likely relates to the larger number of salt bridges in proteins of thermophiles, which are believed to contribute to thermostability.
- Chain length. Proteins of thermophiles are, on average, shorter than those of mesophiles: average lengths are 283 and 340 amino acids, respectively[1]. A study of ~550,000 proteins with lengths of 50-200 amino acids[2] concluded:
  - Frequency increases with length, reaching a plateau: Ala, Asp, Glu, Gly, Pro, Val.
  - Frequency decreases with length: Cys, Phe, His, Ile, Lys, Met, Asn, Ser.
  - Leu and Tyr are most frequent in short and long chains, and less frequent in middle-sized proteins.
  - Arg peaks in middle-sized proteins.
References
- [1] Tekaia F, Yeramian E, Dujon B. Amino acid composition of genomes, lifestyles of organisms, and evolutionary trends: a global picture with correspondence analysis. Gene. 2002 Sep 4;297(1-2):51-60. doi:10.1016/s0378-1119(02)00871-5. PMID:12384285
- [2] Carugo O. Amino acid composition and protein dimension. Protein Sci. 2008 Dec;17(12):2187-91. Epub 2008 Sep 9. doi:10.1110/ps.037762.108. PMID:18780815
