This old version of Proteopedia is provided for student assignments while the new version is undergoing repairs. Content and edits done in this old version of Proteopedia after March 1, 2026 will eventually be lost when it is retired in about June of 2026.
Apply for new accounts at the new Proteopedia. Your logins will work in both the old and new versions.
Unusual sequence numbering
From Proteopedia
The numbering of protein and nucleic acid sequences is arbitrary in structure files from the World Wide Protein Data Bank (PDB). Here are some examples. These PDB entries are not shown here. To explore these, the links below will display them in FirstGlance in Jmol (link with arrow) or in Proteopedia.
Not Monotonic
Rarely, sequence numbers do not increase monotonically from N to C terminus. An example[1] is 4zwj / 4zwj. In this chimeric protein, chain A is numbered 1002-1161 continuing 1-326 continuing 2012-2361. That is, there are sudden jumps in numbering of consecutive amino acids: 1161 to 1, and 326 to 2012.
References
- ↑ Thanks to Rachel Kramer Green of RCSB.org for this example.
