This old version of Proteopedia is provided for student assignments while the new version is undergoing repairs. Content and edits done in this old version of Proteopedia after March 1, 2026 will eventually be lost when it is retired in about June of 2026.


Apply for new accounts at the new Proteopedia. Your logins will work in both the old and new versions.


Unusual sequence numbering

From Proteopedia

Revision as of 18:12, 4 December 2017 by Eric Martz (Talk | contribs)
Jump to: navigation, search

The numbering of protein and nucleic acid sequences is arbitrary in structure files from the World Wide Protein Data Bank (PDB). Here are some examples. These PDB entries are not shown here. To explore these, the links below will display them in FirstGlance in Jmol (link with arrow) or in Proteopedia (link in parentheses).

Not Monotonic

Rarely, sequence numbers do not increase monotonically from N to C terminus. An example[1] is 4zwj (4zwj). In this chimeric protein, chain A is numbered 1002-1161 continuing 1-326 continuing 2012-2361. That is, there are sudden jumps in numbering of consecutive amino acids: 1161 to 1, and 326 to 2012.

References

  1. Thanks to Rachel Kramer Green of RCSB.org for this example.

Proteopedia Page Contributors and Editors (what is this?)

Eric Martz

Personal tools