Like many viruses, SARS-CoV-2 synthesizes its proteins as long polypeptide chains that must be cleaved to form functional proteins. The coronavirus ORF1 polyprotein can be divided into an N-terminal region that is processed by one or two papain-like proteases and a C-terminal region that is processed by the main protease. [1] While the papain-like protease(s) cleave only three sites, the main protease cleaves 11 sites in the polyprotein to generate functional proteins. Additionally, the main protease cleaves its own N- and C-terminal autoprocessing sites. The cleaved functional proteins include viral enzymes needed for replication, such as the RNA-dependent RNA polymerase and a helicase, as well as other non-structural or accessory proteins such as an exoribonuclease, an endoribonuclease, an ssRNA-binding protein and a 2’-O-ribose methyltransferase. [2]
Overall Structure and Active Site of the Main Protease
The main protease is a cysteine protease that is essential for the viral life cycle. Its fold resembles that of an augmented serine protease, and it forms a homodimer of two protomers, A and B, oriented roughly perpendicular to each other. Each protomer consists of three domains. Domains I and II (the N-terminal domain) form an antiparallel, chymotrypsin-like β-barrel structure. Domain III (the C-terminal end) consists of five α-helices arranged in an antiparallel cluster. [3] [4] The protease is maximally active as a homodimer; the substrate-binding site is located in a catalytic cleft between the two N-terminal β-barrel structures (between domains I and II). The substrate-binding site involves a catalytic dyad consisting of the residues Cys145 and His41. The N- and C-terminal domains are connected by a long loop. [5] The N-terminal residues of each protomer, called the N-finger, make contact between the N- and C-terminal domains of the other protomer and are therefore necessary for dimerization. [6]
S1 is a subsite that lies next to the catalytic dyad and is formed by the side chains of Phe140 and His163 and the main chains of Glu166, Asn142, Gly143 and His172. It confers on the enzyme absolute specificity for the Gln-P1 substrate residue, as the carbonyl oxygen of Gln-P1 is stabilized by an oxyanion hole formed by the amide groups of Gly143 and the catalytic Cys145. [7] [8] Hence, polyproteins are cleaved within the Leu-Gln↓(Ser, Ala, Gly) sequence. [9]
[10]
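The Leu-Gln↓(Ser, Ala, Gly) consensus described above can be illustrated with a small pattern search. The following is a minimal sketch, not a validated prediction tool: it assumes one-letter amino acid codes, treats the consensus as a strict rule (a simplification of the enzyme's actual substrate preferences), and uses hypothetical function names and a made-up toy peptide rather than real polyprotein sequence.

```python
import re

# Consensus taken from the text: Leu-Gln at P2-P1, followed by Ser, Ala or Gly at P1'.
# The lookahead keeps the P1' residue out of the match, so each match ends
# exactly at the scissile bond (between P1 and P1').
CLEAVAGE_MOTIF = re.compile(r"LQ(?=[SAG])")


def find_cleavage_sites(sequence):
    """Return the 0-based index of the residue right after each predicted cut."""
    seq = sequence.upper()
    return [m.end() for m in CLEAVAGE_MOTIF.finditer(seq)]


def cleave(sequence):
    """Split a sequence into fragments at every predicted cleavage site."""
    cuts = find_cleavage_sites(sequence)
    bounds = [0] + cuts + [len(sequence)]
    return [sequence[a:b] for a, b in zip(bounds, bounds[1:])]


# Hypothetical toy peptide (not a real viral sequence), for illustration only.
toy = "MKTAVLQSGFRKLQAEELQPTVLQGNN"

if __name__ == "__main__":
    print(find_cleavage_sites(toy))
    print(cleave(toy))
```

In the toy peptide, the Leu-Gln pair followed by Pro is left uncut because Pro is not among the P1' residues listed in the consensus, which shows how the residue after the scissile bond contributes to site selection in this simplified model.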
Peptidic inhibitors
A number of structures of the main protease (Mpro) with candidate inhibitors have been determined, including 6WTT among others.