Cas9 Overview
CRISPR is a bacterial immune response to bacteriophages to prevent subsequent infections and is a form of acquired immunity. Within the CRISPR system, Cas9 is a protein responsible for cutting the viral DNA, rendering it inert. structure in Staphylococcus aureus (SaCas9) utilizes a single-stranded guide RNA (sgRNA) to bind the target DNA that will be cut. Cas9 utilizes the sgRNA as an RNA guide to cut the target DNA sequence and binds complimentary to target DNA so Cas9 create a double-stranded DNA break in the proper location. The target DNA must also have a PAM sequence to bind for Cas9 to cut target DNA. The PAM sequence acts as a two-factor authentication in junction with the sgRNA that tells the Cas9 to cut this portion of DNA. The main domains in the are the REC lobe (residues 41–425) and NUC lobe (residues 1–40 and 435–1053). These lobes are connected by an arginine rich bridge helix (residues 41–73) and a linker loop (residues 426–434). The contains RuvC, HNH, WED, and PI domains in Cas9 [1]. The is responsible for recognizing the nucleic acids present causing a conformational change in the HNH locking the HNH into the cleavage site [2]. The RuvC nuclease domain (residues 1–40, 435–480 and 650–774) cleaves the strand of target DNA that is not bound complimentary to the sgRNA [3]. The HNH domain (residues 520–628) cleaves the target strand of DNA bound to the sgRNA [3]. The WED domain (residues 788–909) is responsible for recognizing the sgRNA scaffold and consists of a twisted five-stranded beta sheet flanked by four alpha helices [4]. The PI domain (residues 910–1053) recognizes the PAM sequence on the target DNA that is not complimentary to the sgRNA [5]. There are also two linker domains, L1 (residues 481–519) and L2 (residues 629–649), that connect the RuvC and HNH [1]. Furthermore, there is a phosphate lock loop (scene of loop) (residues 775–787) that connect the WED and RuvC domains. Cas9 has four main mechanisms that are important for successful cleavage, including recognition of the sgRNA-target heteroduplex, recognition of the PAM sequence, recognition of the sgRNA scaffold, and endonuclease activity by HNH and RuvC.
Recognition of the sgRNA-target heteroduplex
The recognition of the sgRNA-target heteroduplex in Cas9 begins by inserting itself into the central channel between the REC and NUC lobes. A heteroduplex is the binding of the complimentary strands of the sgRNA and target DNA. The REC lobe and bridge helix interacts with the seed region of the (C13-C20). The seed region is in the , so it can bind the target DNA. Only the REC lobe interacts with the PAM distal region pf the sgRNA (A3-U6) through the . The target DNA binds to the for the proper conformation for base paring between the target DNA and sgRNA[1].
Recognition of the PAM sequence
For the recognition of the , the target DNA with the PAM sequence (5’-NNGRRN-3’) is bound to SaCas9 through bidentate hydrogen bonds (scene of bonds) as well as direct and water mediated hydrogen bonds through the major groove in the PI domain. The WED domain recognizes the minor groove phosphate backbone of the duplex [1].
Recognition of the sgRNA scaffold
The SaCas9 recognizes the sgRNA scaffold within the . The WED contains five stranded beta sheets flanked with four alpha helices to allow binding of the repeat: anti-repeat duplex. REC lob binds the scaffold and secures it into the SaCas9 [1].
Endonuclease Activity of Cas9
Finally, are in endonuclease activity (Scene). RuvC uses a two-metal ion mechanism of manganese to cleave the non-target DNA and causes a conformational change in L1. This is modeled as manganese however, it is often magnesium in cell. The magnesium allows a histidine to become a general base and cleave the target DNA. This conformational change leads to the phosphate group of the target strand to be cleaved by HNH. HNH includes a beta beta alpha metal fold and uses a one metal ion mechanism to cleave the target DNA [1].