SEE CRISPR-Cas Part I
 CRISPR adaptation
The spacers of a CRISPR array represent a chronological archive of previous invader encounters. The captured spacer sequences are integrated into the CRISPR loci after exposure to MGEs, at the leader end of the array that contains the start site of CRISPR transcription. Analysis of invader target sequences (also called protospacers) has revealed a short motif directly adjacent to the target sequence, called the protospacer adjacent motif (PAM). This PAM motif allows self/nonself discrimination by the host in two ways: (i) because its presence in alien targets is required for nonself interference, and (ii) because its absence in the host’s CRISPR array avoids self-targeting. In class 1–type I and class 2–type II systems, the PAM is not only involved in interference, but also plays a role in spacer selection during the adaptation stage, implying the acquisition of functional spacers only. The PAM is a short [2 to 7 nucleotides (nt)], partially redundant sequence that in itself cannot preclude incorporation of spacers from the host DNA because of the low information content of the motif. The short PAM appears to be the result of an evolutionary trade-off between efficient incorporation of spacers from nonself DNA and preventing an autoimmune reaction.[1]
Examples of PAM:
Although host chromosomal fragments can be incorporated as new CRISPR spacers, detection of such events obviously implies that this did not result in a lethal phenotype, either due to a modified PAM and/or to an inactivated CRISPR-Cas effector module [2]. Indeed, in the absence of the effector module, elevated frequencies of self-spacer acquisition occur in Escherichia coli [3]. Similarly, Streptococcus thermophilus with a catalytically inactive Cas9 results in a major increase of spacers derived from the host genome [4]. In addition, there is a strong preference for the integration of plasmid over chromosomal spacer sequences [3], with plasmid sequences incorporated more frequently than host DNA by two to three orders of magnitude [5]. Spacer acquisition in E. coli requires active replication of the protospacercontaining DNA [5]. Thus, small, fast-replicating plasmid genomes are a much better source of spacers than the large host DNA, and such findings are consistent with acquisition of spacers from an infecting virus genome in the archaeon Sulfolobus islandicus requiring its active replication [6]. In E. coli, the CRISPR-Cas system derives the spacers primarily from products of RecBCD-catalyzed DNA degradation that are formed during the repair of double-stranded breaks associated with stalled replication forks [7]. Other possible sources of substrates for CRISPR adaptation include DNA fragments generated
either by other defense systems, such as restriction-modification systems [8], or by the CRISPR-Cas system itself [9].[1]
 play crucial roles in spacer acquisition in all CRISPR-Cas systems [3]. In addition, these proteins can function in trans, provided that the repeats involved are sufficiently similar in size and structure. Accordingly, cas1 and cas2 genes are missing in many active CRISPR-Cas loci—in particular, of type III as well as types IV and VI [10]. Overexpression of Cas1 and Cas2 from the E. coli type I-E system has been shown to be sufficient for the extension of the CRISPR array [3]. Mutations in the active site of Cas1 abolish spacer integration in E. coli [3], whereas the nuclease activity of Cas2 is dispensable [11]. In E. coli, a  (5dqz; Fig. 3A [11], [12]), after which the newly generated spacers can be integrated into a CRISPR array via a recombination mechanism akin to that of retroviral integrases and transposases [13] (Fig. 3B). , resulting in a DNA cleavage product. . The DNA cleavage site is labeled in red. 
 
  Fig. 3 Spacer acquisition. (A) Crystal structure of the complex of Cas1-Cas2 bound to the dual-forked DNA (PDB accession 
5dqz). The target DNA is shown in dark blue; the Cas1 and Cas2 dimers of the complex are indicated in blue and yellow, respectively. (B) Model explaining the capture of new DNA sequences from invading nucleic acid and the subsequent DNA integration into the host CRISPR array. The numbers on the left correspond to the order of events as described in the text. The dashed lines indicate nucleotides; the nucleotides C and N on the two sides of the protospacer are shown in red and green to clarify the orientation. From 
[1]In several type III CRISPR-Cas systems, Cas1 is fused to reverse transcriptase [14], and it was recently shown that these systems are capable of acquisition of RNA spacers by direct incorporation of an RNA segment into the CRISPR array followed by reverse transcription and replacement of the RNA strand by DNA [15]. Although the biological function of this process remains to be elucidated, these findings demonstrate remarkable versatility of adaptation pathways. Spacer acquisition (adaptation) in type I systems proceeds along two distinct paths: (i) naïve acquisition, which occurs during an initial infection, and (ii) primed acquisition, when the CRISPR contains a previously integrated spacer that is complementary to the invading DNA [16]. According to the proposed model, naïve spacer adaptation involves five steps (Fig. 3B):
1) Fragmentation of (mainly) invasive nucleic acids by non-Cas systems [e.g., by RecBCD after stalling a replication fork, or by restriction enzymes (restriction-modification systems) [5][8]] or by CRISPR-associated nucleases [9]. Although this step may be non-essential, it probably enhances the efficiency of the overall process and its specificity toward invading DNA.
2) Selection of DNA fragments for (proto) spacers by scanning for potential PAMs (after partial target unwinding) by one of the four Cas1 subunits of the Cas1-Cas2 complex [17].
3) Measuring of the selected protospacer generating fragments of the correct size with 3′ hydroxyl groups by Cas1 nuclease.
4) Nicking of both strands of the leaderproximal repeat of the CRISPR array at the 5′ ends through a direct nucleophilic attack by the generated 3′ OH groups, resulting in covalent links of each of the strands of the newly selected spacer to the single-stranded repeat ends.
5) Second-strand synthesis and ligation of the repeat flanks by a non-Cas repair system [18][13].[1]
Primed spacer adaptation so far has been demonstrated only in type I systems [19][20][21]. This priming mechanism constitutes a positive feedback loop that facilitates the acquisition of new spacers from formerly encountered genetic elements [22]. Priming can occur even with spacers that contain several mismatches, making them incompetent as guides for targeting the cognate foreign DNA [22]. Based on PAM selection, functional spacers are preferentially acquired during naïve adaptation. This initial acquisition event triggers a rapid priming response after subsequent infections. Priming appears to be a major pathway of CRISPR adaptation, at least for some type I systems [20]. Primed adaptation strongly depends on the spacer sequence [23], and the acquisition efficiency is highest in close proximity to the priming site. In addition, the orientation of newly inserted spacers indicates a strand bias, which is consistentwith the involvement of singlestranded adaption intermediates [24]. According to one proposed model [25], replication forks in the invader’s DNA are blocked by the Cascade complex bound to the priming crRNA, enabling the RecG helicase and the Cas3 helicase/nuclease proteins to attack the DNA. The ends at the collapsed forks then could be targeted by RecBCD, which provides DNA fragments for new spacer generation [25]. Given that the use of crRNA for priming has much less strict sequence requirements than direct targeting of the invading DNA, priming is a powerful strategy that might have evolved in the course of the host-parasite arms race to reduce the escape by viral mutants, to provide robust resistance against invading DNA, and to enhance self/nonself discrimination. Naïve as well as primed adaptation in the subtype I-F system of Pseudomonas aeruginosa CRISPR-Cas require both the adaptation and the effector module [24].[1]
In the type II-A system, the Cas9-tracrRNA complex and Csn2 are involved in spacer acquisition along with the Cas1-Cas2 complex [4][26]; the involvement of Cas9 in adaptation is likely to be a general feature of type II systems. Although the key residues of Cas9 involved in PAM recognition are dispensable for spacer acquisition, they are essential for the incorporation of new spacers with the correct PAM sequence [26].
The involvement of Cas9 in PAM recognition and protospacer selection [26] suggests that in type II systems Cas1 may have lost this role. Similarly, Cas4 that is present in subtypes IA-D and II-B has been proposed to be involved in the CRISPR adaptation process, and this prediction has been validated experimentally for type I-B [20]. Cas4 is absent in the subtype II-C system of Campylobacter jejuni. Nonetheless, a conserved Cas4-like protein found in Campylobacter bacteriophages can activate spacer acquisition to use host DNA as an effective decoy to bacteriophage DNA. Bacteria that acquire self-spacers and escape phage infection must either overcome CRISPR-mediated autoimmunity by loss of the interference functions, leaving them susceptible to foreign DNA invasions, or tolerate changes in gene regulation [27]. Furthermore, in subtypes I-U and V-B, Cas4 is fused to Cas1, which implies cooperation between these proteins during adaptation. In type I-F systems, Cas2 is fused to Cas3 [28], which suggests a dual role for Cas3 [29]: involvement in adaptation as well as in interference. These findings support the coupling between the adaptation and interference stages of CRISPR-Cas defense during priming.[1]
 Biogenesis of crRNAs
The short mature crRNAs contain spacer sequences, which are the guides that are responsible for the specificity of CRISPR-Cas immunity [30]. They associate with one or more Cas proteins to form effector complexes that target invading MGEs through crRNA:target sequence–specific recognition. The CRISPR arrays are transcribed as long precursors, known as precrRNA, that may contain secondary structured elements (hairpins) in those cases where the CRISPR contains palindromic repeats. The processing of the pre-crRNA typically yields 30- to 65-nt mature crRNAs that consist of a single spacer flanked by a partial repeat at either one or both ends [30][31].[1]
The pathways of crRNA biogenesis differ among the different CRISPR-Cas types. In class 1 systems, the Cas6 protein is critical for the primary processing of pre-crRNA. Cas6 is a metal-independent endoribonuclease that recognizes and cleaves a single phosphodiester bond in the repeat sequences of a pre-crRNA transcript. Members of the Cas6 family contain two RRM-type RNA-binding domains. The primary cleavage by Cas6 results in crRNAs containing a repeat-derived 5′ “handle” of 8 nt with a 5′ hydroxyl group, followed by the complete spacer sequence and a repeat-derived 3′ handle of variable size that in some subtypes forms a hairpin structure with either a 3′-phosphate or a cyclic 2′,3′-phosphate. The Cas6 family proteins show considerable structural variation that might reflect the cleavage specificity.[1]
 Summary of the most extensively characterized CRISPR endoribonucleases[32][1]
 Representatives of class 1 and class 2
Class 1 CRISPR-Cas systems are considered to be the evolutionary ancestral systems. The class 2 systems have evolved from class 1 systems via the insertion of transposable elements encoding various nucleases, and are now being used as tools for genome editing.[1]
 (4qyz).
 (3x1l).
 (5f9r).
 5b43.
 Class 1
 CRISPR subtype III-B (Cmr complex)
 CRISPR subtype Orphan
- TthCas6A (TTHB78) from T. thermophilus.  (4c8z). Other representatives: 4c8y, 4c97.
 CRISPR type IV (Csf1)
 Class 2
 CRISPR type V (Cpf1) - SEE CRISPR type V
 CRISPR type VI (C2c2)
 See aslo