<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">988569</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2022.988569</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Structural and functional characterization of Cas2 of CRISPR-Cas subtype I-C lacking the CRISPR component</article-title>
<alt-title alt-title-type="left-running-head">Anand et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fmolb.2022.988569">10.3389/fmolb.2022.988569</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Anand</surname>
<given-names>Vineet</given-names>
</name>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1903627/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Prabhakaran</surname>
<given-names>Harshini Sheeja</given-names>
</name>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gogoi</surname>
<given-names>Prerana</given-names>
</name>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Kanaujia</surname>
<given-names>Shankar Prasad</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1602722/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Kumar</surname>
<given-names>Manish</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1107763/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Department of Biosciences and Bioengineering</institution>, <institution>Indian Institute of Technology Guwahati</institution>, <addr-line>Guwahati</addr-line>, <addr-line>Assam</addr-line>, <country>India</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/142943/overview">Andrea Mozzarelli</ext-link>, University of Parma, Italy</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/126084/overview">Santosh Panjikar</ext-link>, Australian Synchrotron, Australia</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/60643/overview">Antonello Merlino</ext-link>, University of Naples Federico II, Italy</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/276611/overview">Francisco Mart&#xed;nez-Abarca</ext-link>, Spanish National Research Council (CSIC), Spain</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Shankar Prasad Kanaujia, <email>spkanaujia@iitg.ac.in</email>; Manish Kumar, <email>mkumar1@iitg.ac.in</email>
</corresp>
<fn fn-type="equal" id="fn1">
<label>
<sup>&#x2020;</sup>
</label>
<p>These authors have contributed equally to this work</p>
</fn>
<fn fn-type="other">
<p>This article was submitted to Protein Biochemistry for Basic and Applied Sciences, a section of the journal Frontiers in Molecular Biosciences</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>09</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>9</volume>
<elocation-id>988569</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>07</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>08</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Anand, Prabhakaran, Gogoi, Kanaujia and Kumar.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Anand, Prabhakaran, Gogoi, Kanaujia and Kumar</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The genome of pathogenic <italic>Leptospira interrogans</italic> serovars (Copenhageni and Lai) are predicted to have CRISPR-Cas of subtypes I-B and I-C. Cas2, one of the core Cas proteins, has a crucial role in adaptive defense against foreign nucleic acids. However, subtype I-C lacks the CRISPR element at its loci essential for RNA-mediated adaptive immunity against foreign nucleic acids. The reason for sustaining the expense of cas genes are unknown in the absence of a CRISPR array. Thus, Cas2C was chosen as a representative Cas protein from two well-studied serovars of <italic>Leptospira</italic> to address whether it is functional. In this study, the recombinant Cas2C of <italic>Leptospira</italic> serovars Copenhageni (rLinCas2C, 12&#xa0;kDa) and Lai (rLinCas2C_Lai, 8.6&#xa0;kDa) were overexpressed and purified. Due to natural frameshift mutation in the cas2c gene of serovar Lai, rLinCas2C_Lai was overexpressed and purified as a partially translated protein. Nevertheless, the recombinant Cas2C from each serovar exhibited metal-dependent DNase and metal-independent RNase activities. The crystal structure of rLinCas2C obtained at the resolution of 2.60&#xa0;&#xc5; revealed the protein is in apostate conformation and contains N- (1&#x2013;71 amino acids) and C-terminal (72&#x2013;90 amino acids) regions, with the former possessing a ferredoxin fold. Substitution of the conserved residues (Tyr7, Asp8, Arg33, and Phe39) with alanine and deletion of Loop L2 resulted in compromised DNase activity. On the other hand, a moderate reduction in RNase activity was evident only in selective rLinCas2C mutants. Overall, in the absence of an array, the observed catalytic activity of Cas2C may be required for biological processes distinct from the CRISPR-Cas-associated function.</p>
</abstract>
<kwd-group>
<kwd>Leptospira</kwd>
<kwd>CRISPR-Cas</kwd>
<kwd>endodeoxyribonuclease</kwd>
<kwd>ribonuclease</kwd>
<kwd>deoxyribonuclease</kwd>
</kwd-group>
<contract-sponsor id="cn001">Department of Biotechnology, Ministry of Science and Technology, India<named-content content-type="fundref-id">10.13039/501100001407</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>The genus <italic>Leptospira</italic> is a spiral-shaped bacteria, the pathogenic species of which are known for causing leptospirosis disease in humans and a wide range of animals (<xref ref-type="bibr" rid="B14">Faine, 1974</xref>). In nature, the genus <italic>Leptospira</italic> exists in pathogenic, intermediate, and saprophytic forms. These forms can be classified into 26 serogroups and over 300 serovars (<xref ref-type="bibr" rid="B19">Guglielmini et al., 2019</xref>). The genome of two well-studied pathogenic leptospires (<italic>L. interrogans</italic> serovars Copenhageni and Lai) harbors genetic elements of an adaptive defense system against foreign nucleic acids known as <underline>c</underline>lustered <underline>r</underline>egularly <underline>i</underline>nterspaced <underline>s</underline>hort <underline>p</underline>alindromic <underline>r</underline>epeats and their associated genes (CRISPR-Cas) (<xref ref-type="bibr" rid="B15">Fouts et al., 2016</xref>).</p>
<p>CRISPR-Cas systems involved in combatting exotic nucleic acids are cataloged into two classes, six types, and thirty-three subtypes according to the association of the signature <italic>cas</italic> genes (<xref ref-type="bibr" rid="B33">Makarova et al., 2019</xref>). Based on the signature <italic>cas</italic> genes, the CRISPR-Cas type I system has been classified into seven subtypes (I-A, I-B, I-C, I-D, I-E, I-F, and I-U) (<xref ref-type="bibr" rid="B33">Makarova et al., 2019</xref>). The CRISPR-Cas is composed of a CRISPR array preceded by an AT-rich leader sequence and a set of effector <italic>cas</italic> genes encoding nucleases (<xref ref-type="bibr" rid="B22">Jansen et al., 2002</xref>). In the genome of pathogenic leptospires (<italic>L. interrogans</italic> serovars Copenhageni and Lai), there are two predetermined subtypes (I-B and I-C) of CRISPR-Cas (<xref ref-type="bibr" rid="B32">Makarova et al., 2015</xref>). In <italic>Leptospira</italic>, the CRISPR-Cas subtype I-C lacks the CRISPR array component and is thus considered an orphan CRISPR-Cas system (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). Therefore, it was interesting to investigate whether <italic>C</italic>RISPR-<italic>as</italic>sociated genes (<italic>cas</italic>) of subtype I-C (<italic>cas1</italic> to <italic>cas8</italic>) of <italic>Leptospira,</italic> which lack an array component, are functionally active. Thus, in our preliminary study, out of the eight <italic>cas</italic> genes, we chose <italic>cas2</italic> to clone, overexpress and check the nuclease activity of the purified recombinant protein.</p>
<p>The Cas2 proteins (80&#x2013;120 residues) are core metallonucleases found universally in all CRISPR-bearing taxa (<xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>). Although the Cas2 proteins are not involved in synthesizing the pre-crRNAs or their processing, the genetic studies signify their role in framing the initial stage (adaptation) of immunity against exotic nucleic acids (<xref ref-type="bibr" rid="B56">Yosef et al., 2012</xref>; <xref ref-type="bibr" rid="B39">Nu&#xf1;ez et al., 2014</xref>). The structural and functional characterization of several Cas2 orthologs has been conducted; however, the catalytic role of Cas2 in CRISPR biology is not well-illustrated to date (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>; <xref ref-type="bibr" rid="B27">Kwon et al., 2012</xref>; <xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>; <xref ref-type="bibr" rid="B23">Jung et al., 2016</xref>). The tertiary structure of pure Cas2 from various organisms, including SsoCas2 (<italic>Sulfolobus solfataricus</italic>), BhaCas2 (<italic>Bacillus halodurans</italic>), SpyCas2 (<italic>Streptococcus pyogenes</italic>), DvuCas2 (<italic>Desulfovibrio vulgaris</italic>), and TonCas2 (<italic>Thermococcus onnurineus</italic>) contains N- and C-terminal regions, with the former having a ferredoxin (&#x3b2;&#x3b1;&#x3b2;&#x3b2;&#x3b1;&#x3b2;) fold (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>; <xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>; <xref ref-type="bibr" rid="B23">Jung et al., 2016</xref>). The pure SsoCas2, SpyCas2, BhaCas2, DvuCas2, and TonCas2 form a dimer by the interaction of the &#x3b2;5 strand of each subunit at the C-termini (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>; <xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>; <xref ref-type="bibr" rid="B23">Jung et al., 2016</xref>). In the SsoCas2 dimer, a pair of conserved aspartate residues (Asp10) are involved in catalytic activity (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). In <italic>E. coli,</italic> one dimeric unit of Cas2 interacts with the two units of Cas1 dimers to form a heterohexameric complex. Henceforth, Cas2 of <italic>E. coli</italic> facilitates the acquisition of exotic nucleic acid (protospacers) non-catalytically into the CRISPR array (<xref ref-type="bibr" rid="B39">Nu&#xf1;ez et al., 2014</xref>; <xref ref-type="bibr" rid="B40">Rollie et al., 2015</xref>; <xref ref-type="bibr" rid="B29">Lee et al., 2019</xref>). The active-site mutation of Cas2 does not abolish the spacer acquisition (adaptation) by the heterohexameric complex of Cas1-Cas2 in <italic>E. coli.</italic> Thus, the biological significance of Cas2 catalytic activity is equivocal in CRISPR biology. Indeed, the Cas2 of <italic>E. coli</italic> acts non-catalytically as a yardstick to gauge the protospacer length, while Cas1 functions as an integrase (endonuclease) on the cut and paste mechanism (<xref ref-type="bibr" rid="B47">Wang et al., 2015</xref>). However, the catalytic activity of Cas2 has been associated with the virulence process in <italic>Legionella pneumophila</italic>, the causative agent of Legionnaires&#x2019; disease (<xref ref-type="bibr" rid="B20">Gunderson et al., 2015</xref>). Among other functions, Cas2 is also associated with morphological changes in <italic>E. coli</italic> (<xref ref-type="bibr" rid="B48">Wang et al., 2019</xref>). Thus, the catalytic activity of Cas2 in bacteria may be utilized for biological processes distinct from the CRISPR-Cas-associated function.</p>
<p>In the genome of pathogenic <italic>L. interrogans</italic> serovar Copenhageni (LinCas2B, ORF id: LIC10941 and LinCas2C, ORF id: LIC12917) and <italic>L. interrogans</italic> serovar Lai (LinCas2B_Lai, ORF id: LA3182 and LinCas2C_Lai, ORF id: LA0683), there are two Cas2 proteins, each in the locus of CRISPR-Cas subtypes I-B and I-C, respectively. Although LinCas2B (LIC10941) and LinCas2B_Lai (LA3182) shared an identical protein sequence, the sequence similarity between LinCas2C (LIC12917) and LinCas2B (LIC10941) is 32%. Moreover, <italic>cas2c</italic> (<italic>LA0683</italic>) in <italic>Leptospira</italic> serovar Lai encodes only the first 58 amino acids (LinCas2C_Lai) because of the natural frameshift mutation (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). It was thus interesting to decipher the nuclease activity in the naturally truncated LinCas2C_Lai (LA0683) protein.</p>
<p>In this study, we sought to characterize the recombinant Cas2C protein of serovars Copenhageni and Lai and compared its activity with well-characterized LinCas2B. Unlike Cas2 from other organisms, the purified rLinCas2C and rLinCas2C_Lai exhibited metal-dependent DNase and metal-independent RNase activity. The determined crystal structure of rLinCas2C ascertained its existence in the dimeric form with the characteristic N-terminal ferredoxin fold (&#x3b2;&#x3b1;&#x3b2;&#x3b2;&#x3b1;&#x3b2;) and was further compared with its homologs. This is the first report concerning the crystal structure of CRISPR-Cas elements from spirochetes.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Materials and methods</title>
<sec id="s2-1">
<title>Bioinformatics analysis</title>
<p>Nucleotide sequences of CRISPR-Cas I-C harbored in <italic>L. interrogans</italic> serovars Copenhageni and Lai were retrieved from NCBI. The three-dimensional (3D) atomic coordinates of the Cas2 orthologs were downloaded from the Protein Data Bank (PDB) (<xref ref-type="bibr" rid="B3">Berman et al., 2002</xref>). The genetic architecture of CRISPR-Cas I-C was created based on the <italic>cas</italic> gene coordinates previously documented (<xref ref-type="bibr" rid="B32">Makarova et al., 2015</xref>) and using the CRISPRone program (<xref ref-type="bibr" rid="B57">Zhang and Ye, 2017</xref>). The phylogenetic tree was constructed by the maximum likelihood method and bootstrapped (1,000 replicates) to evaluate the reliability of the tree generated using the program MEGA11 (<xref ref-type="bibr" rid="B44">Tamura et al., 2013</xref>). The 3D structures of LinCas2C_Lai and LinCas2B were predicted using the programs I-TASSER (<xref ref-type="bibr" rid="B54">Yang and Zhang, 2015</xref>), Phyre2 (<xref ref-type="bibr" rid="B26">Kelley et al., 2015</xref>), and the Swiss model (<xref ref-type="bibr" rid="B6">Biasini et al., 2014</xref>). The predicted model&#x2019;s energy was minimized and then refined using the web server ModRefiner (<xref ref-type="bibr" rid="B53">Xu and Zhang, 2011</xref>). Multiple sequence alignment was conducted using the program Clustal Omega (<xref ref-type="bibr" rid="B43">Sievers et al., 2011</xref>) with the default set of parameters and decorated using the web tool ESPript for better visual effect (<xref ref-type="bibr" rid="B18">Gouet et al., 2003</xref>). Molecular docking of LinCas2C with a non-specific dsDNA was performed using the program NPDock (<xref ref-type="bibr" rid="B45">Tuszynska et al., 2015</xref>). The program PyMOL (<xref ref-type="bibr" rid="B9">DeLano, 2002</xref>) was used to generate the superimposition of structures. The polar contacts between LinCas2C protomers and the LinCas2C-DNA interface were identified within a distance radius of 3.5&#xa0;&#xc5;. The buried surface area of the Cas2C dimer was calculated using the webserver PDBePISA (<xref ref-type="bibr" rid="B46">Velankar et al., 2010</xref>). LinCas2C with Mg<sup>2&#x2b;</sup> ion was modeled using <italic>Enterococcus faecalis</italic> Cas1-Cas2/prespacer ternary complex as a template (PDB id: 5XVP) (<xref ref-type="bibr" rid="B52">Xiao et al., 2017</xref>).</p>
</sec>
<sec id="s2-2">
<title>Nucleic acid isolation and cloning</title>
<p>The spirochete <italic>L. interrogans</italic> serovar Copenhageni strain Fiocruz L1-130 or serovar Lai culture was maintained in Ellinghausen-McCullough-Johnson-Harris (EMJH) media at 29&#xb0;C supplemented with 1&#xd7;enrichment media (Difco) along with 5-fluorouracil (100&#xa0;&#x3bc;g/ml). After 7&#xa0;days of incubation, the grown culture was sub-cultured successively. Genomic DNA of <italic>L. interrogans</italic> serovars Copenhageni and Lai were isolated from a 7-day-old culture containing &#x223c;10<sup>8</sup> cells per ml using QIAamp DNA Blood Mini Kit (Qiagen) per manufacturer protocol. <italic>E. coli</italic> strains DH5&#x3b1; and BL21 (DE3) were grown in Luria Bertani (LB, Himedia) broth or agar for cloning, transformation, and expression.</p>
<p>The open reading frame (ORF) of <italic>LIC12917</italic> (<italic>cas2c</italic>, 273 bp) and <italic>LA0683</italic> (<italic>cas2c_Lai,</italic> 272 bp) were amplified using the genomic DNA templates of <italic>L. interrogans</italic> serovars Copenhageni and Lai, respectively. Both full-length <italic>cas2c</italic> genes were cloned in the pCDF-1b expression vector (Novagen), and cloning was confirmed by double digestion of insert (<italic>BamH</italic>I-<italic>Sal</italic>I) and sequencing of plasmids.</p>
</sec>
<sec id="s2-3">
<title>Nuclease activity assay</title>
<p>Nuclease activity of rLinCas2C was investigated on various DNA and RNA substrates. RNA transcript of the <italic>luciferase</italic> gene was synthesized using HiScribe T7 high yield RNA synthesis kit (NEB) as per the manufacturer protocol. The plasmid was isolated from a 5&#xa0;ml overnight grown culture of <italic>E. coli</italic> DH5&#x3b1; cells using a mini-prep kit (Thermo Scientific). Single-stranded viral DNA substrate (M13mp18, &#x424;x174) and all enzymes used for genetic engineering were purchased commercially (NEB or Fermentas). As previously reported, short DNA oligomers of 23-mer and 50-mer were used (<xref ref-type="bibr" rid="B40">Rollie et al., 2015</xref>). The substrates used for nuclease activity of rLinCas2C were circular double-stranded (ds) plasmid DNA (pET28a, pTZ57R/T, 0.5&#xa0;&#xb5;g), circular single-stranded (ss) DNA (M13mp18, 0.5&#xa0;&#xb5;g), linear ssDNA (&#x3a6;x174 genome, 0.5&#xa0;&#xb5;g), 23- and 50-mer nucleotides (0.4&#xa0;&#xb5;M), and firefly <italic>luciferase</italic> mRNA (0.5&#xa0;&#xb5;g) (<xref ref-type="sec" rid="s10">Supplementary Table S1</xref>). The given amount of each substrate was independently incubated with rLinCas2C (25&#xa0;&#x3bc;M) in a total reaction volume of 25&#xa0;&#x3bc;L of nuclease buffer (25&#xa0;mM Tris-HCl pH 8.0, 100&#xa0;mM KCl, and 2.5&#xa0;mM MgCl<sub>2</sub>) for an hour at 37&#xb0;C. DNase activity dependence for divalent metal ions (2.5&#xa0;mM) was determined by substituting various divalent metal ions (MgCl<sub>2</sub>, MnSO<sub>4</sub>, CaCl<sub>2</sub>, NiSO<sub>4</sub>, FeSO<sub>4</sub>, CuSO<sub>4</sub>, and ZnSO<sub>4</sub>). All the reaction products were separated on ethidium bromide-stained 2% (w/v) agarose gel electrophoresis. The nuclease reaction containing 23- and 50-mer nucleotides were assessed on 8&#xa0;M 15% urea-PAGE.</p>
</sec>
<sec id="s2-4">
<title>Site-directed mutagenesis</title>
<p>Using the Q5 site-directed mutagenesis kit (NEB), rLinCas2C mutant variants were generated. The mutants were generated using the template plasmid pCDF_LIC12917 and the primers used are listed in <xref ref-type="sec" rid="s10">Supplementary Table S1</xref>. In rLinCas2C, potential residues involved in nuclease activities were substituted with alanine at one or multiple sites to generate various mutant variants (rLinCas2C<sup>Y7A</sup>, rLinCas2C<sup>Y7A&#x2b;D8A</sup>, rLinCas2C<sup>R33A&#x2b;F39A</sup>, and rLinCas2C<sup>Y7A&#x2b;D8A&#x2b;R33A&#x2b;F39A</sup>). In one of the mutant variants (rLinCas2C<sup>&#x394;L2</sup>), residues involved in framing the loop L2 were deleted. All the generated constructs were outsourced for sequencing before overexpression, purification, and characterization of proteins.</p>
<p>Quantitative RNase activity of rLinCas2C, rLinCas2C_Lai, and the mutant variants of rLinCas2C was done using the RNaseAlert kit (Integrated DNA technology, IDT; Cat &#x23; 11-02-01-02). The RNaseAlert kit contains synthetic RNA oligo substrate labeled with fluorescein and a quencher at its end. When cleaved by an RNase, the substrate fluoresces green (490&#xa0;nm excitation and 520&#xa0;nm emission) and can be measured by a fluorometer. RNase activity was performed in black flat-bottom 96-well plates (Invitrogen) at 37&#xb0;C. Fluorogenic RNA substrate (10&#xa0;pmol) was incubated with rLinCas2C, its mutant variants, and LinCas2C_Lai (25&#xa0;&#xb5;M) in a total of 100&#xa0;&#x3bc;l reaction buffer (25&#xa0;mM Tris-Cl pH 8.0 and 100&#xa0;mM KCl). Fluorescence was measured at every 5&#xa0;min interval till 60&#xa0;min using the Infinite M200Pro plate reader (Tecan).</p>
</sec>
<sec id="s2-5">
<title>Crystallization, data collection, and structure determination</title>
<p>The purified protein (rLinCas2C, 5&#xa0;mg/ml) was screened for initial crystal hits using crystallization conditions available from Hampton Research utilizing the hanging-drop vapor-diffusion method at 4&#xb0;C. Diffraction quality crystals of rLinCas2C were obtained in 0.2&#xa0;M sodium citrate tribasic dihydrate pH 5.6, 5% 2-propanol, 20% polyethylene glycol (PEG) 4,000 and 0.2% low melting agarose (LMA). X-ray intensity diffraction data were collected at &#x2212;173&#xb0;C using the home source Rigaku MicroMax-007 HF diffractometer (operated at 40&#xa0;kV and 30&#xa0;mA) and R-Axis IV&#x2b;&#x2b; imaging-plate detector available at the central instrument facility (CIF) of the Indian Institute of Technology Guwahati, India. The crystal to detector distance was maintained at 170&#xa0;mm. The diffraction data were processed and scaled using the programs iMosflm (<xref ref-type="bibr" rid="B1">Battye et al., 2011</xref>) and Aimless (<xref ref-type="bibr" rid="B13">Evans and Murshudov, 2013</xref>) embedded in the CCP4 package (<xref ref-type="bibr" rid="B49">Winn et al., 2011</xref>). The intensities were converted to structure factors using the module ctruncate available in the CCP4 package. Summary for X-ray intensity data collection and processing statistics are provided in <xref ref-type="table" rid="T1">Table 1</xref>. Initial phases of the protein rLinCas2C were determined employing the molecular replacement method using the crystal structure of SpyCas2 (PDB id: 4QR0) from <italic>Streptococcus pyogenes</italic> having a sequence identity (query coverage) of 45 (98)% as a search model using the program Phaser (<xref ref-type="bibr" rid="B35">McCoy et al., 2007</xref>). To calculate the R<sub>free</sub>, 5% of the total reflections were kept aside as a test data set (<xref ref-type="bibr" rid="B7">Br&#xfc;nger, 1992</xref>). The atomic model building and iterative cycles of structural parameters refinement were carried out using Coot (<xref ref-type="bibr" rid="B12">Emsley et al., 2010</xref>) and Refmac5 (<xref ref-type="bibr" rid="B36">Murshudov et al., 2011</xref>), respectively. The structural quality of the final refined model was validated using programs PROCHECK (<xref ref-type="bibr" rid="B28">Laskowski et al., 1993</xref>) and MolProbity (<xref ref-type="bibr" rid="B8">Chen et al., 2010</xref>). As the final refined model did not contain a metal (Mg<sup>2&#x2b;</sup>) ion in its active site required for its activity, crystallization of the protein incubated with MgCl<sub>2</sub> was attempted. However, a diffractive crystal could not be obtained. The details of the structure refinement and validation of the final structure models are provided in <xref ref-type="table" rid="T1">Table 1</xref>. The three-dimensional atomic coordinates of the protein LinCas2 have been deposited in the RCSB Protein Data Bank (PDB id: 7F84) (<xref ref-type="bibr" rid="B4">Berman et al., 2000</xref>).</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Data collection and refinement statistics of rLinCas2C. The values in parenthesis are for the last resolution shell.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Parameters</th>
<th align="left">rLinCas2C</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Wavelength (&#xc5;)</td>
<td align="left">1.5418</td>
</tr>
<tr>
<td align="left">Temperature (K)</td>
<td align="left">100</td>
</tr>
<tr>
<td align="left">Space group</td>
<td align="left">
<italic>I</italic>422</td>
</tr>
<tr>
<td align="left">Unit-cell parameters (&#xc5;, &#xB0;)</td>
<td align="left">
<italic>a</italic> &#x3d; <italic>b</italic> &#x3d; 103.16, <italic>c</italic> &#x3d; 97.30, <italic>&#x3b1;</italic> &#x3d; <italic>&#x3b2;</italic> &#x3d; <italic>&#x3b3;</italic> &#x3d; 90.00</td>
</tr>
<tr>
<td align="left">Resolution (&#xc5;)</td>
<td align="left">72.94&#x2013;2.60 (2.72&#x2013;2.60)</td>
</tr>
<tr>
<td align="left">No. of observed reflections</td>
<td align="left">140,737 (17,109)</td>
</tr>
<tr>
<td align="left">No. of unique reflections</td>
<td align="left">8,390 (1,011)</td>
</tr>
<tr>
<td align="left">Mn(I) CC (1/2)</td>
<td align="left">0.998 (0.959)</td>
</tr>
<tr>
<td align="left">Completeness (%)</td>
<td align="left">100 (100)</td>
</tr>
<tr>
<td align="left">V<sub>M</sub> (&#xc5;<sup>3</sup> Da<sup>&#x2212;1</sup>)</td>
<td align="left">3.19</td>
</tr>
<tr>
<td align="left">Solvent content (%)</td>
<td align="left">61.44</td>
</tr>
<tr>
<td align="left">Mosaicity (&#xb0;)</td>
<td align="left">0.60</td>
</tr>
<tr>
<td align="left">Mean I/&#x3c3;(I)</td>
<td align="left">12.3 (4.8)</td>
</tr>
<tr>
<td align="left">R<sub>merge</sub>
<xref ref-type="table-fn" rid="Tfn1">
<sup>a</sup>
</xref> (%)</td>
<td align="left">15.8 (50.4)</td>
</tr>
<tr>
<td align="left">R<sub>pim</sub> (%)</td>
<td align="left">5.6 (17.8)</td>
</tr>
<tr>
<td align="left">R<sub>meas</sub> (%)</td>
<td align="left">16.8 (53.5)</td>
</tr>
<tr>
<td align="left">Multiplicity</td>
<td align="left">16.8 (16.9)</td>
</tr>
<tr>
<td align="left">R<sub>work</sub>/R<sub>free</sub> (%)</td>
<td align="left">20.87/26.96</td>
</tr>
<tr>
<td colspan="2" align="left">Protein model</td>
</tr>
<tr>
<td align="left">&#x2003;No. of subunits in ASU</td>
<td align="left">2</td>
</tr>
<tr>
<td align="left">&#x2003;Protein atoms</td>
<td align="left">1,428</td>
</tr>
<tr>
<td align="left">&#x2003;Water molecules</td>
<td align="left">51</td>
</tr>
<tr>
<td align="left">&#x2003;Other molecules</td>
<td align="left">GOL</td>
</tr>
<tr>
<td colspan="2" align="left">Deviation from ideal geometry</td>
</tr>
<tr>
<td align="left">&#x2003;Bond length (&#xc5;)</td>
<td align="left">0.013</td>
</tr>
<tr>
<td align="left">&#x2003;Bond angles (&#xb0;)</td>
<td align="left">1.869</td>
</tr>
<tr>
<td colspan="2" align="left">Average <italic>B</italic>-factor (&#xc5;<sup>2</sup>)</td>
</tr>
<tr>
<td align="left">&#x2003;Protein atoms</td>
<td align="left">27.39</td>
</tr>
<tr>
<td align="left">&#x2003;Water molecules</td>
<td align="left">33.52</td>
</tr>
<tr>
<td align="left">&#x2003;Other molecules</td>
<td align="left">63.98</td>
</tr>
<tr>
<td colspan="2" align="left">Ramachandran plot (%)</td>
</tr>
<tr>
<td align="left">&#x2003;Favored</td>
<td align="left">96.53</td>
</tr>
<tr>
<td align="left">&#x2003;Allowed</td>
<td align="left">3.47</td>
</tr>
<tr>
<td align="left">&#x2003;PDB id</td>
<td align="left">
<bold>7F84</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="Tfn1">
<label>a</label>
<p>
<italic>R</italic>
<sub>merge</sub> &#x3d; &#x2211;<sub>
<italic>hkl</italic>
</sub> &#x2211;<sub>
<italic>i</italic>
</sub>&#x7c;<italic>I</italic>
<sub>
<italic>i</italic>
</sub>(<italic>hkl</italic>)&#x2013;&#x27e8;<italic>I</italic> (<italic>hkl</italic>)&#x7c;/&#x2211;<sub>
<italic>hkl</italic>
</sub>&#x2211;<sub>
<italic>i</italic>
</sub>I<sub>
<italic>i</italic>
</sub>(<italic>hkl</italic>), where <italic>I</italic> (<italic>hkl</italic>) is the intensity of reflection <italic>hkl, &#x2211;hkl</italic> is the sum overall reflections and &#x2211;<italic>i</italic> is the sum over <italic>i</italic> measurements of reflection <italic>hkl</italic>, GOL:glycerol.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>
<italic>CRISPR-Cas I-C locus in the genome of L. interrogans</italic> serovar Copenhageni <italic>and L. interrogans</italic> serovar Lai</title>
<p>Based on the CRISPROne program (<xref ref-type="bibr" rid="B57">Zhang and Ye, 2017</xref>) and the data retrieved from the earlier report (<xref ref-type="bibr" rid="B32">Makarova et al., 2015</xref>), the CRISPR-Cas I-C locus (nucleotides coordinate 3535328&#x2013;3542766) containing the <italic>cas2c</italic> (ORF id: <italic>LIC12917</italic>) gene of size 273 bp in <italic>L. interrogans</italic> serovar Copenhageni is illustrated (<xref ref-type="fig" rid="F1">Figure 1A</xref>). In a similar <italic>in silico</italic> approach of genome analysis in <italic>L. interrogans</italic> serovar Lai, another well-studied pathogenic spirochete, the <italic>cas2c</italic> (ORF id: <italic>LA0683</italic>) gene in the CRISPR-Cas I-C locus (nucleotides coordinate 686432&#x2013;693873), appeared to be absent (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). Conversely, the NCBI genome database of <italic>Leptospira</italic> predicts the <italic>cas2c</italic> (<italic>LA0683</italic>) gene to be of 272&#xa0;bp size with a natural deletion of one nucleotide (adenine<sup>108</sup>) that may result in partial ORF translation. Thus, Xiao and co-workers (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>) reported that in <italic>L. interrogans</italic> serovars Lai genome, <italic>cas2c</italic> (<italic>LA0683</italic>) might encode for truncated (58 amino acids and frameshift after 35th residue) and inactive LinCas2C_Lai (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). We, thus, recount the <italic>LA0683</italic> partial reading frame (177 of 272&#xa0;bp) in the CRISPR I-C locus of <italic>L. interrogans</italic> serovar Lai (<xref ref-type="fig" rid="F1">Figure 1B</xref>), and all other <italic>cas</italic> genes are described with the CRISPR-Cas I-C locus of serovar Copenhageni (<xref ref-type="fig" rid="F1">Figure 1A</xref>). Unlike the CRISPR-Cas I-B locus, the CRISPR-Cas I-C in <italic>Leptospira</italic> lacks the CRISPR array essential for imparting RNA-mediated interference of foreign nucleic acids. CRISPR-Cas I-C locus in the absence of array, the role of Cas2 in CRISPR biology is questionable. Phylogenetic analysis was performed to explore the evolutionary relationship of LinCas2C with the selected Cas2 homologs (<xref ref-type="fig" rid="F1">Figure 1C</xref>). The lineage of LinCas2B appears to be closer to SsoCas2, whereas LinCas2C is closer to SpyCas2. However, the lineage of LinCas2C among Cas2 of <italic>Leptospira</italic> was closely related to LinCas2C_Linhai and LinCas2C_Lai (<xref ref-type="fig" rid="F1">Figure 1C</xref>). LinCas2B is a well-studied Cas protein of <italic>Leptospira</italic> from our group (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>); however, LinCas2B and LinCas2C proteins formed separate clades in the phylogenetic tree analysis. The phylogenetic study encouraged us to address if the LinCas2C nuclease property is different from LinCas2B. To date, independent research groups have ascertained the <italic>L. interrogans cas2</italic> genes of CRISPR-Cas to be transcriptionally active in different serovars<italic>,</italic> and the characterization of LinCas2B and LinCas2C_Linhai demonstrated to have metal-dependent DNase activity (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>; <xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). Nevertheless, there is a gap in understanding the role of LinCas2C found at subtype I-C, which lacks the essential array element.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>CRISPR-Cas I-C locus of <italic>L. interrogans</italic> and molecular phylogeny of Cas2C orthologs. <bold>(A,B)</bold> Schematic representation of the architecture of CRISPR-Cas I-C of serovar Copenhageni and Lai, respectively. <bold>(C)</bold> Phylogenetic analysis of Cas2 orthologs generated by way of the maximum likelihood algorithm. <italic>L. interrogans</italic> serovar Copenhageni Cas2B and Cas2C are represented as LinCas2B and LinCas2C, respectively. Similarly, in parenthesis, Cas2C of <italic>L. interrogans</italic> serovar Lai (LinCas2C_Lai), <italic>Streptococcus pyogenes</italic> (SpyCas2; Q99YS8), <italic>Xanthomonas albilineans</italic> (XalCas2; D2UG58), <italic>Bacillus halodurans</italic> (Bha_Cas2; Q9KFX8), <italic>Desulfovibrio vulgaris</italic> (DvuCas2; Q72WF4) and <italic>Sulfolobus solfataricus</italic> (SsoCas2; Q97YC2) are shown. &#x2b; and &#x2212; sign represents sense and anti-sense strands, respectively.</p>
</caption>
<graphic xlink:href="fmolb-09-988569-g001.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>Recombinant LinCas2C and LinCas2C_Lai nuclease activity on double-stranded DNA</title>
<p>The Cas proteins are known to possess nuclease activity. During the adaptation phase of the CRISPR-Cas immunity, the Cas1-Cas2 heterohexameric complex executes its nuclease activity for new spacer integration (<xref ref-type="bibr" rid="B29">Lee et al., 2019</xref>). The genes <italic>cas2c</italic> (<italic>LIC12917</italic>; 273&#xa0;bp) and <italic>cas2c_</italic>Lai (<italic>LA0683</italic>; 272&#xa0;bp) were cloned, and the recombinant proteins (rLinCas2C and rLinCas2C_Lai) were purified using Ni-NTA affinity chromatography to investigate the nuclease activity (<xref ref-type="sec" rid="s10">Supplementary Figure S1A</xref>). Due to natural frameshift mutation, rLinCas2C_Lai was purified in a truncated form (8.6&#xa0;kDa), lacking the C-terminal region essential for dimer formation. To our dismay, rLinCas2C_Lai, in addition to a monomeric state (12&#xa0;kDa), could self-assemble to a trimeric (34&#xa0;kDa) state instead of a dimer during size-exclusion chromatography (SEC) (<xref ref-type="sec" rid="s10">Supplementary Figure S1B</xref>). On the other hand, rLinCas2C (LIC12917) self-assembled in the dimeric (28&#xa0;kDa) and monomeric (15&#xa0;kDa) state when resolved through SEC (<xref ref-type="sec" rid="s10">Supplementary Figure S1B</xref>). Specific polyclonal antibodies raised in mice against rLinCas2C and rLinCas2B did not cross-react with each other (<xref ref-type="sec" rid="s10">Supplementary Figure S1C</xref>). Nevertheless, anti-LinCas2C could cross-react with rLinCas2C_Lai (<xref ref-type="sec" rid="s10">Supplementary Figure S1D</xref>) and agrees with the phylogenetic study shown in <xref ref-type="fig" rid="F1">Figure 1C</xref>. In addition, the monomeric and dimeric LinCas2C native expression could also be detected in <italic>L. interrogans</italic> serovar Copenhageni lysate (<xref ref-type="sec" rid="s10">Supplementary Figure S1E</xref>).</p>
<p>The purified Cas2C (rLinCas2C and rLinCas2C_Lai) was used to investigate the nuclease activity on various DNA substrates. We excluded LinCas2B_Lai in our analysis as it had 100% sequence similarity to the well-studied LinCas2B. In a nuclease assay, increasing concentrations (5&#x2013;25&#xa0;&#xb5;M) of each rLinCas2C and rLinCas2C_Lai were taken to optimize the cleavage of the dsDNA (plasmid DNA; 0.5&#xa0;&#xb5;g). Around 25&#xa0;&#xb5;M of each rLinCas2C and rLinCas2C_Lai could completely cleave the DNA (0.5&#xa0;&#xb5;g) in an hour at 37&#xb0;C (<xref ref-type="fig" rid="F2">Figure 2A</xref>; <xref ref-type="sec" rid="s10">Supplementary Figure S2A</xref>). The DNA cleavage assay of rLinCas2C and rLinCas2C_Lai on circular dsDNA suggested that both Cas2C are endodeoxyribonucleases.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>DNase activity of rLinCas2C on plasmid DNA. DNase activity reaction was carried out at 37&#xb0;C for an hour. <bold>(A)</bold> Concentration-dependent DNase activity of recombinant rLinCas2C on plasmid-1 substrate (5.3&#xa0;kb pET28a vector, 0.5&#xa0;&#xb5;g) in the presence of Mg<sup>2&#x2b;</sup> ion. Complete cleavage of the substrate was observed using rLinCas2C at 25&#xa0;&#xb5;M. <bold>(B)</bold> DNase activity of rLinCas2C in the presence of different divalent metal ions on plasmid exemplifies its optimum activity in Mg<sup>2&#x2b;</sup> and Mn<sup>2&#x2b;</sup> ions. <bold>(C)</bold> The substrate specificity of rLinCas2C on two different plasmid substrates. Substrate plasmid-1 and plasmid-2 (3.5&#xa0;kb pTZ57&#xa0;R/T vector, 0.5&#xa0;&#xb5;g) were employed for DNase activity. <bold>(D)</bold> DNase activity of rLinCas2C at different pH. The optimum activity was observed at pH 8.0 and 9.0. DNA ladder: 2 log DNA ladder (NEB). rLinCas2C: 25&#xa0;&#x3bc;M, EDTA: 2.5&#xa0;mM, Mg<sup>2&#x2b;</sup> and others divalent metal: 2.5&#xa0;mM. Reaction products were analyzed on 2% agarose gel.</p>
</caption>
<graphic xlink:href="fmolb-09-988569-g002.tif"/>
</fig>
<p>For studying metallonucleases, substituting metal ions is a common practice to understand their role in the nuclease activity (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>; <xref ref-type="bibr" rid="B10">Dixit et al., 2021</xref>). Mg<sup>2&#x2b;</sup> ion was substituted with other divalent metal ions (Mn<sup>2&#x2b;</sup>, Ca<sup>2&#x2b;</sup>, Ni<sup>2&#x2b;</sup>, Fe<sup>2&#x2b;</sup>, Cu<sup>2&#x2b;</sup>, and Zn<sup>2&#x2b;</sup>) to explore their (rLinCas2C and rLinCas2C_Lai) preference for DNase activity. Both the nucleases (rLinCas2C and rLinCas2C_Lai) displayed higher affinity towards Mg<sup>2&#x2b;</sup>, followed by Mn<sup>2&#x2b;</sup> and Fe<sup>2&#x2b;</sup> as a cofactor for its DNase activity. LinCas2B and BhaCas2 also preferred Mg<sup>2&#x2b;</sup> over other metal ions (<xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>) and are in coalition with the cofactor preference assay. In the presence of Ca<sup>2&#x2b;</sup>, Cu<sup>2&#x2b;</sup>, and Zn<sup>2&#x2b;</sup>, a curtailed or no DNA cleavage activity was exhibited by both the nucleases (<xref ref-type="fig" rid="F2">Figure 2B</xref>; <xref ref-type="sec" rid="s10">Supplementary Figure S2B</xref>). A shift in DNA mobility was also detected during agarose gel electrophoresis in the presence of cofactors Ca<sup>2&#x2b;</sup>, Cu<sup>2&#x2b;</sup>, and Zn<sup>2&#x2b;</sup>. Such a shift could be due to the retainment of the DNA binding property of LinCas2C.</p>
<p>The nuclease assays of LinCas2B and LinCas2C_Linhai under <italic>in vitro</italic> conditions demonstrated that Cas2 proteins exhibit divalent metal and pH-dependent nuclease activities, where the substrate preferences fluctuated incredibly (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>; <xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). Thus, it was intriguing to address whether the rLinCas2C nuclease activity is dependent on the nucleotide sequence. The DNase activity of rLinCas2C was conducted on two substrates (circular dsDNA plasmid). Both the nucleases (rLinCas2C and rLinCas2C_Lai) exhibited DNase activity non-specifically similar to that of LinCas2B (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>) and LinCas2C_Linhai (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). The divalent metal ions were prerequisites for DNase activity in rLinCas2C, as the addition of EDTA completely abolished the plasmid degradation (<xref ref-type="fig" rid="F2">Figure 2C</xref>). The rLinCas2C exhibited optimum DNase activity in the pH range of 7.0 and 9.0. Nuclease activity gets reduced at pH 10.0 to 11.0 and exhibits a moderate affinity for DNA (<xref ref-type="fig" rid="F2">Figure 2D</xref>). The pH-dependent DNase activity of rLinCas2C agreed with that of LinCas2B (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>). Similarly, it is proposed that at the optimum pH, Cas2 (BhaCas2) attains a metal-bound catalytically active conformation (<xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>).</p>
</sec>
<sec id="s3-3">
<title>Recombinant LinCas2C nuclease activity on single-stranded DNA and RNA</title>
<p>Since rLinCas2C degraded dsDNA, it was intriguing to evaluate its activity on ssDNA and ssRNA. In a previous study, LinCas2B and BhaCas2 were inert toward short DNA oligos (28&#x2013;32-mer) (<xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>). In agreement, rLinCas2C could not cleave short DNA oligos (23- and 50-mer) in the presence of a cofactor (<xref ref-type="fig" rid="F3">Figure 3A</xref>). The DNase activity of rLinCas2C and rLinCas2C_Lai on the viral ssDNA (linear M13mp18 and circular &#x424;x174) demonstrated cleavage in the presence of divalent metal ion (<xref ref-type="fig" rid="F3">Figures 3B,C</xref>; <xref ref-type="sec" rid="s10">Supplementary Figures S2C,D</xref>). On the same line, LinCas2B, in the presence of a cofactor, also cleaves viral ssDNA (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>). In addition, rLinCas2C and rLinCas2C_Lai exhibited cleavage of mRNA transcripts of <italic>luciferase</italic> gene independent of divalent metal ions (<xref ref-type="fig" rid="F3">Figure 3D</xref>; <xref ref-type="sec" rid="s10">Supplementary Figure S2E</xref>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Nuclease activity of rLinCas2C on single-stranded DNA and RNA. The nuclease activity reaction was carried out at 37&#xb0;C for an hour. <bold>(A)</bold> DNase activity of rLinCas2C on synthesized single-stranded linear DNA (oligo-1: 23-mer 0.4 &#xb5;M, oligo-2: 50-mer 0.4&#xa0;&#xb5;M). The nuclease reaction product was analyzed on 8&#xa0;M 15% urea-PAGE. <bold>(B)</bold> DNase activity of rLinCas2C on linear single-stranded DNA (0.5&#xa0;&#xb5;g of 6.4&#xa0;kb M13mp18). Complete degradation of linear single-stranded was observed in the presence of Mg<sup>2&#x2b;</sup> ions. <bold>(C)</bold> DNase activity of rLinCas2C on circular single-stranded DNA (3.6&#xa0;kb &#x3d5;x174, 0.5&#xa0;&#xb5;g). Complete degradation of circular single-stranded was observed in the presence of Mg<sup>2&#x2b;</sup> ion. <bold>(D)</bold> RNase activity of rLinCas2C on <italic>luciferase</italic> mRNA (0.5&#xa0;&#xb5;g). Complete degradation of RNA was observed even in the absence of Mg<sup>2&#x2b;</sup> ions. DNA ladder: 2 log DNA ladder (NEB). rLinCas2C: 25&#xa0;&#xb5;M and Mg<sup>2&#x2b;</sup>: 2.5&#xa0;mM. The nuclease reaction products shown in Figure <bold>(B)</bold>, <bold>(C)</bold>, and <bold>(D)</bold> were analyzed on 2% agarose gel.</p>
</caption>
<graphic xlink:href="fmolb-09-988569-g003.tif"/>
</fig>
</sec>
<sec id="s3-4">
<title>Overall structure of rLinCas2C</title>
<p>The crystal structure of rLinCas2C encloses the signature N-terminal ferredoxin domain (&#x3b2;&#x3b1;&#x3b2;&#x3b2;&#x3b1;&#x3b2;). LinCas2C crystal is composed of a total of three &#x3b1;-helices (&#x3b1;1&#x2013;&#x3b1;3) and five anti-parallel &#x3b2;-strands (&#x3b2;1&#x2013;&#x3b2;5) (<xref ref-type="fig" rid="F4">Figure 4A</xref>), as described before for the Cas2 orthologs enlisted in <xref ref-type="table" rid="T2">Table 2</xref>. The solvent-accessible surface area and Gibbs free energy of monomeric rLinCas2C were 6,563.8 &#xc5;<sup>2</sup> and &#x2212;73&#xa0;kcal/mol, respectively. There are two loops named loop L1 and L2 connecting &#x3b2;1-&#x3b1;1 and &#x3b1;2-&#x3b2;4, respectively, found in all Cas2 orthologs. Loops L1 and L2 are speculated to recognize DNA and RNA substrates, respectively (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). Structure superimposition of rLinCas2C over modeled rLinCas2B shows a shorter DNA binding loop L1 while the RNA binding loop L2 was comparable in size (<xref ref-type="fig" rid="F4">Figure 4B</xref>). The modeled three-dimensional structure of rLinCas2C_Lai disclosed the presence of two &#x3b1;-helices (&#x3b1;1 and &#x3b1;2) and three anti-parallel &#x3b2;-strands (&#x3b2;1-&#x3b2;3). However, &#x3b2;4 and &#x3b2;5, the two &#x3b2;-strands at the C-terminus, are missing compared to rLinCas2C (<xref ref-type="sec" rid="s10">Supplementary Figure S3A</xref>). Structural superimposition of rLinCas2C_Lai revealed identical DNA binding loop L1 as rLinCas2; however, shorter loop L2. In LinCas2C_Lai, amino acid residues imparting loop L2 were intact even after frameshift mutation (<xref ref-type="fig" rid="F4">Figure 4C</xref>). Intact loop L2 may be the possible reason behind rLinCas2C_Lai displaying activity despite expressing the truncated protein.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>The crystal structure of rLinCas2C and its correlation with various orthologs. <bold>(A)</bold> The crystal structure of rLinCas2C is represented as a cartoon model. All the secondary structural elements, along with the N-and C-termini, are labeled for clarity. In order to map the putative substrate-binding loop (L1 and L2), rLinCas2C structure was correlated with LinCas2B; rmsd: 0.8 <bold>(B)</bold> LinCas2C_Lai; rmsd: 1.0 <bold>(C)</bold>, SpyCas2; rmsd: 0.9, BhaCas2; rmsd: 0.6, DvuCas2; rmsd: 0.9, SsoCas2; rmsd: 1.4&#xa0;&#xc5; <bold>(D)</bold>. <bold>(E)</bold> Multiple sequence alignment of LinCas2C with its orthologs. Two putative substrate-binding loops, L1 (DNA) and L2 (RNA), and secondary structure elements, are labeled. The secondary structural elements on top of the alignment are given according to the rLinCas2C. XalCas2: <italic>Xanthomonas albilineans</italic> (D2UG58), BhaCas2: <italic>Bacillus halodurans</italic> (Q9KFX8), DvuCas2: <italic>Desulfovibrio vulgaris</italic> (Q72WF4), SsoCas2: <italic>Sulfolobus solfataricus</italic> (Q97YC2), SpyCas2: <italic>Streptococcus pyogenes</italic> (Q99YS8). Loop L1 and L2 are marked with rectangles. Red triangles highlight conserved residues.</p>
</caption>
<graphic xlink:href="fmolb-09-988569-g004.tif"/>
</fig>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>List of structural homologs of LinCas2C.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Protein</th>
<th align="left">PDB id</th>
<th align="left">Rmsd (&#xc5;)</th>
<th align="left">Z-Score</th>
<th align="left">References</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">BhaCas2</td>
<td align="left">4ES1</td>
<td align="left">1.2</td>
<td align="left">14.7</td>
<td align="left">
<xref ref-type="bibr" rid="B37">Nam et al. (2012)</xref>
</td>
</tr>
<tr>
<td align="left">SpyCas2</td>
<td align="left">4QR2</td>
<td align="left">1.2</td>
<td align="left">14.6</td>
<td align="left">
<xref ref-type="bibr" rid="B25">Ka et al. (2014)</xref>
</td>
</tr>
<tr>
<td align="left">XalCas2</td>
<td align="left">5H1O</td>
<td align="left">1.6</td>
<td align="left">13.8</td>
<td align="left">
<xref ref-type="bibr" rid="B24">Ka et al. (2017)</xref>
</td>
</tr>
<tr>
<td align="left">TthCas2</td>
<td align="left">1ZPW</td>
<td align="left">1.8</td>
<td align="left">11.3</td>
<td align="left">
<xref ref-type="bibr" rid="B42">Seto et al. (2003)</xref>
</td>
</tr>
<tr>
<td align="left">DvuCas2</td>
<td align="left">3OQ2</td>
<td align="left">1.9</td>
<td align="left">13.4</td>
<td align="left">
<xref ref-type="bibr" rid="B41">Samai et al. (2010)</xref>
</td>
</tr>
<tr>
<td align="left">PhoCas2</td>
<td align="left">6K2E</td>
<td align="left">1.9</td>
<td align="left">10.4</td>
<td align="left">&#x2014;</td>
</tr>
<tr>
<td align="left">TonCas2</td>
<td align="left">5G4D</td>
<td align="left">1.9</td>
<td align="left">10.5</td>
<td align="left">
<xref ref-type="bibr" rid="B23">Jung et al. (2016)</xref>
</td>
</tr>
<tr>
<td align="left">PfuCas2</td>
<td align="left">2I0X</td>
<td align="left">2.0</td>
<td align="left">10.1</td>
<td align="left">&#x2014;</td>
</tr>
<tr>
<td align="left">TdeCas2</td>
<td align="left">6JHZ</td>
<td align="left">2.2</td>
<td align="left">9.3</td>
<td align="left">&#x2014;</td>
</tr>
<tr>
<td align="left">SpyCas2</td>
<td align="left">5ZYF</td>
<td align="left">2.5</td>
<td align="left">9.7</td>
<td align="left">
<xref ref-type="bibr" rid="B25">Ka et al. (2014)</xref>
</td>
</tr>
<tr>
<td align="left">SsoCas2</td>
<td align="left">2I8E</td>
<td align="left">2.5</td>
<td align="left">9.3</td>
<td align="left">
<xref ref-type="bibr" rid="B2">Beloglazova et al. (2008)</xref>
</td>
</tr>
<tr>
<td align="left">EcoCas2</td>
<td align="left">4MAK</td>
<td align="left">&#x2014;</td>
<td align="left">&#x2014;</td>
<td align="left">&#x2014;</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Structural homology search based on scores of selected parameters [Z-score and root mean square deviation (rmsd)] of the web server DALI (<xref ref-type="bibr" rid="B21">Holm, 2020</xref>) revealed the closest homologs of rLinCas2C to be BhaCas2 and SpyCas2 (<xref ref-type="table" rid="T2">Table 2</xref>).</p>
<p>In addition, the crystal structure of rLinCas2C was superimposed with the structures of Cas2 orthologs (SpyCas2C, BhaCas2C, DvuCas2C, and SsoCas2) and the putative DNA (L1) and RNA (L2) binding loops were compared (<xref ref-type="fig" rid="F4">Figure 4D</xref>). SpyCas2C, BhaCas2C, and DvuCas2 have identical L1 and L2 loop sizes to rLinCas2C. Similarly, to understand the rLinCas2C_Lai divergence in the putative substrate-binding loop, its modeled structure (<xref ref-type="sec" rid="s10">Supplementary Figure S3A</xref> and PDB file in supplementary information) was superimposed with the SpyCas2, BhaCas2, DvuCas2, and SsoCas2 (<xref ref-type="sec" rid="s10">Supplementary Figures S3B&#x2013;S3E</xref>). The putative loop L1 size of rLinCas2C_Lai aligns with SpyCas2, BhaCas2, and DvuCas2 but not SsoCas2. However, the putative loop L2 of rLinCas2C_Lai was smaller than its orthologs (<xref ref-type="sec" rid="s10">Supplementary Figures S3B&#x2013;S3E</xref>).</p>
<p>A multiple sequence alignment of LinCas2C with its orthologs also displays the variation in the residues responsible for constituting the loop L1 and L2 (<xref ref-type="fig" rid="F4">Figure 4E</xref>). Notably, LinCas2C_Lai shares a 31% amino acids sequence dissimilarity with LinCas2C, where few conserved residues (His8, Pro37, Phe38, Leu39, Trp44, Asn54, and Lys57) differ from their corresponding residues in LinCas2C (Asp8, Ser37, Val38, Phe39, Leu44, Asp64, and Arg67) (<xref ref-type="fig" rid="F4">Figure 4E</xref>). Another Cas2C paralog of serovar Linhai (LinCas2C_Linhai) shared 3% dissimilarity at N- (Pro32 and His47) and C-terminal region (Ile78, Glu91, Glu92, Pro93, Ile94, Ile95, and Leu96) to LinCas2C (<xref ref-type="fig" rid="F4">Figure 4E</xref>).</p>
<p>The asymmetric unit of the rLinCas2C crystal contains two protein subunits forming a dimer (<xref ref-type="fig" rid="F5">Figure 5A</xref>) and agrees with the crystal structure of Cas2 orthologs enlisted in <xref ref-type="table" rid="T2">Table 2</xref> (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>; <xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>). In SpyCas2, upon dimerization, the surface area buried is 2,793&#x2013;2,856&#xa0;&#xc5;<sup>2</sup>, forming 29&#x2013;32 hydrogen bonds between the two protomers (<xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>). In rLinCas2C, upon dimerization, the buried surface area is 3,430&#xa0;&#xc5;<sup>2</sup>, identified by PDBePISA (<xref ref-type="bibr" rid="B46">Velankar et al., 2010</xref>). A total of 33 hydrogen bonds were formed between two protomers of LinCas2C as analyzed by Coot (<xref ref-type="bibr" rid="B12">Emsley et al., 2010</xref>). The rLinCas2C dimer demonstrates that the &#x3b2;5 strand (6 residues) of one protomer bridges with the &#x3b2;4 strand (5 residues) of another protomer by 8&#x2013;10 hydrogen bonds and several other residues enlisted in <xref ref-type="sec" rid="s10">Supplementary Table S2</xref>. Interestingly, residues (Asp8, Asp64, Leu66, and Arg67) present at the dimeric interface were conserved among LinCas2C orthologs. To our dismay, rLinCas2C_Lai lacks &#x3b2;4 and &#x3b2;5 strands but still exhibits a trimeric state in solution. The trimeric structure of modeled LinCas2C_Lai was predicted by generating symmetry mate. Trimeric structural analysis revealed that Arg6, His8, Gln35, and Asn36 of one protomer interact with Gln35, Asn36, Arg6, and His8 of the second protomer. Arg17 of the third protomer interacts with Ser29 of the second protomer at a distance of &#x2264;3.5&#xa0;&#xc5; (data not shown). Such interaction may be the probable reason for the self-assembly of rLinCas2C_Lai as a trimer in solution.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>The dimeric interface of rLinCas2C crystal structure. <bold>(A)</bold> The crystal structure shows a dimeric form of rLinCas2C, where one protomer is shown in cyan and another in magenta. <bold>(B)</bold> Putative DNA binding region on LinCas2C. LinCas2C bound DNA model using the template of <italic>Synechocystis</italic> Cas1-Cas2/prespacer binary complex structure (PDB id:7CR6). <bold>(C)</bold> Surface electrostatic potential map of LinCas2C. The positive and negative charges are blue and red, respectively (scale -1&#xa0;kcal/mol to &#x2b;1&#xa0;kcal/mol for red and blue, respectively). <bold>(D)</bold> Comparison of the distance between Asp residues side chain of the two protomers of LinCas2C and SsoCas2 and <bold>(E)</bold> LinCas2C-HpyVapD.</p>
</caption>
<graphic xlink:href="fmolb-09-988569-g005.tif"/>
</fig>
<p>The rLinCas2C DNA-nuclease interface demonstrates that the two complementary strands of DNA are cleaved by each protomer of the nuclease (<xref ref-type="fig" rid="F5">Figure 5B</xref>). The rLinCas2C residues interacting with DNA are primarily from the loop L1, L2, and &#x3b1;1 regions (<xref ref-type="sec" rid="s10">Supplementary Table S3</xref>). In agreement, the heterocomplex Cas1-Cas2-dsDNA structure of <italic>E. coli</italic> showed that the residues constituting the L1 loop of Cas2 interact with dsDNA (<xref ref-type="bibr" rid="B38">Nunez et al., 2015</xref>). The mapping of the surface electrostatic potential of rLinCas2C demonstrated the presence of a positive charge at the putative nucleic acid substrate-binding loop (L1 and L2) and the &#x3b1;1 region (<xref ref-type="fig" rid="F5">Figure 5C</xref>). The mapped amino acid residues of rLinCas2C interacting with dsDNA are shown in <xref ref-type="sec" rid="s10">Supplementary Table S3</xref> and the PDB file of the supplementary information. Among all the enlisted interacting residues, seven amino acid residues were positively charged (Agr17, Arg21, Arg33, Lys36, and Lys62). The crystal structure of rLinCas2C (dimeric form) indicates it is in a catalytically inactive conformational state as the distance between the conserved Asp8 residue of each protomer is 11.0&#xa0;&#xc5; (as opposed to 6.5&#xa0;&#xc5; in SsoCas2) (<xref ref-type="fig" rid="F5">Figure 5D</xref>). The distance of 11.0&#xa0;&#xc5; seems too far to coordinate a single Mg<sup>2&#x2b;</sup> ion of the protein. Similarly, the protomers of SpyCas2 (11.4&#xa0;&#xc5;), BhaCas2 (10.6&#xa0;&#xc5;), DvuCas2 (15.4&#xa0;&#xc5;), and HpyVapD (12.6&#xa0;&#xc5;) measured uneven distance between the conserved equivalent aspartate residue (<xref ref-type="fig" rid="F5">Figure 5E</xref>) (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>; <xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>; <xref ref-type="bibr" rid="B5">Bertelsen et al., 2021</xref>).</p>
</sec>
<sec id="s3-5">
<title>Recombinant LinCas2C mutants and their activity</title>
<p>A multiple sequence alignment of LinCas2C with its orthologs illustrated similarity with SpyCas2 (sequence similarity: 45% and query coverage: 98%), XabCas2 of <italic>Xanthomonas albilineans</italic> (41 and 100%), BhaCas2 (39 and 100%), DvuCas2 (37 and 100%), and SsoCas2 (30 and 68%). Several conserved residues (Tyr7, Asp8, Ala24, Arg33, Gln35, Leu55, and Leu71) and motifs (RVQ and SVF) in LinCas2C were identified (<xref ref-type="fig" rid="F4">Figure 4E</xref>). We have shown previously that mutation of Asp10 of LinCas2B abolished its DNase activity but not its RNase activity (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>). Thus, in this study, an additional site-directed mutation was performed in rLinCas2C at one or more sites predisposed to nuclease activity, and the purified recombinant protein was obtained for its characterization (<xref ref-type="sec" rid="s10">Supplementary Figure S1A</xref>). A model of LinCas2C with metal-ion was proposed to map the metal-ion binding residues. Tyr7 and Asp8 were found to be putative metal-binding residues of LinCas2C (<xref ref-type="fig" rid="F6">Figure 6A</xref>). Two other residues, Arg33 and Phe39, were found close to metal-binding residues and putative active site groove [purposed by Yakunin and co-workers (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>)], and were also found to be conserved among Cas2 homologs (<xref ref-type="fig" rid="F6">Figure 6A</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Nuclease activity of rLinCas2C mutant variants. <bold>(A)</bold> Proposed model of LinCas2C with Mg<sup>2&#x2b;</sup> ion. LinCas2C amino acid residues interacting with metal ions and putative active site are shown in stick form. <bold>(B)</bold> Nuclease activity of rLinCas2C mutants was evaluated on plasmid-1 (0.5&#xa0;&#xb5;g) in the presence and absence of divalent metal ion. <bold>(C)</bold> RNase activity of rLinCas2C or its mutant variants and the rLinCas2C_Lai was quantified using fluorescently labeled RNA substrate. A fluorescent RNA substrate (10&#xa0;pmol) was incubated with rLinCas2C or its mutants (25&#xa0;&#xb5;M) at 37&#xb0;C, and fluorescence was recorded at 5&#xa0;min intervals for 1&#xa0;hour.</p>
</caption>
<graphic xlink:href="fmolb-09-988569-g006.tif"/>
</fig>
<p>In SsoCas2, the residues Arg31 and Phe37 have been essential for nuclease activity (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). The rLinCas2C with single (rLinCas2C<sup>Y7A</sup>) and double (rLinCas2C<sup>Y7A&#x2b;D8A</sup>) mutants demonstrated reduced DNase activity, where a change in conformation of plasmid was evident due to a partial nick in DNA (<xref ref-type="fig" rid="F6">Figure 6B</xref>). In agreement, in this study, additional mutation of Arg33 and Phe39 in rLinCasC (rLinCas2C<sup>R33A&#x2b;F39A</sup> and rLinCas2C<sup>Y7A&#x2b;D8A&#x2b;R33A&#x2b;F39A</sup>) exhibited complete abolition in DNase activity (<xref ref-type="fig" rid="F6">Figure 6B</xref>). Yakunin and co-workers speculated that DNA and RNA substrate might interact with Cas2 loop L1 (&#x3b2;1-&#x3b1;1) and L2 (&#x3b1;2-&#x3b2;4), respectively (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). Hence, to explore the role of loop L2 in rLinCas2, a mutant construct with L2 deletion (rLinCas2C<sup>&#x394;L2</sup>) was generated to analyze the DNase and RNase activity. To our surprise, rLinCas2C<sup>&#x394;L2</sup> displayed a complete loss of DNase activity (<xref ref-type="fig" rid="F6">Figure 6B</xref>). The DNase assay with rLinCas2C<sup>&#x394;L2</sup> conflicted with an earlier report (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>), where the loop L1 was speculated for DNA substrate recognition. To address this inconsistency, rLinCas2C was docked with random DNA. Analysis of the docking study of DNA-LinCas2C suggests that DNA may interact with rLinCas2C at multiple sites, including the residues (Asp60, Lys62, Thr63, and Asp64) that constitute the loop L2 (<xref ref-type="sec" rid="s10">Supplementary Figure S4</xref>).</p>
<p>A kit-based fluorogenic RNA substrate was employed to quantify the RNase activity of rLinCas2C and compare it with its mutants or rLinCas2C_Lai. The mutant rLinCas2C<sup>Y7A</sup> exhibited RNase activity very similar to rLinCas2C, while the activity of other mutants (rLinCas2C<sup>Y7A&#x2b;D8A</sup>, rLinCas2C<sup>R33A&#x2b;F39A</sup>, rLinCas2C<sup>Y7A&#x2b;D8A&#x2b;R33A&#x2b;F39A</sup>, and rLinCas2C<sup>&#x394;L2</sup>) reduced moderately (<xref ref-type="fig" rid="F6">Figure 6C</xref>). The RNase activity of mutant rLinCas2C<sup>Y7A&#x2b;D8A&#x2b;R33A&#x2b;F39A</sup> was affected most adversely; however, none of the mutants demonstrated complete abolition in RNase activity. The RNase activity of rLinCas2C and rLinCas2C_Lai were abolished after heat denaturation, indicating protein is free of RNase contaminant, and activity is dependent on Cas2C protein conformation. The RNase assay suggested that in LinCas2, the residues involved in RNA cleavage differ from the DNA and thus has metal independency.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>The recombinant LinCas2C and the naturally truncated LinCas2C_Lai demonstrated nuclease activity on diverse DNA substrates (circular dsDNA, linear, and circular ssDNA) in a divalent metal- and pH-dependent manner. However, these nucleases were inert toward small DNA oligos (23-50-mer). In a recent study, LinCas2C_Linhai, a Cas2C ortholog, prefers Mg<sup>2&#x2b;</sup> for nuclease activity (<xref ref-type="bibr" rid="B51">Xiao et al., 2019</xref>). In contrast, LpnCas2 of <italic>Legionella pneumophila</italic> and TthCas2 of <italic>Thermus thermophilus</italic> could demonstrate nuclease activity in the presence of Mn<sup>2&#x2b;</sup> (<xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B20">Gunderson et al., 2015</xref>). The DNase activity of rLinCas2C and rLinCas2C_Lai was consistent with the other reported Cas2 proteins, including BhaCas2 (<xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>), SpyCas2 (<xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>), XorCas2 of <italic>Xanthomonas oryzae</italic> (<xref ref-type="bibr" rid="B31">Makarova et al., 2011</xref>), and LinCas2B (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>). Consistent with LinCas2B activity, rLinCas2C was inert towards single-stranded short oligos (<xref ref-type="bibr" rid="B11">Dixit et al., 2016</xref>). The recombinant Cas2 nucleases (LinCas2C and LinCas2C_Lai) of the two serovars of <italic>Leptospira</italic> is a divalent metal-independent RNase. In contrast, Sso8090Cas2 homologs from <italic>Sulfolobus solfataricus,</italic> TmaCas2 of <italic>Thermotoga maritima</italic>, MthCas2 of <italic>Methanobacterium thermoautotrophicum</italic>, AfuCas2 of <italic>Archaeoglobus fulgidus,</italic> LpnCas2 and NeuCas2 of <italic>Nitrosomonas europea</italic> exhibited metal-dependent RNase activity (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). Detection of nuclease activity in rLinCas2C_Lai suggests that the conserved residues at the N-terminal are more involved in nucleic acid catalysis. The metal-independent RNase activity of rLinCas2C indicates its additional role beyond CRISPR biology.</p>
<p>In a recent study, the virulence-associated protein D (VapD) of toxin-antitoxin systems (TA) was shown to possess a ribonuclease fold similar to Cas2 proteins (<xref ref-type="bibr" rid="B5">Bertelsen et al., 2021</xref>). The VapD toxins act as metal-independent nucleases that modulate gene expression by degrading specific, stable RNAs, including tRNA, rRNA, and mRNA (<xref ref-type="bibr" rid="B17">Goeders and Van Melderen, 2014</xref>). However, structurally VapD possesses a modified ferredoxin fold (&#x3b2;1&#x3b1;1&#x3b2;2-&#x3b2;3&#x3b1;2&#x3b2;4), where each of the two &#x3b1;-helices is split into two shorter helices connected by short loops, resulting in a &#x3b2;1&#x3b1;1&#x2032;&#x3b1;1&#x3b2;2&#x3b2;3&#x3b1;2&#x3b1;2&#x2032;&#x3b2;4 topology. In addition, various other VapD homologs (RelE, MazF, and VapC) of a toxin-antitoxin (TA) system function as RNases (<xref ref-type="bibr" rid="B27">Kwon et al., 2012</xref>; <xref ref-type="bibr" rid="B5">Bertelsen et al., 2021</xref>). Structural similarity of VapD with Cas2 fuelled the notion that the bacterial CRISPR-Cas immunity systems might have evolved from a primordial vapXD-type TA system (<xref ref-type="bibr" rid="B30">Makarova et al., 2012</xref>). Further, in this study, the naturally truncated rLinCas2C_Lai retains its nuclease activity like the full-length Cas2C nucleases (LinCas2C and LinCas2B). It is speculated that the RNase property of Cas2 orthologs may degrade exotic phage transcripts or inhibit translation by mRNA cleavage globally (<xref ref-type="bibr" rid="B5">Bertelsen et al., 2021</xref>). Cas2 proteins may utilize the intrinsic metal-independent ribonuclease activity encoded in the VapD-like fold to modulate bacterial cell growth and survival during infection (<xref ref-type="bibr" rid="B20">Gunderson et al., 2015</xref>).</p>
<p>The structural investigation of rLinCas2C demonstrates it to exist in a dimeric and apostate conformation with each subunit containing the signature ferredoxin fold. The rLinCas2C structure confirms the evolutionary conservation of the VapD/Cas2-like ribonuclease protein fold provided by <xref ref-type="bibr" rid="B5">Bertelsen et al. (2021)</xref>. In LinCas2C, dimeric interface &#x3b2;4 of one protomer interacts with &#x3b2;5 of another protomer, similar to SsoCas2, where the &#x3b2;-strand (&#x3b2;5) of each protomer interacts with the &#x3b2;-sheet of the other monomer creating a two-joint, five-strand, anti-parallel &#x3b2;-sheets (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). Also, in TonCas2, the C-terminal region of &#x3b2;5 from each protomer interacts with the &#x3b2;4 of the other molecule to form a &#x3b2;-sheet of five strands in both subunits (<xref ref-type="bibr" rid="B23">Jung et al., 2016</xref>). The structure of rLinCas2C describes the role of the catalytic aspartate in limiting conformational freedom. The distance between conserved aspartate residues of each protomer is crucial for coordinating metal-ion. In the case of LinCas2C, it was found to be 11.0&#xa0;&#xc5; seems too far to coordinate a single Mg<sup>2&#x2b;</sup> ion. For these aspartates to bind a bridging metal, the rLinCas2C would need to undergo either a major conformational change of the &#x3b2;1 or ferredoxin fold region or altogether the dimer orientation. Similarly, the uneven distance between the conserved equivalent aspartate residue was observed for the protomers of SpyCas2, BhaCas2, DvuCas2, and HpyVapD. (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>; <xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>; <xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>; <xref ref-type="bibr" rid="B5">Bertelsen et al., 2021</xref>). For DNA, since divalent cations are involved in catalysis, it is also possible that one metal ion is symmetrically bound in each site.</p>
<p>Metal-independent RNase activity of rLinCas2C functionally corroborates with that of HpyVapD of <italic>H. pylori</italic> (HP0315); however, dissimilar to that of SsoCas2. In SsoCas2, the coordination between Mg<sup>2&#x2b;</sup> and two Asp10 residues from two dimer subunits is mandated for initiating the phosphodiester cleavage. These two Asp10 residues from SsoCas2 dimer molecules can create coordination with an Mg<sup>2&#x2b;</sup> ion, as the distance between the side chains from the two residues is only 6.5&#xa0;&#xc5;. However, in the case of HpyVapD or rLinCas2C, the distance between the side chains of two instances of Asp7 or Asp8 are greater than 10&#xa0;&#xc5; indicating its inability to coordinate the metal ion. HpyVapD showed ribonuclease activity without metal ions (<xref ref-type="bibr" rid="B27">Kwon et al., 2012</xref>). At this point, the exact mechanism of these two aspartate residues as a nucleophile in the absence of metal is difficult to justify. However, considering the mutational studies of HpyVapD and its comparison with Cas2, two aspartate residues (Asp7 and Asp76) have been proposed as strong candidates for the catalytic site of VapD. In metal-independent nucleases, 2&#x2032;-OH of ribose makes an intramolecular nucleophilic attack on the adjacent 3&#x2032;-phosphate and breaks the RNA backbone (<xref ref-type="bibr" rid="B55">Yang, 2011</xref>). This mechanism is usually based on acid-base catalysis, where active-site acidic and basic residues are involved (<xref ref-type="bibr" rid="B55">Yang, 2011</xref>).</p>
<p>The closest homolog SpyCas2 is a metal- and pH-dependent dsDNase and shares standard functional features with the BhaCas2 (<xref ref-type="bibr" rid="B25">Ka et al., 2014</xref>). Mutagenesis of SsoCas2 (SSO1404) identified six residues (Tyr9, Asp10, Arg17, Arg19, Arg31, and Phe37) important for RNase activity and suggested that Asp10 might be the principal catalytic residue (<xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>). However, in DvuCas2, neither Tyr13 nor Phe45 was disposed of a catalytic role due to its buried location (<xref ref-type="bibr" rid="B41">Samai et al., 2010</xref>). Two or three conserved acidic residues are critical for catalysis in most known RNases. They involve coordinating one or two metal cations, which activate a nucleophilic water molecule to hydrolyze the phosphodiester bond or stabilize the transition state in cleavage reactions (<xref ref-type="bibr" rid="B50">Worrall and Luisi, 2007</xref>). In LinCas2C, alanine replacement mutation of conserved residues Tyr7, Asp8, Arg33, and Phe39 and loop L2 abolishes DNase activity, whereas moderate reduction of RNase activity was evident in selected mutants. The variation in the nuclease activity of the Cas2 family has been proposed to be due to the structural difference at its catalytic site (<xref ref-type="bibr" rid="B37">Nam et al., 2012</xref>). There is an exciting future building from the current work on deciphering shared protein structure-function relationships between bacterial defense systems. The global inhibition of translation by mRNA cleavage may be a fundamental principle in the biological role of Cas2 proteins as reported for TA systems, including RelBE, MazEF, PemIK, and ChpBIK (<xref ref-type="bibr" rid="B34">Masuda et al., 1993</xref>; <xref ref-type="bibr" rid="B16">Gerdes et al., 2005</xref>; <xref ref-type="bibr" rid="B59">Zhang et al., 2005</xref>; <xref ref-type="bibr" rid="B2">Beloglazova et al., 2008</xref>; <xref ref-type="bibr" rid="B58">Zhang and Inouye, 2009</xref>). To better understand the RNA catalysis mechanism of Cas2, a structure with RNA substrate-bound is needed. Such a structure would be highly valuable and provide insights into RNase activity.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/<xref ref-type="sec" rid="s10">Supplementary Material</xref>.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>MK conceived and supervised the study; VA performed experiments and analyzed data; HP, PG, and SK performed crystallization and analyzed its data; SK, VA, and MK wrote the manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>The present work was financially supported by the Department of Biotechnology, Government of India, bearing project number BT/PR25083/NER/95/1002/2017.</p>
</sec>
<ack>
<p>The authors gratefully acknowledge laboratory members Bhuvan Dixit and Aman Prakash for providing indirect help for the experiments and improvement of the manuscript. The authors also acknowledge the central instrument facility of the Indian Institute of Technology Guwahati for providing the in-house macromolecular crystallography facility.</p>
</ack>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmolb.2022.988569/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmolb.2022.988569/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table1.docx" id="SM1" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.pdf" id="SM2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet2.zip" id="SM3" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Battye</surname>
<given-names>T. G. G.</given-names>
</name>
<name>
<surname>Kontogiannis</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Johnson</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Powell</surname>
<given-names>H. R.</given-names>
</name>
<name>
<surname>Leslie</surname>
<given-names>A. G.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM</article-title>. <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>67</volume>, <fpage>271</fpage>&#x2013;<lpage>281</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444910048675</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Beloglazova</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Brown</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Zimmerman</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Proudfoot</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Makarova</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Kudritska</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>A novel family of sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats</article-title>. <source>J. Biol. Chem.</source> <volume>283</volume>, <fpage>20361</fpage>&#x2013;<lpage>20371</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M803225200</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berman</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Battistuz</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Bhat</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Bluhm</surname>
<given-names>W. F.</given-names>
</name>
<name>
<surname>Bourne</surname>
<given-names>P. E.</given-names>
</name>
<name>
<surname>Burkhardt</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2002</year>). <article-title>The protein data bank</article-title>. <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>58</volume>, <fpage>899</fpage>&#x2013;<lpage>907</lpage>. <pub-id pub-id-type="doi">10.1107/s0907444902003451</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berman</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Bhat</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Bourne</surname>
<given-names>P. E.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Gilliland</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Weissig</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2000</year>). <article-title>The Protein Data Bank and the challenge of structural genomics</article-title>. <source>Nat. Struct. Biol.</source> <volume>7</volume>, <fpage>957</fpage>&#x2013;<lpage>959</lpage>. <pub-id pub-id-type="doi">10.1038/80734</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bertelsen</surname>
<given-names>M. B.</given-names>
</name>
<name>
<surname>Senissar</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Bisiak</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Cunha</surname>
<given-names>M. V.</given-names>
</name>
<name>
<surname>Molinaro</surname>
<given-names>A. L.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Structural basis for toxin inhibition in the VapXD toxin-antitoxin system</article-title>. <source>Structure</source> <volume>29</volume>, <fpage>139</fpage>&#x2013;<lpage>150.e3</lpage>. <comment>e3</comment>. <pub-id pub-id-type="doi">10.1016/j.str.2020.10.002</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Biasini</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bienert</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Waterhouse</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Arnold</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Studer</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>SWISS-MODEL: Modelling protein tertiary and quaternary structure using evolutionary information</article-title>. <source>Nucleic Acids Res.</source> <volume>42</volume>, <fpage>W252</fpage>&#x2013;<lpage>W258</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku340</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Br&#xfc;nger</surname>
<given-names>A. T.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Free R value: A novel statistical quantity for assessing the accuracy of crystal structures</article-title>. <source>Nature</source> <volume>355</volume>, <fpage>472</fpage>&#x2013;<lpage>475</lpage>. <pub-id pub-id-type="doi">10.1038/355472a0</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>V. B.</given-names>
</name>
<name>
<surname>Arendall</surname>
<given-names>W. B.</given-names>
</name>
<name>
<surname>Headd</surname>
<given-names>J. J.</given-names>
</name>
<name>
<surname>Keedy</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Immormino</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Kapral</surname>
<given-names>G. J.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>MolProbity: All-atom structure validation for macromolecular crystallography</article-title>. <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>66</volume>, <fpage>12</fpage>&#x2013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444909042073</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Delano</surname>
<given-names>W. L.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Pymol: An open-source molecular graphics tool</article-title>. <source>CCP4 Newsl. protein Crystallogr.</source> <volume>40</volume>, <fpage>82</fpage>&#x2013;<lpage>92</lpage>. </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dixit</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Anand</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Hussain</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>The CRISPR-associated Cas4 protein from Leptospira interrogans demonstrate versatile nuclease activity</article-title>. <source>Curr. Res. Microb. Sci.</source> <volume>2</volume>, <fpage>100040</fpage>. <pub-id pub-id-type="doi">10.1016/j.crmicr.2021.100040</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dixit</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Ghosh</surname>
<given-names>K. K.</given-names>
</name>
<name>
<surname>Fernandes</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Gogoi</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Dual nuclease activity of a Cas2 protein in CRISPR&#x2013;Cas subtype I&#x2010;B of Leptospira interrogans</article-title>. <source>FEBS Lett.</source> <volume>590</volume>, <fpage>1002</fpage>&#x2013;<lpage>1016</lpage>. <pub-id pub-id-type="doi">10.1002/1873-3468.12124</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Emsley</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Lohkamp</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Scott</surname>
<given-names>W. G.</given-names>
</name>
<name>
<surname>Cowtan</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Features and development of Coot</article-title>. <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>66</volume>, <fpage>486</fpage>&#x2013;<lpage>501</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444910007493</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Evans</surname>
<given-names>P. R.</given-names>
</name>
<name>
<surname>Murshudov</surname>
<given-names>G. N.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>How good are my data and what is the resolution?</article-title> <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>69</volume>, <fpage>1204</fpage>&#x2013;<lpage>1214</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444913000061</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Faine</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>1974</year>). <article-title>The microbiological background to the Leptospira and leptospirosis</article-title>. <source>Pathology</source> <volume>6</volume>, <fpage>92</fpage>. <pub-id pub-id-type="doi">10.1016/s0031-3025(16)39091-2</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fouts</surname>
<given-names>D. E.</given-names>
</name>
<name>
<surname>Matthias</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Adhikarla</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Adler</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Amorim-Santos</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Berg</surname>
<given-names>D. E.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>What makes a bacterial species pathogenic?: Comparative genomic analysis of the genus Leptospira</article-title>. <source>PLoS Negl. Trop. Dis.</source> <volume>10</volume>, <fpage>e0004403</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pntd.0004403</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gerdes</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Christensen</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>L&#xf8;bner-Olesen</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Prokaryotic toxin&#x2013;antitoxin stress response loci</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>3</volume>, <fpage>371</fpage>&#x2013;<lpage>382</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro1147</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goeders</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Van Melderen</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Toxin-antitoxin systems as multilevel interaction systems</article-title>. <source>Toxins</source> <volume>6</volume>, <fpage>304</fpage>&#x2013;<lpage>324</lpage>. <pub-id pub-id-type="doi">10.3390/toxins6010304</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gouet</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Robert</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Courcelle</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>ESPript/ENDscript: Extracting and rendering sequence and 3D information from atomic structures of proteins</article-title>. <source>Nucleic Acids Res.</source> <volume>31</volume>, <fpage>3320</fpage>&#x2013;<lpage>3323</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkg556</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guglielmini</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bourhy</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Schiettekatte</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Zinini</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Brisse</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Picardeau</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Genus-wide Leptospira core genome multilocus sequence typing for strain taxonomy and global surveillance</article-title>. <source>PLoS Negl. Trop. Dis.</source> <volume>13</volume>, <fpage>e0007374</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pntd.0007374</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gunderson</surname>
<given-names>F. F.</given-names>
</name>
<name>
<surname>Mallama</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Fairbairn</surname>
<given-names>S. G.</given-names>
</name>
<name>
<surname>Cianciotto</surname>
<given-names>N. P.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Nuclease activity of <italic>Legionella pneumophila</italic> Cas2 promotes intracellular infection of amoebal host cells</article-title>. <source>Infect. Immun.</source> <volume>83</volume>, <fpage>1008</fpage>&#x2013;<lpage>1018</lpage>. <pub-id pub-id-type="doi">10.1128/IAI.03102-14</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Holm</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>DALI and the persistence of protein shape</article-title>. <source>Protein Sci.</source> <volume>29</volume>, <fpage>128</fpage>&#x2013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1002/pro.3749</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jansen</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Embden</surname>
<given-names>J. D. V.</given-names>
</name>
<name>
<surname>Gaastra</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Schouls</surname>
<given-names>L. M.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Identification of genes that are associated with DNA repeats in prokaryotes</article-title>. <source>Mol. Microbiol.</source> <volume>43</volume>, <fpage>1565</fpage>&#x2013;<lpage>1575</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-2958.2002.02839.x</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jung</surname>
<given-names>T. Y.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>K. H.</given-names>
</name>
<name>
<surname>An</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Schulga</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Deyev</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jung</surname>
<given-names>J. H.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Structural features of Cas2 from Thermococcus onnurineus in CRISPR&#x2010;cas system type IV</article-title>. <source>Protein Sci.</source> <volume>25</volume>, <fpage>1890</fpage>&#x2013;<lpage>1897</lpage>. <pub-id pub-id-type="doi">10.1002/pro.2981</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ka</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Hong</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jeong</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Jeong</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Suh</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Suh</surname>
<given-names>J.-Y.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Structural and dynamic insights into the role of conformational switching in the nuclease activity of the Xanthomonas albilineans Cas2 in CRISPR-mediated adaptive immunity</article-title>. <source>Struct. Dyn.</source> <volume>4</volume>, <fpage>054701</fpage>. <pub-id pub-id-type="doi">10.1063/1.4984052</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ka</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Baek</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Bae</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Structural and functional characterization of Streptococcus pyogenes Cas2 protein under different pH conditions</article-title>. <source>Biochem. Biophys. Res. Commun.</source> <volume>451</volume>, <fpage>152</fpage>&#x2013;<lpage>157</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbrc.2014.07.087</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kelley</surname>
<given-names>L. A.</given-names>
</name>
<name>
<surname>Mezulis</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yates</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Wass</surname>
<given-names>M. N.</given-names>
</name>
<name>
<surname>Sternberg</surname>
<given-names>M. J.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>The Phyre2 web portal for protein modeling, prediction and analysis</article-title>. <source>Nat. Protoc.</source> <volume>10</volume>, <fpage>845</fpage>&#x2013;<lpage>858</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2015.053</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kwon</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>K. Y.</given-names>
</name>
<name>
<surname>Min</surname>
<given-names>Y. H.</given-names>
</name>
<name>
<surname>Im</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Structural and biochemical characterization of HP0315 from <italic>Helicobacter pylori</italic> as a VapD protein with an endoribonuclease activity</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>, <fpage>4216</fpage>&#x2013;<lpage>4228</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr1305</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Laskowski</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Macarthur</surname>
<given-names>M. W.</given-names>
</name>
<name>
<surname>Moss</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Thornton</surname>
<given-names>J. M.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Procheck: A program to check the stereochemical quality of protein structures</article-title>. <source>J. Appl. Crystallogr.</source> <volume>26</volume>, <fpage>283</fpage>&#x2013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.1107/s0021889892009944</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Dhingra</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Sashital</surname>
<given-names>D. G.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>The Cas4-Cas1-Cas2 complex mediates precise prespacer processing during CRISPR adaptation</article-title>. <source>Elife</source> <volume>8</volume>, <fpage>e44248</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.44248</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Makarova</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Anantharaman</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Aravind</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Koonin</surname>
<given-names>E. V.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Live virus-free or die: Coupling of antivirus immunity and programmed suicide or dormancy in prokaryotes</article-title>. <source>Biol. Direct</source> <volume>7</volume>, <fpage>40</fpage>. <pub-id pub-id-type="doi">10.1186/1745-6150-7-40</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Makarova</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Haft</surname>
<given-names>D. H.</given-names>
</name>
<name>
<surname>Barrangou</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Brouns</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Charpentier</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Horvath</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Evolution and classification of the CRISPR&#x2013;Cas systems</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>9</volume>, <fpage>467</fpage>&#x2013;<lpage>477</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro2577</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Makarova</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Wolf</surname>
<given-names>Y. I.</given-names>
</name>
<name>
<surname>Alkhnbashi</surname>
<given-names>O. S.</given-names>
</name>
<name>
<surname>Costa</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Shah</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Saunders</surname>
<given-names>S. J.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>An updated evolutionary classification of CRISPR&#x2013;Cas systems</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>13</volume>, <fpage>722</fpage>&#x2013;<lpage>736</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro3569</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Makarova</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Wolf</surname>
<given-names>Y. I.</given-names>
</name>
<name>
<surname>Iranzo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Shmakov</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Alkhnbashi</surname>
<given-names>O. S.</given-names>
</name>
<name>
<surname>Brouns</surname>
<given-names>S. J.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Evolutionary classification of CRISPR&#x2013;cas systems: A burst of class 2 and derived variants</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>18</volume>, <fpage>67</fpage>&#x2013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1038/s41579-019-0299-x</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Masuda</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Miyakawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Nishimura</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Ohtsubo</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>chpA and chpB, <italic>Escherichia coli</italic> chromosomal homologs of the pem locus responsible for stable maintenance of plasmid R100</article-title>. <source>J. Bacteriol.</source> <volume>175</volume>, <fpage>6850</fpage>&#x2013;<lpage>6856</lpage>. <pub-id pub-id-type="doi">10.1128/jb.175.21.6850-6856.1993</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mccoy</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Grosse-Kunstleve</surname>
<given-names>R. W.</given-names>
</name>
<name>
<surname>Adams</surname>
<given-names>P. D.</given-names>
</name>
<name>
<surname>Winn</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Storoni</surname>
<given-names>L. C.</given-names>
</name>
<name>
<surname>Read</surname>
<given-names>R. J.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Phaser crystallographic software</article-title>. <source>J. Appl. Crystallogr.</source> <volume>40</volume>, <fpage>658</fpage>&#x2013;<lpage>674</lpage>. <pub-id pub-id-type="doi">10.1107/S0021889807021206</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Murshudov</surname>
<given-names>G. N.</given-names>
</name>
<name>
<surname>Skub&#xe1;k</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Lebedev</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Pannu</surname>
<given-names>N. S.</given-names>
</name>
<name>
<surname>Steiner</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Nicholls</surname>
<given-names>R. A.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>REFMAC5 for the refinement of macromolecular crystal structures</article-title>. <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>67</volume>, <fpage>355</fpage>&#x2013;<lpage>367</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444911001314</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nam</surname>
<given-names>K. H.</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Haitjema</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Delisa</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Ke</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Double-stranded endonuclease activity in Bacillus halodurans clustered regularly interspaced short palindromic repeats (CRISPR)-associated Cas2 protein</article-title>. <source>J. Biol. Chem.</source> <volume>287</volume>, <fpage>35943</fpage>&#x2013;<lpage>35952</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M112.382598</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nunez</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Harrington</surname>
<given-names>L. B.</given-names>
</name>
<name>
<surname>Kranzusch</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Engelman</surname>
<given-names>A. N.</given-names>
</name>
<name>
<surname>Doudna</surname>
<given-names>J. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Foreign DNA capture during CRISPR&#x2013;Cas adaptive immunity</article-title>. <source>Nature</source> <volume>527</volume>, <fpage>535</fpage>&#x2013;<lpage>538</lpage>. <pub-id pub-id-type="doi">10.1038/nature15760</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nu&#xf1;ez</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Kranzusch</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Noeske</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wright</surname>
<given-names>A. V.</given-names>
</name>
<name>
<surname>Davies</surname>
<given-names>C. W.</given-names>
</name>
<name>
<surname>Doudna</surname>
<given-names>J. A.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Cas1&#x2013;Cas2 complex formation mediates spacer acquisition during CRISPR&#x2013;Cas adaptive immunity</article-title>. <source>Nat. Struct. Mol. Biol.</source> <volume>21</volume>, <fpage>528</fpage>&#x2013;<lpage>534</lpage>. <pub-id pub-id-type="doi">10.1038/nsmb.2820</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rollie</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Brinkmann</surname>
<given-names>A. S.</given-names>
</name>
<name>
<surname>Bolt</surname>
<given-names>E. L.</given-names>
</name>
<name>
<surname>White</surname>
<given-names>M. F.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Intrinsic sequence specificity of the Cas1 integrase directs new spacer acquisition</article-title>. <source>Elife</source> <volume>4</volume>, <fpage>e08716</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.08716</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Samai</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Shuman</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Structure of a CRISPR-associated protein Cas2 from Desulfovibrio vulgaris</article-title>. <source>Acta Crystallogr. Sect. F. Struct. Biol. Cryst. Commun.</source> <volume>66</volume>, <fpage>1552</fpage>&#x2013;<lpage>1556</lpage>. <pub-id pub-id-type="doi">10.1107/S1744309110039801</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seto</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Shirouzu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Terada</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Murayama</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Kuramitsu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yokoyama</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Crystal structure of a hypothetical protein, TT1725, from Thermus thermophilus HB8 at 1.7 &#xc5; resolution</article-title>. <source>Proteins</source> <volume>53</volume>, <fpage>768</fpage>&#x2013;<lpage>771</lpage>. <pub-id pub-id-type="doi">10.1002/prot.10412</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sievers</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Wilm</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Dineen</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Gibson</surname>
<given-names>T. J.</given-names>
</name>
<name>
<surname>Karplus</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Fast, scalable generation of high&#x2010;quality protein multiple sequence alignments using Clustal Omega</article-title>. <source>Mol. Syst. Biol.</source> <volume>7</volume>, <fpage>539</fpage>. <pub-id pub-id-type="doi">10.1038/msb.2011.75</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tamura</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Stecher</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Peterson</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Filipski</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>MEGA6: Molecular evolutionary genetics analysis version 6.0</article-title>. <source>Mol. Biol. Evol.</source> <volume>30</volume>, <fpage>2725</fpage>&#x2013;<lpage>2729</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst197</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tuszynska</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Magnus</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Jonak</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Dawson</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Bujnicki</surname>
<given-names>J. M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>NPDock: A web server for protein&#x2013;nucleic acid docking</article-title>. <source>Nucleic Acids Res.</source> <volume>43</volume>, <fpage>W425</fpage>&#x2013;<lpage>W430</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv493</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Velankar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Alhroub</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Alili</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Best</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Boutselakis</surname>
<given-names>H. C.</given-names>
</name>
<name>
<surname>Caboche</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>PDBe: Protein Data Bank in europe</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>D402</fpage>&#x2013;<lpage>D410</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkq985</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sheng</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Structural and mechanistic basis of PAM-dependent spacer acquisition in CRISPR-Cas systems</article-title>. <source>Cell</source> <volume>163</volume>, <fpage>840</fpage>&#x2013;<lpage>853</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2015.10.008</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Filamentation initiated by Cas2 and its association with the acquisition process in cells</article-title>. <source>Int. J. Oral Sci.</source> <volume>11</volume>, <fpage>1</fpage>&#x2013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1038/s41368-019-0063-0</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Winn</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Ballard</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Cowtan</surname>
<given-names>K. D.</given-names>
</name>
<name>
<surname>Dodson</surname>
<given-names>E. J.</given-names>
</name>
<name>
<surname>Emsley</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>P. R.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Overview of the CCP4 suite and current developments</article-title>. <source>Acta Crystallogr. D. Biol. Crystallogr.</source> <volume>67</volume>, <fpage>235</fpage>&#x2013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444910045749</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Worrall</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Luisi</surname>
<given-names>B. F.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Information available at cut rates: Structure and mechanism of ribonucleases</article-title>. <source>Curr. Opin. Struct. Biol.</source> <volume>17</volume>, <fpage>128</fpage>&#x2013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.1016/j.sbi.2006.12.001</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiao</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Yi</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Che</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Imran</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Khan</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Characterization of CRISPR&#x2010;Cas systems in Leptospira reveals potential application of CRISPR in genotyping of Leptospira interrogans</article-title>. <source>Apmis</source> <volume>127</volume>, <fpage>202</fpage>&#x2013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1111/apm.12935</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Ng</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Nam</surname>
<given-names>K. H.</given-names>
</name>
<name>
<surname>Ke</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>How type II CRISPR&#x2013;Cas establish immunity through Cas1&#x2013;Cas2-mediated spacer integration</article-title>. <source>Nature</source> <volume>550</volume>, <fpage>137</fpage>&#x2013;<lpage>141</lpage>. <pub-id pub-id-type="doi">10.1038/nature24020</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization</article-title>. <source>Biophys. J.</source> <volume>101</volume>, <fpage>2525</fpage>&#x2013;<lpage>2534</lpage>. <pub-id pub-id-type="doi">10.1016/j.bpj.2011.10.024</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Protein structure and function prediction using I&#x2010;TASSER</article-title>. <source>Curr. Protoc. Bioinforma.</source> <volume>52</volume>, <fpage>51</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1002/0471250953.bi0508s52</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Nucleases: Diversity of structure, function and mechanism</article-title>. <source>Q. Rev. Biophys.</source> <volume>44</volume>, <fpage>1</fpage>&#x2013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.1017/S0033583510000181</pub-id> </citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yosef</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Goren</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Qimron</surname>
<given-names>U.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Proteins and DNA elements essential for the CRISPR adaptation process in <italic>Escherichia coli</italic>
</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>, <fpage>5569</fpage>&#x2013;<lpage>5576</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks216</pub-id> </citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Not all predicted CRISPR&#x2013;cas systems are equal: Isolated cas genes and classes of CRISPR like elements</article-title>. <source>BMC Bioinforma.</source> <volume>18</volume>, <fpage>92</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1186/s12859-017-1512-4</pub-id> </citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Inouye</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>The inhibitory mechanism of protein synthesis by YoeB, an <italic>Escherichia coli</italic> toxin</article-title>. <source>J. Biol. Chem.</source> <volume>284</volume>, <fpage>6627</fpage>&#x2013;<lpage>6638</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M808779200</pub-id> </citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Inouye</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Characterization of ChpBK, an mRNA interferase from <italic>Escherichia coli</italic>
</article-title>. <source>J. Biol. Chem.</source> <volume>280</volume>, <fpage>26080</fpage>&#x2013;<lpage>26088</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M502050200</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>