Molecular and Computational Strategies to Increase the Efficiency of CRISPR-Based Techniques

The prokaryote-derived Clustered Regularly Interspaced Palindromic Repeats (CRISPR)/Cas mediated gene editing tools have revolutionized our ability to precisely manipulate specific genome sequences in plants and animals. The simplicity, precision, affordability, and robustness of this technology have allowed a myriad of genomes from a diverse group of plant species to be successfully edited. Even though CRISPR/Cas, base editing, and prime editing technologies have been rapidly adopted and implemented in plants, their editing efficiency rate and specificity varies greatly. In this review, we provide a critical overview of the recent advances in CRISPR/Cas9-derived technologies and their implications on enhancing editing efficiency. We highlight the major efforts of engineering Cas9, Cas12a, Cas12b, and Cas12f proteins aiming to improve their efficiencies. We also provide a perspective on the global future of agriculturally based products using DNA-free CRISPR/Cas techniques. The improvement of CRISPR-based technologies efficiency will enable the implementation of genome editing tools in a variety of crop plants, as well as accelerate progress in basic research and molecular breeding.


INTRODUCTION
The capability of creating genetic variation has always been the key for crop improvement (Griggs et al., 2013). While this is traditionally achieved through conventional plant breeding or random mutagenesis induced by ionizing radiation and chemical mutagens, nowadays, agricultural biotechnology has entered a new era of nucleotide-scale precision (Jung and Till, 2021).
Site-directed nucleases are used to produce single-or double-strand breaks at specific DNA target sites. These DNA breaks stimulate either the stochastic non-homologous end joining (NHEJ) or homology directed repair (HDR) pathway, resulting in diverse outcomes such as site-directed mutagenesis, gene replacement, and nucleotide insertions or deletions (Kumar and Jain, 2015). While hybrid enzymes, like meganucleases, Zinc Finger Nuclease (ZNF), and Transcription activator-like Effector Nucleases (TALEN), have been successfully used for genome editing in plants (Curtin et al., 2012;Osakabe and Osakabe, 2015) the clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein (Cas) system is the new gold standard, because it is more versatile, less expensive, simpler and more reliable (Belhaj et al., 2013).
CRISPR/Cas9 was first discovered as a bacterial and archaeal defense system against transmissible elements from virus or exogenous DNA (Mojica et al., 2000;Mojica and Rodriguez-Valera, 2016). The rapid adoption of CRISPR/Cas9 technique originated from Streptococcus pyogenes, enabled the initial application as a genome editing tool in plants and animals (Makarova et al., 2011a,b;Hsu et al., 2014).
Even though CRISPR/Cas9 technology has been rapidly adopted and successfully implemented in plants, its gene editing efficiency rate varied greatly (Howells et al., 2018). Recently, several robust technical improvements have emerged based on the necessity to improve efficiency and specificity. This review focuses on highlighting the different computational and experimental base and prime editing strategies developed to increase the efficiency of CRISPR/Cas-based genome editing systems. We first describe recent advances in deep learning tools used during CRISPR/Cas constructs design. We then present a comprehensive overview of the recently developed Cas9 and prime editing (PE) tools and their implications on enhancing editing efficiency. We also provide a perspective on the global future of agricultural based products using DNA-free CRISPR/Cas techniques. The advances we discuss here point to several venues of experiencing that are paving the way to increase the efficiency in producing homozygous edits, the occurrence of off target editing, the broadening of possibilities of target sites in the genome and to enhance homologous recombination. Addressing these key issues will have a major impact on the way we can modify the genome of crops, leveraging the efforts to a sustainable agriculture and preparing crops to a changing environment.

Deep-Learning Strategies to Increase CRISPR Cleavage Efficiency
The rapid progress of experimental procedures implementing CRISPR/Cas technology in plants over the past decade has been accompanied by equally impressive advances in the computational methods for gRNA design and off target prediction (Wang et al., , 2020bLowder et al., 2018;Hajiahmadi et al., 2019). As genetic transformation methods and CRISPR implementation grew, the emerging algorithm developments for accurate target prediction revealed increasingly complex facets of the underlying biology, from sequencing composition to epigenetic regulation Yan et al., 2020). At the same time, rapid growth has forced constant reevaluation of the underlying algorithms and statistical models used by these computational tools when implemented in CRISPR studies (Sledzinski et al., 2020).
The development of computational tools has become essential in the process of single guide RNA (sgRNAs) design. However, one of the major challenges of the CRISPR system is to precisely predict the sgRNA on-target knock-in or knock-out efficacy, while avoiding undesired cuts to similar DNA sequences in the target genome creating off-targets. In plants as well as in other organisms, a large difference in cleavage efficiency has been observed, which suggests that several factors regulate the binding and cutting efficacy of the sgRNA-Cas complex (Raitskin et al., 2019;Ren et al., 2019b;Banakar et al., 2020;Wolabu et al., 2020;Grützner et al., 2021). Those factors include sequence composition, nucleotide position, GC content, chromatin accessibility, gene expression profile, RNA secondary structure, melting temperature and free energy (Kim et al., 2019;Safari et al., 2019;Xiang et al., 2021). Thus, a careful design of the sgRNA is essential for optimal Cas9 activity. It is critical to search for conserved domains, motifs, or regions within the target sequence among different genotypes or related species to make sure that there are no point mutations at the gRNA binding site or at the PAM site. A second important aspect to take into consideration is the presence of favorable nucleotides at the seed sequence (10 nucleotides closest to the PAM site; Doench et al., 2014;Xu et al., 2015). To date, the methods developed for sgRNA efficacy prediction and on-target identification can be classified into three major groups: (i) Alignment-based methods: sgRNAs candidates are designed based on an alignment from the target sequence and the respective genome purely by locating the PAM sequence. (ii) Hypothesis-driven: sgRNAs efficiencies are scored empirically by compiling information related to the factors that impact the genome context. (iii) Machine and Deep learning-based: sgRNAs are predicted from a training model by considering different features.
Over the last couple of years, solid evidence has shown that hypothesis-driven and learning-based strategies outperform alignment-based strategies (Wang et al., 2020c;Yan et al., 2020;Zhang et al., 2020). However, the popularity gained by artificial intelligence has positioned learning-based methods as the leading strategies to increase efficiency efforts. Building reliable models to design highly efficient sgRNA using learning-based strategies largely depends on the critical factors that the models are built on. For instance, it is required a considerably large amount of data of sgRNAs designed from different platforms and their respective experimental efficiencies to build reliable models. In addition, experimental data on real quantification of off-target site prediction based on all the possible nucleotide mismatch loci at the whole genome level are limited (Zischewski et al., 2017). And finally, other genomic and epigenetic features that affect sgRNA remain unclear and could improve its efficacy.
Many of these statistical methods use algorithms to recognize targets based on the gRNA-DNA pairing. For instance, CRISPR-P, one of the most popular tools for sgRNA design in plants, supports sgRNA design for almost 50 plant species including gene annotation. In addition, it provides a sgRNA scoring system for on-target efficiency and off-target (Lei et al., 2014). Interestingly, it also supports various CRISPR-Cas systems including Cas12a (formerly Cpf1). Nevertheless, CRISPR-P allows uploading custom sequences and identifying sgRNAs if the genome of interest is not listed between the ones available. However, no off-target is available for that option (Liu et al., 2017a). Another interesting web-based tool, CRISPR-PLANT, calculates specificity of all gRNA spacer sequences based on both mismatches number and position in their alignments with other spacer sequences for any given sequence (Xie et al., 2014;Minkenberg et al., 2019). However, CRISPR-PLANT only allows searching for NGG-PAM sites. It would be interesting to have the possibility to upload genome sequences from other species, as implemented in CRISPR-P.

Learning Cas9 Machines
Rational engineering of Cas9 nuclease has required conformational and mechanistic understanding of the CRISPR-Cas system, which has contributed greatly to push the boundaries of this technology. Currently, a vast repertoire of engineered or evolved SpCas9 variants (e.g., evoCas9, xCas9, HypaCas9, SpCas9-HF1, eSpCas9, and HiFi Cas9) have been created by rational design or directed evolution, altering catalytic function, PAM requirements and reducing off-target activity (for a detailed review on these Cas9 variants see Meaker et al., 2020). As such, the different combinations of key amino acid changes in the functional domains of Cas9 have improved its specificity, genome coverage and consequently, its efficiency. Nonetheless, despite remarkable progress, the existence of a guide-dependent trade-off between specificity and cleavage activity has been described among the panel of these novel variants (with the exception of xCas9), still leaving researchers puzzled by which Cas9 variants to use at a given locus (Schmid-Burgk et al., 2020).
Different features associated with gRNA activity (e.g., differences of nucleotide preference, and position-dependent nucleotide composition) between wild-type SpCas9 and these highly specific Cas9 proteins, explain to some extent the high dependence of these variants on the target sequences. Yet, far from meeting an ideal Cas9 (i.e., with high specificity and activity regardless of the guide RNA used), the use of techniques of relatively low-throughput and the lack of extensive comparisons among variants, due to an exceedingly costly task and laborintensive, have limited novel biological insights.
These unprecedented challenges regarding the analysis and interpretation of Cas9 variants along with the astronomical numbers of theoretical targetable sequences (4 20 = ca. 10 12 molecules) have recently directed researchers to look at other alternatives to handle these massive amounts of data. Recently, for instance, an elegant bacterial system (termed Self-targeting sgRNA Library Screen-SLS) was developed to characterize these tailored Cas9 proteins, exploiting more than a million sequence libraries, wherein over 60 physicochemical parameters were considered (Tálas et al., 2021). Interestingly, the study identified various sequence features that positively or adversely impact on SpCas9-HF1 cleavage. This variant prefers a slightly higher GC-content in the middle region compared to other ones of the spacer. However, the presence of motifs corresponding or overlapping to the GTNAC sequence, also in the middle region (positions 10-14), affects its ability to form cleavagecompetent conformation. Besides, it is noteworthy to mention that the SLS approach presented a robust prediction tool for mammalian cleavage activities too, which might open the use of a dozen SpCas9 variants.
Another recent high-throughput methodology that has been gaining a growing interest for decoding the variant activities dependency on target sequences is deep learning. Briefly, deep learning is a set of machine learning techniques based on stacked artificial neural network layers that can learn rich data representations from raw inputs through affine transformations and non-linear activation function (Xu et al., 2020c). Recently, using a combination of lentiviral libraries and DL-based computational models, Kim et al. (2019) have assembled an extensive comparison of SpCas9 variants, providing a helpful and general guide (Kim et al., 2020b). This online tool called DeepSpCas9variants allows users to select the most appropriate and effective variant to use at a given target sequence according to 20 most important features for predicting their activities. However, the greatest challenge will be to obtain the best identified Cas9 variants and optimize them for wet lab experiments.
Given the Cas9 dependency of the protospacer DNA and the above-mentioned astronomical amount of possible permutations, it might seem unrealistic a library of Cas9 variants suitable for any possible composition of the target sequence. On the other hand, this raises the question of whether, in the near future, an ideal Cas9 suitable for all target DNA might be achievable. Fortunately, the field of deep learning for protein engineering is also moving rapidly and outperforming conventional methods. Besides, this approach has not yet been explored and used for the Cas9 protein structural diversity to identify good adaptive routes to higher fitness . Concomitant with that, the need of multiple mutated residues in Cas9 has been predicted to improve specificity and efficiency by changing the conformational dynamics and biophysical properties of the iCas9 at different stages of its activity (Palermo et al., 2017(Palermo et al., , 2018Boyle et al., 2021;Wang et al., 2021). Mostly, these residues mediate the allosteric communication of Cas9 (i.e., flexibility of the three conformational stages) and the RNA/DNA heteroduplex to Frontiers in Plant Science | www.frontiersin.org ensure the proper positioning of the catalytic site and the proofreading step (Yang et al., 2018). However, to find multiple mutations at one time is beyond the capacity of most screenings. In addition, the extreme epistatic interactions, so-called sign epistasis (i.e., the result of a mutation depends on the preceding mutations), in Cas9 variants may be a constraint on the path to an optimum fitness peak (Poelwijk et al., 2007;Romero and Arnold, 2009). Thereby, the use of conservative computational approaches such as UniRep (Alley et al., 2019), low-N (Biswas et al., 2021), and 1D convolutional neural network (CNN) is gaining much ground nowadays to the study of large-scale conformational changes of Cas9 and to accelerate engineered proteins (Orellana, 2019;Gao et al., 2020).

INCREASING THE PAM SEQUENCE SCOPE
Streptococcus pyogenes Cas9 is the most widely used protein in the CRISPR/Cas system for genome editing. However, the number of sequences it can recognize is limited by its strict dependence on the PAM (5′-NGG-3′) motif (Zeng et al., 2020). In case non-canonical PAM motifs could be recognized, the number of sequences that can be edited would increase remarkably, for instance: 1.36 times in rice (Hu et al., 2016) and 1.62 in pepper . Therefore, several studies have focused on developing variants that improve PAM recognition sequence flexibility allowing broader screens on the target genome. Variants with different PAM preferences have been reported, such as VQR-Cas9 (NGA PAM), VRER-Cas9 (NGCG PAM), EQR-Cas9 (NGAG PAM; Kleinstiver et al., 2015), xCas9 (NG, GAA, and GTA PAM; Hu et al., 2018), SpCas9-NG (NG PAM; Nishimasu et al., 2018), SpG, and SpRY .
SpCas9-VQR, EQR, and VRER have been used to edit Arabidopsis (Yamamoto et al., 2019) and rice genomes (Hu et al., 2016(Hu et al., , 2018, but their cleavage efficiency was lower in comparison with wild type SpCas9 (Kleinstiver et al., 2015;Hu et al., 2016Hu et al., , 2018Yamamoto et al., 2019). On the other hand, SpCas9-NG allows a broader sequence recognition, identifying at least four types of atypical PAMs without showing a preference for the third nucleotide (NAC, NTG, NTT, and NCG; Hua et al., 2019;Ren et al., 2019a;Zhong et al., 2019) which increases the possibility of genetic editing in any given target of a plant genome. The most effective xCas9 variant is xCas9-3.7, showing high targeting fidelity, although a lower editing efficiency compared to SpCas9-NG (Hua et al., 2019;Zhong et al., 2019).
Interestingly, the development of variants partial or totally independent to the PAM sequence (SpG and SpRY) increases the possibility to edit a wider variety of genomic loci . Both variants, SpRY and SpCas9-NG, are so far the most efficient in the context of site-directed mutagenesis not only in plants Xu et al., 2021) but also in single-celled organisms (Asano et al., 2021) Nevertheless, the relaxation that allows them to expand their compatibility with shorter PAM sequences can also lead to the recognition of a greater number of potential off-target sites (Endo et al., 2019). Comparison of the editing specificity of these variants with Cas9-WT demonstrated that SpCas9-NG has comparable editing activity (Hua et al., 2019;Zhong et al., 2019), while the broad flexibility of SpRY increases the editing of off-target sites in all plant genomes Xu et al., 2021). Fortunately, it was shown that SpRY-HF1 can almost completely mitigate off-target site editing while increasing the fraction of total events edited at on-target sites in human cells .

Improved Templates for Homologous Recombination Using Cas9
In eukaryotic cells, the two repair mechanisms, NHEJ and HR, compete with each other, and NHEJ is by far the preferred choice (Miyaoka et al., 2016). In the presence of a repair template containing a region with homology to the region flanking the DSB, HR can take place, replacing part of the gene of interest (Begemann et al., 2017;Oz et al., 2021;Wei et al., 2021). The frequency of HR in plants is very low (Butt et al., 2017;Miki et al., 2018;Que et al., 2019;Ali et al., 2020) and the delivery of the DNA template to different cell types in the right amount to stimulate recombination is not an easy task (Schindele et al., 2018;Huang and Puchta, 2019). HR can also be directed by the addition of an RNA template molecule. Li et al. (2016) used a pair of gRNA targeted to adjacent introns of the gene encoding 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS). By adding a donor DNA template containing a few mutations the rice endogenous gene was replaced at a 2% frequency, producing glyphosate-resistant plants. With the use of one sgRNA targeting one intron the gene replacement frequency was 2.2%. Furthermore, the site-specific gene replacements were transmitted to the next generation.
Several methods for template delivery aiming to increase HR efficiency have been developed and the majority lies on the type and/or quantity of the molecule used as template.
Other studies aimed to increase efficiency by co-localizing the donor template and the RNA-guided nuclease. Aird et al. (2018) demonstrated in mammal cells that a 30-fold increase in HDR can be obtained by covalently tethering a single-stranded oligonucleotide (ssODN) to the Cas9 complex using a HUH endonuclease, creating a stable complex between the ribonucleoprotein (RNP) and the ssODN without the need for its chemical alteration, changes in the sgRNA or additional proteins. Butt et al. (2017) verified that the template can be delivered as an RNA molecule fused with the sgRNA, producing one single RNA molecule that will set the specificity of the nuclease and serve as the HR template. However, this technique has some limitations, such as low efficiency, low versatility and the need for long homology arms. Lu et al. (2020) improved dsDNA template stability and consequently the insertion efficiency (compared to unmodified dsDNA and to ssDNA) by adding at the 5′ and 3′ ends of both strands two phosphorothioate linkages. This methodology was also efficient to insert short (<70 bp) and longer (526 and 2,049 bp) modified dsDNA donors. These authors also developed the tandem repeat-HDR strategy (TR-HDR) in which the desired insertion is flanked by a repeat sequence, increasing HR, providing a robust successful mean of base substitutions.
The integration efficiency can also be enhanced by the use of engineered DNA virus-based replicons to amplify the number of template copies available to stimulate HR. Several studies were successful, and insertion of desired DNA sequences were achieved in some species such as tomato (Čermák et al., 2015;Dahan-Meir et al., 2018), rice (Wang et al., 2017a), and hexaploid wheat (Gil-Humanes et al., 2017).
The microhomology-mediated end joining (MMEJ) pathway happens when the donor template has short homology arms (5-25 pb) to be used during the HR system (Nakade et al., 2014). Although this repair mechanism has low occurrence in G0/G1 phase compared to S and G2 phases of plant cell cycle (Truong et al., 2013) and is highly error-prone (García-Medel et al., 2019). Some studies suggest that MMEJ, especially in the hexaploid wheat, may be an effective HR strategy (Nakade et al., 2014;Wang et al., 2014;Arndell et al., 2019).
Recently, Schubert et al. (2021), from Integrated DNA Technologies (United States) evaluated a series of parameters affecting homologous recombination efficiency using singlestranded oligodeoxynucleotide (ssODN) as donor templates, probing 254 loci in Jurkat cells and another 239 loci in HAP1 cells. Among several parameters tested, it is worth noting the selection of the targeting strand is highly dependent on the loci and also on the cell type and that the position of the DSB site has a critical impact. Moreover, the introduction of blocking mutations, inhibiting the re-cleavage of the recombined product by Cas9, also had a positive effect, namely when targeting the PAM sequence. The company has a website 1 that uses all their findings to create optimized sequences for homologous recombination. The evaluation of these rules to plant genome editing awaits further experimentation.
Cas9-coupled deaminases and prime editing techniques were developed as an alternative to overcome those problems and precisely and efficiently produce specific edits without both DSB and a repair template, as discussed below.

Introducing Base Edition With Nuclease Fused Deaminases
Another variation of the CRISPR/Cas9 system is the base edition technique (Figure 1). Mutations are created due to the capacity of cytidine deaminases to convert C/G to A/T and adenine deaminases to convert A/T to C/G (Nishida et al., 2016;Schindele et al., 2018;Chen et al., 2019a). These editor proteins are fused to Cas9, Cas13, or Cas12a, that guide them to the desired locations in the genome. This approach allows the modification of only one nucleotide in a specific sequence. Using mammalian cells, it was shown that two mutations (Asp10Ala and His840Ala Cas9) can be introduced in the Cas9 protein, generating catalytic dead protein (dCas9) unable to produce a double-strand break, but still capable of being guided to a specific DNA target by a gRNA (Komor et al., 2016;Rees and Liu, 2018). The dCas9 protein is then fused with a deaminase allowing the substitution of the desired base. Amino acid substitution or stop codon generated by a single nucleotide mutation may lead to losing or changing the protein function. Komor et al. (2016) found that a Cas9 containing only the Asp10Ala mutation (Cas9 nickase, or nCas9) was able to introduce nicks at the target strand and performed better than dCas9. nCas9 also performed better than dCas9 in rice (Shimatani et al., 2017) and has been widely used in other plant species (Li et al., 2020c;Qin et al., 2020a).
More recently, several groups have reported the development of new deaminases able to perform transversion base changes, known as C-to-G base editors (CGBEs), by fusing rAPOBEC1 to either uracil DNA glycosylase (UNG; Kurt et al., 2021;Zhao et al., 2021) or XRCC1, a base excision repair protein . Besides their ability to perform new editions, a R33A mutation in the rAPOBEC1 deaminase and the use of UNG or rXRCC1, reduced off-targets of these new deaminases in the human genome Kurt et al., 2021;Zhao et al., 2021). Interestingly, these authors also found that in the target sequences, these CGBE have a stronger bias towards the cytosine positioned at the sixth position. Sretenovic et al. (2021) produced versions of these three CGBE with codon optimization for plants, and their performance showed a dependence on the species. For example, in rice, monoallelic editing efficiency with a rXRCC1-based CGBE produced efficiencies up to 38%, while in poplar the best performance (6.25%) was obtained with the UNG-rAPOBEC1 (R33A)-based CGBE. Assays using rice and tomato protoplasts also showed C to T editions and indels as major byproducts. By contrast, in transgenic rice plants, no indels were observed in most CGBE constructs, indicating that these byproducts might be dependent on cell cycle and DNA repair mechanisms. It is worth noting that G to C edition had lower efficiency when compared to C to T conversion by the regular CBE system.
The use of nucleases that recognize different PAM can add versatility to genome editing. This is the case of Cas12a, which recognizes TTTV (V = A/G/C) as the PAM. However, Cas12abased base editors (BEs) have lower editing efficiencies compared to SpCas9-based BE systems, probably because Cas12a has a looser binding to DNA targets when compared with SpCas9 (Bin Moon et al., 2018;Chen et al., 2020). The efficiency of base editing was also influenced by the context of the PAM sequence, indicating that the distinct gene context of the edition site should be carefully evaluated when using different combinations of nuclease:deaminase.
The addition of a protein from Bacillus subtilis bacteriophage PBS1, presenting uracil DNA glycosylase inhibitor (UGI), allowed a 3-fold enhancement in base edition in human cells (Komor et al., 2016). This is because this protein inhibits uracil DNA glycosylase (UDG), an enzyme involved in base-excision repair, reverting the edited U:G pair to the original C:G pair (Kunz et al., 2009). More recently, Qin et al. (2020b) produced an enhanced BE3 (eBE3), by adding three copies of UGI to the 3′ end of the BE3 sequence, which increased efficiency up to 2.8 times. Moreover, eBE3-edited plants had no indels which were observed in up to 25% of the edited plants using BE3 with a single UGI. The rates of undesired C>A and C>G conversion, and "clean edits" (only C>T substitutions) were Frontiers in Plant Science | www.frontiersin.org 1.14-3.81 higher with eBE3 compared to BE3 in several gene targets. Interestingly, Nishida et al. (2016) also found evidence that UGI also helped to decrease indels at the edited site. It is worth noting that in their seminal work, Gaudelli et al. (2017) found that adenine BEs have lower rates of indels when compared to editions using Cas9. In plants, Jin et al. (2019) also found that adenine BEs have a low frequency of changes in off target sites, when compared to cytosine BEs.
The ability of the protein::RNA complex to be transported to the nucleus is a key step in genome editing. Koblan et al. (2018) found that a bipartite nuclear localization signal (bpNLS) positioned at the N and C terminus of the coding region of an APOBEC deaminase::Cas9::UGI fusion enhanced 1.3 fold the editing process in human cells. The same authors also found that the algorithms used by different authors and companies to optimize the coding regions also influenced the efficiency of the cytidine base editor, ranging from 20% to up to 60% of total sequencing reads with target C:G converted to T:A in human cells (Koblan et al., 2018). In the case of an adenine base editor, these authors found increases in editing efficiency from 1.3-to 7.9-fold when the optimization algorithms from two private companies were compared. Anyway, these enhancements are correlated with an improved nuclear localization of the editing protein fusion, as observed by Zafra et al. (2018), using the BE3 cytidine base editor in mice cells.
The influence of different NLS as well as the effect of multiple copies of NLS in base editing in rice was reported by Wang et al. (2020a). These authors found that in some genome sites, the efficiency of the different configurations of the base editor was similar. However, the efficiency was 3-fold higher in the case of the OsSLR1 gene edited with a construct containing two different NLS compared to a construct containing one single copy. Li et al. (2018a), using a transient assay with a mutated version of the GFP gene in rice protoplasts, found no differences when constructs containing one or two NLS were used in vectors expressing an adenosine deaminase (ecTadA-ecTadA*) with nCas9 (D10A), while the insertion of a third copy of a NLS increased the efficiency by 25%. However, the increase in efficiency was less pronounced, around 10%, when several genes in the rice genome were assayed.
Changes in the sequence of the sgRNA can enhance A to G edition by more than two-fold in 13 target sites in both rice and wheat (Li et al., 2018a). Qin et al. (2020b) also observed editing frequencies of the enhanced gRNA (esgRNA) 1.9-2.1 fold higher than the usual gRNA in five genes from rice. These changes seem to increase not only the assembly of gRNA with the dCas9 protein, the gRNA stability, but also eliminate a putative Pol-III terminator (Chen et al., 2013). However, Wu et al. (2019) produced another esgRNA, introducing mutations in the native gRNA scaffold slightly different from those made by Qin et al. (2020), and observed only modest increases in BE efficiency. Although these studies tested the distinct esgRNA in different target genes, the data suggest that changes in the native gRNA need to be carefully tested.

PRIME EDITING
Prime editing was first described at the end of 2019 in human and yeast cells (Anzalone et al., 2019) and since then it has been used broadly in several plant species (Butt et al., 2020;Hua et al., 2020;Jiang et al., 2020;Li et al., 2020b;Lin et al., 2020;Xu et al., 2020a,b;Lu et al., 2021;Perroud et al., 2022). PE is based on a "search and replace" approach in which a nickase Cas9 (nCas9) is fused to a reverse transcriptase (RT), coupled to a prime editing guide RNA (pegRNA) that specifies both the target site and provides the template for the edition (Figure 2). The mechanism of PE action is described in Figure 3.
PE is divided into three main strategies: PE1 strategy 1 (PE1) uses a wild type Moloney murine leukemia virus (M-MLV) RT fused to a nCas9 and a pegRNA. PE strategy 2 (PE2) uses an engineered RT to increase the edition efficiency (mutations that affect thermostability, processivity, DNA-RNA substrate affinity and RNaseH activity) and therefore, exhibited The endogenous cellular endonucleases prefer 5′ flaps as a substrate digesting it and leaving the 3′ flap to be ligated. At the end of the process the nicked DNA strand is replaced by the newly synthesized strand that had the information copied from the pegRNA generating a heteroduplex. In this case, the repair mechanism is going to resolve the mismatch using one of the strands, so there is a 50% chance of the edition being repaired by the cell. To overcome this setback, the induction of a second nick on the unedited strand using a sgRNA (H) stimulates the repair by copying the information present on the edited strand (I).
T, G to A, A to G, and T to C), and transversions (C to A, C to G, G to C, G to T, A to C, A to T, T to A, and T to G) without inducing DBS. However, the authors describe that each prime editing experiment must be optimized because several factors can affect the efficiency, such as the source of RT enzyme, thermostability and binding capacity of the RT enzyme, length of the RT template, length of the PBS, position of the nicking sgRNA in the unmodified strand, secondary structure of the pegRNA, G/C content of the PBS, the target gene and location of the mutation relative to the PAM site (Anzalone et al., 2019). In plants, these strategies have been tested in rice and the evidences show that PE3 does not increase the efficiency in relation to PE2 (Butt et al., 2020;Lin et al., 2020;Tang et al., 2020;Xu et al., 2020a). PE3b also does not increase the efficiency in relation to PE3 Lin et al., 2020;Xu et al., 2020a). For PE3 and PE3b a second nick is performed on the non-edit DNA strand to stimulate the repair using the new reverse transcribed strand as a template. Besides the different PE strategies, PBS, template length, nicking positions also have to be optimized for each target gene Tang et al., 2020;Xu et al., 2020a,b;Lu et al., 2021). However, Lu et al. (2021) concluded that closer second nicks and simpler base changes affect the efficiency positively. Lin et al. (2020) did an extensive evaluation of all types of base editions, deletions and insertion on several rice targets. They concluded that, although successful, each of them presented different efficiencies in different targets and that editing efficiencies decreased as the size of the intended deletions or insertions increased. Hua et al. (2020) showed in rice that only 9.1% of the regenerated plants were edited for a simple S627N change in the OsALS gene, while other targets (APO1, SLR1, OsSPL14, and APO2) aiming C42F change, 3-bp, 24-bp, and 24-bp insertion respectively, had no mutated lines. These data reinforce that the efficiency of even simple edits varies at different sites and longer insertions are harder to achieve. Xu et al. (2020a) using the OsPDS as a model analyzed different types of editions (insertion of 1, 2 or 3 nt; 28 bp deletion; A to T or A to C modification) and also observed a wide amplitude of edition frequencies (from 0 to 31.3%), with transversion being the most efficient edit.

Optimizations of nCas9 and RT
The system can be modified by exchanging the nCas9-RT fusion and its expression. Hua et al. (2020) tested SaCas9, a multiple-turnover enzyme that releases DNA faster than SpCas9 (Yourik et al., 2019). SaCas9 was used to target the mutated version of the EGFP gene expressed in rice calli, but only 5.12% expressed GFP, much less than the 52.5-55.5% successfully edited using SpCas9. SaCas9 may require a different sequence and structure of sgRNA, and optimization of the pegRNA is required for this type of protein. Xu et al. (2020b) fused a hygromycin phosphotransferase to the c-terminus of the nSpCas9-M-MLV region with a self-cleaving 2A peptide. Overall, most of the edits had their efficiency significantly increased (when compared to the nSPCas9-M-MLV alone) and it was especially efficient to induce multiple bases mutations.
Codon optimization and the use of alternative promoters are also strategies tested in order to increase PE efficiency. Tang et al. (2020) optimized both Cas9 and M-MLV codons for plants and added a nuclear localization signal and even though the efficiencies in rice protoplast were very low (0.1-1.55%), the optimization increased slightly the number of edited reads. Lu et al. (2021) also showed that codon optimization of the M-MLV RT increased by three times the editing efficiency in tomato, while the RPS5A promoter was also three times more effective than the usual 35S promoter. Lin et al. (2020) evaluated the replacement of the engineered M-MLV RT by the cauliflower mosaic virus RT (RT-CaMV; Plant et al., 1985) or a retron-derived (RT-retron) from Escherichia coli BL21 (Lim and Maas, 1989) in an assay to convert the blue fluorescent protein (BFP) to GFP (Zong et al., 2017). Rice protoplasts were analyzed by flow cytometry and the M-MLV RT presented a 4.4% efficiency, RT-CaMV presented 3.7% and RT-retron 2.4%. M-MLV RT and RT-CaMV were also tested in an endogenous gene and showed significantly higher efficiency.

pegRNA Optimizations
Some pegRNA design tools have been reported, such as Multicrispr (Bhagwat et al., 2020), PrimeDesign (Hsu et al., 2021), and PlantPegDesigner (Lin et al., 2021), but both of them are suitable for human/mouse genomes only. Despite that, pegRNAs can be optimized by changing their promoters, and therefore increasing their expression, changing the PBS or the template lengths. One study reported a high-throughput evaluation of PE2 activities in human cells using 54,836 pairs of pegRNAs to develop computational models to predict pegRNA efficiency and an accuracy by Spearman's correlations was between 0.47 and 0.81. The author's recommendations were: (1) use a 13-nt PBS and a 12-nt RT template; (2) use a high GC count in the PBS region if possible; (3) use a G at the last templated nucleotide when the RT template length is ≤12 nt; and (4) include PAM editing for human cells (Kim et al., 2021).
For plants, Lin et al. (2020) tested a variety of pegRNAs with differential PBS and RT template lengths and also nicking positions. The authors concluded that these factors strongly affect the editing frequencies in rice, and each target site presents optimal parameters. Jiang et al. (2020) hypothesized that the efficiency could be improved by enhancing the expression of pegRNAs. However, they verified that doubling the pegRNA expression cassettes in rice did not increase PE frequency.
The common conclusion of all studies that tested different PSB and RT length in plants, such as tomato (Lu et al., 2021), rice Tang et al., 2020;Xu et al., 2020a,b), is that each target there has its optimal combination of PBS and template length, and for every study several tests must be performed to find the best strategy. Lin et al. (2021) described the in rice the optimal performance was achieved when the PBS had a melting temperature of 30°C and using two pegRNAs in trans coding for the same edits (called paired pegRNA) increased the edition efficiency from 2.9-fold to 17.4 fold and also launched a web application (PlantPegDesigner) for optimal pegRNA and paired pegRNA design.
A recent study  described the enhanced prime editing system (ePE) in which to avoid the circularization of the pegRNA due to complementarity between the PBS 3′ the spacer 5′ the 20-nt Csy4 recognition site was added to the 3′ end of the pegRNA. This site forms a hairpin therefore, avoiding pegRNA circularization and increased editing efficiency although the addition of Csy4 slightly increased indels frequency. This technique needs further optimization and there are no reports of its use in plants yet.

Reducing Off-Targets in Prime Editing
Prime editing has a lower incidence of off-targets when compared to conventional CRISPR/Cas9 approach (Anzalone et al., 2020). The higher efficiency may be due to the three-step hybridization necessary for the editing. The first one is between the target DNA and the spacer present in the pegRNA, the second one between the PBS and the target DNA to start the RT priming and finally, the hybridization of the DNA flap. In the conventional CRISPR/Cas9 approach, only one hybridization is necessary, between the target DNA and the spacer in the sgRNA. The off-target rate in each experiment can also be influenced by the target DNA configuration in the chromosome, as heterochromatin or euchromatin (Kallimasioti-Pazi et al., 2018;Verkuijl and Rots, 2019). However, only one of all the studies in plants reported the evaluation of off-targets . Authors used the CRISPR-GE software 2 and posterior PCR amplification and sequencing of 10 putative off targets of four genes targeted by prime editing, and they found no mutations, indicating a high specificity. Kim et al. (2020a) developed a strategy named Nickase-based Digenome-seq (nDigenome-seq), based on next generation sequencing to recognize all off-targets produced by experiments using PE2 in vitro in human cells. They demonstrated that undesired DNA modifications were detected in 0.1-1.9% of the cases, which can be considered as very low. Despite the importance of this type of evaluation, only Li et al. (2020b) described that no off-targets were detected; however, several by-products such as chimeras and unexpected insertions were detected in rice plants. Kim et al. (2021) also found that PE specificity in human cells could be further improved by incorporating mutations from engineered Cas9 variants, particularly eSpCas9 and Sniper Cas9, into PE, and these strategies can also be used for plants.
One of the main pitfalls of PE technique is the relative high frequency of by-products such as incorporation of pegRNA scaffolds, incomplete editions due mistakes of the repair system and unexpected deletions and insertions. Reports indicate up to 37.2% frequency of by-products in maize  and 2.23% in tomato (Xu et al., 2020b). In rice, by-products are also reported Tang et al., 2020;Xu et al., 2020a;Li et al., 2020b). Lin et al. (2020), studying rice and wheat, tested several targets and pegRNAs for each one of them and concluded that different PBS length did not affect the frequency of by-products. Meanwhile, template length not only dramatically affected the proportion of by-products, but were also variable depending on the target locus. Interestingly, 2 http://skl.scau.edu.cn/ OsCDC48-T3 and OsEPSPS-T2 target sites were less edited, despite the high indel frequency generated by nCas9, indicating that prime editing activity is independent from Cas9 activity at some targets. Xu et al. (2020a) described that PE3 with second nick closer to the first original nicking site was more efficient in producing edited plants, but also produced a higher amount of plants with by-products.
Some studies were capable of regenerating plants of rice (Butt et al., 2020;Hua et al., 2020;Li et al., 2020b;Lin et al., 2020;Xu et al., 2020b), tomato (Lu et al., 2021), and maize . However, despite all the efforts, these works have found that the production homozygous plants in the first generation has a very low efficiency. Most works reported heterozygous or chimeras which leads to the conclusion that this technique needs more improvement before being widely applied, namely in polyploid species.

CRISPR/Cas12a SYSTEM
The Cas12a endonuclease belongs to type V of class 2 effector proteins (Koonin et al., 2017) and has a size between ~1,200 and ~1,500 aa as a single subunit protein (Zetsche et al., 2015). Guided by a single mature crRNA of 42-44 nt length, Cas12a binds upstream of a typically thymidine-rich PAM TTTV (V = A, C, and G) and cleaves DNA 18-23 nt distal of the PAM via 5 bp staggered double-stranded breaks (Zetsche et al., 2015). Different from the Cas9 endonuclease which requires a tracrRNA and RNase III, Cas12a maturates the pre-crRNA by its intrinsic ribonuclease activities (Fonfara et al., 2016). The action mode of Cas12a is depicted at Figure 4.
The CRISPR/Cas12a system has been used mostly for mutagenesis through NHEJ repair in plants, but also HDR (Begemann et al., 2017;Li et al., 2019) and base editing have been demonstrated (Li et al., 2018b;Kleinstiver et al., 2020). With its great potential as a genome editing tool in plants, the CRISPR/Cas12a system has been improved to achieve higher editing efficiencies, more relaxed PAM requirements (Gao et al., 2017) and better efficiencies in multiplexed systems .
The use of RR and RVR variants of AsCas12a and LbCas12a in vitro and in human cells were shown to overcome the strict requirement for TTTV PAM sites (Gao et al., 2017). In a similar fashion, LbCas12a-RR/FnCas12a-RR and LbCas12a-RVR/ FnCas12a-RVR were tested in rice protoplasts at CCCC/TYCV PAMs (Y is C or T) and TATV PAMs, respectively . The editing activity of LbCas12a-RR was tested at all tested CCCC and 10/11 TYCV PAM sites between 2 and 40% (11/13 higher than 10%) and outperformed FnCas12a-RR. The RVR variants were not able to mediate editing at TATC PAM sites and while both variants showed activity at two out of three TATG PAM sites, LbCas12a-RVR outperformed FnCas12a-RVR. In rice T0 plants LbCas12a-RR showed the highest and most robust editing frequency at two tested TTCC PAM sites (93.3 and 100%), while no or low editing activity of FnCas12a-RR at all tested PAM sites was observed. Zhang et al. (2021) tested eight Cas12a orthologs in plants, which previously showed cleavage activity preferentially at TTV PAM sequences in vitro (Zetsche et al., 2020). From these, four orthologs [ErCas12a (MAD7, Inscripta, Inc.), Lb5Cas12a, BsCas12a, and Mb2Cas12a] showed high editing efficiencies at two different target sites in rice protoplasts (OsDEP1-TTTC and OsEPFL9-TTTG), which were comparable to those of LbCas12a (>30%). Also in transgenic rice lines, most of the tested Cas12a orthologs showed medium to high editing efficiencies at both target sites compared to LbCas12a.
Interestingly, Mb2Cas12a induced genome editing at 13 out of 18 tested VTTV PAM sequences with ~15% or higher efficiency. The editing efficiency of Mb2Cas12a at two more relaxed target sites (OsROC5-GTTG and OsDEP1-GTTC) was ~10% and Mb2Cas12a outperformed all other tested Cas12a orthologs. Introducing a RVR mutation into Mb2Cas12a (Mb2Cas12a-RVR) resulted in editing efficiencies from 15 to 35% at all TATV PAM sites which were higher than those from all other tested RVR variants . The RVRR variant (Mb2Cas12a-RVRR) was active at TTTV, VTTV, TATV, TYCV, CCCV, and CTCV PAM sites and the genome-wide PAM analysis revealed that all targetable sites of Mb2Cas12a-RVRR cover 22.2% of the total rice genome (SpCas9 10.5%; Zhang et al., 2021). Recently, an engineered variant of LbCas12a (impLbCas12a) was shown to cleave TNTN PAMs with higher activity (Tóth et al., 2020) and could be a promising tool in plants.

Overcoming Temperature Sensitivity of Cas12a
The above-mentioned temperature sensitivity of Cas12a nucleases is another drawback for their application in plants, because most plant transformation and cultivation processes are performed at lower temperatures (Bernabé-Orts et al., 2019;Lee et al., 2019;Malzahn et al., 2019). Malzahn et al. (2019) showed that AsCas12a, FnCas12a, and LbCas12a performed significantly better at temperatures ≥28°C compared to 22°C. Enhanced AsCas12a (enAsCas12a) showed two times higher activity at lower temperatures compared to AsCas12a in human cells , but was still outperformed by LbCas12a in vitro . Schindele and Puchta (2020) engineered, based on the findings from Kleinstiver et al. (2020), two LbCas12a variants (enLbCas12a and ttLbCas12a) and found that editing efficiencies of ttLbCas12a were 2-to 7-fold higher than those of LbCas12a at 22°C and were even higher than those of the other variants at 28°C. Interestingly, Mb2Cas12a was less sensitive to lower temperatures (22°C) compared to other Cas12 orthologs and the activity could be further improved by introducing mutations equivalent to enAsCas12a Zhang et al., 2021).

Improving cRNA Design
Cas12a presents both endoribonuclease and endonuclease activity (Fonfara et al., 2016;Creutzburg et al., 2020). Consequently, it is not sufficient to predict Cas12a's editing efficiency only based on the spacer-protospacer complementarity, but also on the efficiency of pre-crRNA maturation (Creutzburg et al., 2020). The maturation of the pre-crRNA is catalyzed by the ribonuclease domain of Cas12a after recognition of the hairpin structure formed by the repeat sequence at the 5′ end of the pre-crRNA (Fonfara et al., 2016;Creutzburg et al., 2020). The presence of surrounding nucleotides that base pair with nucleotides in the repeat sequence can result in alternative secondary structures which are not recognized by Cas12a (Creutzburg et al., 2020). While Creutzburg et al. (2020) provide valuable recommendations for crRNA design, it would be required to develop tools that predict efficient crRNAs also based on secondary structures.
While typically a crRNA of FnCas12a has a 24-nt guide sequence, Negishi et al. (2020) showed that the length of this guide sequence (18-30 nt) can positively influence the editing efficiency, while off-targets are not affected. It seems that there is an optimal length for each target sequence that must be tested individually. Tang et al. (2017) investigated the target specificity of LbCas12a by introducing double mismatch mutations into the protospacer and found that mismatches in the first 18 nt of the 23 nt of the protospacer abolished cleavage activity and mismatches at position 21-22 reduced cleavage activity by ~50%. These findings are in line with data from Zhang et al. (2021) that also confirm the high targeting specificity of several Cas12a orthologs.

Increasing Multiplexing Efficiency in Cas12a Systems
Multiplex genome editing allows the introduction of two or more precise gene mutations in one plant generation and has the potential to significantly accelerate breeding processes . Several multiplex Cas12a systems in plants have been developed, but the efficiency of high biallelic editing frequencies in all targeted genes were very low Tang et al., 2019). An extensive comparison of different multiplexed systems indicated that the editing efficiency of all tested Pol II promoter-based systems were superior to Pol III promoter-based systems . The most efficient combination was LbCas12a with a pZmUbi-HH-crRNA-HDV system, which resulted in 26/36 T 0 lines carrying biallelic mutation in all four targeted genes. The same system, in combination with Mb2Cas12a, resulted in 8/11 T 0 lines with biallelic mutations at all four sites. Mb2Cas12a-RVRR was able to edit seven targets with canonical and altered PAM sequences with editing efficiencies ranging from 5.1 to 31.1% in rice protoplasts. The power of this system was further demonstrated by simultaneously targeting 16 genomic sites across nine chromosomes in rice plants. While 20/21 T 0 lines contained at least 13 edited sites of which at least seven were biallelic edits, 11/21 T 0 lines had 15 edits and 1/21 T 0 line showed editing in all 16 targeted genes. The proposed system allows editing of 14 sites simultaneously with biallelic mutations.

Increasing HDR Efficiencies in Cas12a Systems
Cas12a is particularly suited for HDR as it induces a doublestrand break 18 nt distal of the PAM and does not destroy the target sequence (Moreno-Mateos et al., 2017). Begemann et al. (2017) first demonstrated the capability of FnCas12a for an efficient targeted insertion in rice plants (8%), which performed better than LbCas12a under comparable conditions (up to 3%).
One strategy to improve the low HDR efficiency is to optimize the template delivery and Vu et al. (2020) utilized a geminiviral DNA replicon to express LbCas12a in tomato. The HDR efficiency mediated by LbCas12a expressed from the construct containing a geminiviral replicon (~4.5%) was significantly higher than the control and similar to the CRISPR/SpCas9-based construct (~3.6%). Further results showed that at higher temperatures (31°C) the efficiency can be significantly improved (9.8%). The use of a T-DNA system containing multiple replicons further increased the levels of donor template after the transformation step and resulted in higher HDR efficiencies up to 12.79%. In Arabidopsis, the variant ttLbCas12a outperformed LbCas12a at 22°C/28°C and ttLbCas12a showed a higher HDR efficiency at 22°C compared to LbCas12 at 28°C (Merker et al., 2020).
The first identified Cas12b orthologs, Alicyclobacillus acidoterrestris Cas12b (AacCas12b) and Bacillus thermoamylovorans Cas12b (BthCas12b) required high temperatures (48-50°C) for optimal activity in vitro (Shmakov et al., 2015). Wang et al. (2020d) used an AacCas12b-based system to edit the GhCLA gene in cotton, which as a thermophilic plant is able to survive higher temperatures. From three different temperatures (42°C, 45°C, and 48°C) and eight incubation times (ranging from 6 h to 15 days), the highest editing efficiency was achieved at 45°C after 4 days (Wang et al., 2020d).
The discovery of Alicyclobacillus acidiphilus Cas12b (AaCas12b; Teng et al., 2018) and later Bacillus hisashii Cas12b (BhCas12b; Strecker et al., 2019) which showed activity at lower temperatures, enabled the broader use of Cas12b for plant genome editing. In a comparative study, Ming et al. (2020) showed that AaCas12b has an editing efficiency of ~10% in rice protoplasts, while AacCas12b (~5%) and BthCas12b (very low) were less efficient. All three tested endonucleases generated 4-14 bp deletions located 12-24 bp distal to the PAM sites. Testing different PAM requirements for AaCas12b and AacCas12b revealed that both are capable of editing VTTV PAM sites with a preference for ATTV and GTTG sites. AaCas12b was further shown to be very sensitive to 2-pb mismatches in the protospacer and a protospacer length of ≤18 bp abolished the editing efficiency. 54.2% of the tested T 0 lines were edited by AaCas12b and of these 46.2% of the T 0 lines carried biallelic mutations.
Cas12b was further used in a multiplexed system with dual Pol II promoters and a hammerhead-sgRNA-hepatitis delta virus array to target three genes simultaneously (Ming et al., 2020). Compared to AacCas12b, AaCas12b performed better at two out of three target sites, but showed no activity at one site. At the two edited sites, 29.2 and 45.8% of the tested T 0 line carried biallelic mutations.

CRISPR/Cas12f SYSTEM
Recently, two Class 2 miniature type V-F Cas nucleases, SpCas12f1 (497 aa) and AsCas12f1 (422 aa), have been used for genome editing in maize (Bigelyte et al., 2021). With their compact size compared to Cas9 and Cas12 endonucleases, Cas12f nucleases could be a good choice when using viral-based delivery systems. Furthermore, their dependence on higher temperatures for optimal activity (45-55°C) could help to better control off-target effects in some plant species (Bigelyte et al., 2021).

FINAL REMARKS
There is an urgent need to improve crop sustainability and the use of gene editing technologies will not only to increase our ability to modify traits already present in the crop, but also introduce new traits that so far have only been achieved with regular transgenesis, such as tolerance to herbicides (Dong et al., 2021) and insects (Tyagi et al., 2020). The use of deep learning technologies certainly will help our ability to engineer new proteins, with not only higher specificity, but also higher efficiency. In this sense, there is a plethora of organisms that have CRISPR-like system, including extremophyles, that remain to be explored (Salwan et al., 2020). In Table 1, we have compiled components that can be manipulated in order to increase the editing efficiency of CRISPR-based methods. It is worth mentioning that high throughput methods to screen new versions of all the editing components will improve our ability to discover new improvements in gRNA and protein structures, new editing features, among others.
Genome editing technologies are a new addition to the toolbox of genetic engineering that so far has relied extensively in the addition of DNA sequences from other organisms. In spite of all the scientific evidence indicating genetically modified organisms (GMOs) are safe for human consumption, skepticism still remains among consumers, with 46.7% having a negative view on GM food in China (Cui and Shoemaker, 2018), 57% in the United States (Pew Research Center, 2015), and 34.7% in Canada (Charlebois et al., 2019). Although increased knowledge from qualified sources are known to move this trend towards a positive view on GMOs (Wunderlich and Gatto, 2015;Hunt and Wald, 2021), there is evidence that precision genome editing, without the incorporation of recombinant DNA in plant genomes will have a higher acceptance than GMOs because no exogenous DNA would be introduced (Muringai et al., 2020).
Additionally, after all components of the gene-editing system have been integrated into the host genome, chances of off-target indels increase with time (Lawrenson et al., 2015). The capability of producing transgenic plants without the integration of foreign DNA would alleviate the regulatory concerns in order to commercialize new varieties (Jones, 2015).
The delivery of reagents for genome editing without incorporating DNA into the genome results in mutations identical to naturally occurring events. This approach, also called DNA-free genome editing, has been reported in protoplast from several plant species with a mutagenesis frequency of up to 46% in Arabidopsis, tobacco, rice, and lettuce plants (Woo et al., 2015;Kim et al., 2017). However, regeneration of plants from transformed protoplasts, especially monocots, can be challenging. To overcome this, Zhang et al. (2016) used biolistics to introduce Cas9 mRNA and gRNA into immature wheat embryos that were allowed to regenerate plants without selection. This strategy resulted in genome editing at a ratio of 1.1 plants per 100 bombarded embryos. In addition, transformed plants were transgene-free, since no DNA was used in the process.
The use of DNA-free CRISPR/Cas techniques in crop plants is particularly interesting due to the regulatory issues involving the release of commercial products from genetically modified organisms. Recently, the USDA-APHIS confirmed that plants with phenotypes developed through genome editing without foreign DNA inserted in the genome will not require regulatory approval for commercial production in the United States (Waltz, 2016). Products to be commercialized will be evaluated on a case-by-case basis, but certainly will result in a remarkable reduction of regulatory costs for cultivar development. In Brazil, genome editing technologies will not be regulated as GMOs, on a case-by-case analysis (CTNBio, 2018). On the contrary, in Europe, the decision on regulating precision breeding technologies as GMOs has created concerns on the continent's economy (Hjort et al., 2021). On a global scale, local authorities are still discussing regulatory frameworks for genome-edited plants in other countries. In the meantime, delivery methods have been developed that introduce targeted mutations without any transgenic footprint of the genome editing tools . This transformative technology will allow a paradigm

Component
Strategies Impact Future developments Guide RNA/ pegRNA -New computational algorithms allow the design of gRNA sequences with high affinity and specificity to the target sites, as well as higher affinity with the nuclease -Larger gRNAs (pegRNA) provide an HR template when combined with nuclease/ reverse transcriptases -Reduced binding to non-target sites -Enhanced binding to target sites -Higher nuclease activity -Improved algorithms to decipher the best gRNA sequence -Improve prediction for best optimal combination of PBS and template length Nucleases -Protein engineering to produce evolved nucleases -Association with proteins or domains with new features -Discovery of new nucleases -Higher DNA cutting activity -Increased affinity to gRNA -Broader target selection, allowing to edit the entire genome (PAM-less nucleases) -New functions by fusion with other proteins, such as deaminases and reverse transcriptase -Improved nuclear localization -Higher activity at temperatures used in plant cell tissue culture -Computational tools to create new protein structures -Identification of proteins from a broader range of organisms, with new abilities -Increase the editing power in order to decrease chimera or heterologous plants Deaminase -Protein engineering -Fusions with new proteins or protein domains -New editing capacities, such as transversions -Higher predictability of the edited base -Discovery of new deaminases or other protein modules that can improve the editing process HR templates -Increased template availability by using synthetic DNA more resistant to nucleases, higher copy number using virus replicons, and co-localization of the HR template to the RNP complex by fusions with the gRNA -Higher probability of HR, therefore, increased desired editing rate -Better understanding of key cellular components involved in HR enhancement and also in HR inhibition -New strategies to increase HR stability and proximity to the editing site Computational tools -Identification of new nuclease variants with increased specificity and efficiency -Increased accuracy of both target and offtarget predictions -New experiments designed to gather data to feed the algorithms -New algorithms and publicly available servers -Decipher the role of epigenomics and chromatin structure in the gRNA binding and nuclease activity Plant tissue culture -Protocols that allow the selection of edited cells without transgenes, due to mutations in genes encoding herbicide tolerance -GMO-free edited plants -Multiplex editions, targeting the herbicide tolerance gene and other genes related to other traits Proteins components in general -Codon optimization -Promoters and terminators with higher activity -Enhanced levels of the proteins involved in gene editing -Identification of new promoters and terminators with higher activity in the cell types and initial stages of the transformation