Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci., 09 January 2026

Sec. Plant Biotechnology

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1737513

This article is part of the Research TopicGene Editing for Biofortification: Innovations and ApplicationsView all 6 articles

CRISPR/Cas9-mediated inactivation of the soybean agglutinin Le1 gene to improve grain quality

Joo Matheus Kafer,João Matheus Kafer1,2Alessandra Koltun,Alessandra Koltun2,3Rodrigo Thibes Hoshino,Rodrigo Thibes Hoshino1,4Larissa Girotto,Larissa Girotto1,5Cesar Augusto SilveiraCesar Augusto Silveira1Silvana Regina Rockenbach MarinSilvana Regina Rockenbach Marin1Elibio Leopoldo RechElibio Leopoldo Rech6Alexandre Lima Nepomuceno,Alexandre Lima Nepomuceno1,2Liliane Marcia Mertz-Henning,*Liliane Marcia Mertz-Henning1,2*
  • 1Embrapa Soja, Londrina, Paraná, Brazil
  • 2Department of General Biology, State University of Londrina, Londrina, Paraná, Brazil
  • 3Department of Agronomy, State University of Maringá, Maringá, Paraná, Brazil
  • 4Fundação de Apoio à Pesquisa e ao Desenvolvimento (FAPED), Sete Lagoas, Minas Gerais, Brazil
  • 5Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Brasília, Distrito Federal, Brazil
  • 6Embrapa Recursos Genéticos e Biotecnologia, Brasília, Distrito Federal, Brazil

Introduction: Soybean agglutinin (SBA) is a major antinutritional factor in soybean seeds, reducing digestibility in monogastric animals. The Le1 gene encodes the primary lectin accumulated in seeds. Genome editing offers a direct strategy to eliminate this factor in elite cultivars.

Methods: Two gRNAs targeting Le1 were inserted into a CRISPR/Cas9 binary vector and used for Agrobacterium tumefaciens–mediated transformation of the soybean cultivar BRS 537. Edited plants were screened by PCR, Sanger sequencing, protein electrophoresis (SDS-PAGE), hemagglutination assays, and segregating generations were tested to identify transgene-free progeny. Agronomic traits were evaluated under field conditions.

Results: Twenty transformation events were generated, with an editing efficiency of 10%. Event AF12-13-1 carried a 4-bp deletion producing a truncated, unstable lectin protein. SDS-PAGE confirmed the absence of the ~30 kDa SBA band, and hemagglutination assays showed complete loss of lectin activity. Transgene-free T2 plants lacking Bar, Cas9, and AtU6 sequences were identified. Agronomic traits—including yield and thousand-seed weight—were comparable to the wild-type cultivar.

Discussion: CRISPR/Cas9 editing of Le1 effectively eliminated SBA accumulation without compromising key agronomic traits. The resulting low-lectin soybean lines represent a promising approach to improve digestibility and feed efficiency for monogastric animals.

1 Introduction

Soybean (Glycine max Merril.) is a major legume crop, highly valued for its seeds, which are rich in protein (∼40%), oil (∼20%), and carbohydrates (26–30%) (George et al., 2008; Choi et al., 2022; Padalkar et al., 2023). Soybean and its derivatives are widely used across the food, feed, pharmaceutical, and industrial sectors (Singh et al., 2008; Rizzo et al., 2018). Although soybean meal represents a major global protein source for animal feed, soybean proteins also play important roles in human nutrition and functional food applications, owing to their physicochemical, bioactive, and technological properties (Messina, 2016). Even in human nutrition, soybean is regarded as a functional food, owing to its medicinal and health-promoting properties, which include a range of bioactive compounds such as isoflavones, flavonols, proteins, and lipids (Padalkar et al., 2023).

Despite its nutritional and medicinal value, soybean contains antinutritional factors that limit its use in both human consumption and animal feed, as they impair nutrient digestibility and absorption in the gastrointestinal tract. Among the many antinutritional factors, the most relevant are the Kunitz trypsin inhibitor (KTI) and soybean agglutinins, or lectins (Schmidt et al., 2015; Choi et al., 2022; Padalkar et al., 2023).

Structurally, lectins possess a characteristic domain and are classified according to their molecular structure and carbohydrate-binding properties. In plants, lectins perform different functions depending on their subcellular localization. Plant lectins can participate in defense by acting as signaling or recognition molecules in nucleocytoplasmic compartments, where they respond to biotic stresses (Chrispeels and Raikhel, 1991; Lannoo and Van Damme, 2014; De Coninck and Van Damme, 2022). In contrast, seed lectins, which accumulate predominantly in protein storage vacuoles, act mainly as storage proteins rather than components of signaling-driven defense mechanisms (De Coninck and Van Damme, 2021, 2022).

Ingestion of tissues from storage organs, such as seeds, releases lectins into the animal intestine, where they bind to glycan receptors on the intestinal wall. This lectin–carbohydrate interaction alters intestinal epithelial morphology and impairs nutrient digestion and absorption (Alves de Moraes et al., 2006; George et al., 2008; Schmidt et al., 2015). Raw soybean seeds contain 10–20 g.kg−1 of lectin, with concentrations above 7 g.kg−1 considered detrimental to digestibility and unsuitable for direct consumption (George et al., 2008), limiting soybean use both as a source of human food and in animal feed formulations, including its use as a forage crop (Padalkar et al., 2023). Processing methods can inactivate these antinutritional factors, but they are costly and may reduce the nutritional content of soybean. For these reasons, genetic inactivation of antinutritional factors has become an attractive strategy.

In soybean, four transcribed legume lectin genes have been identified: the vegetative Le3 (SVL/LE3), the pseudogene Le2, the seed-specific Le1 (soybean lectin, also known as phytohemagglutinin or agglutinin), and Le4. In Arabidopsis thaliana, histochemical analyses showed that the Le2 promoter drives expression in all tissues, including seeds, but not roots. Comparing the Le1 promoter with Le3 and Le4 promoters revealed that the Le1 promoter contains a higher number of seed-specific motifs (Chragh et al., 2015; Shamimuzzaman and Vodkin, 2018). The Le1 gene encodes soybean lectin, a 120 kDa glycoprotein composed of four similar 30 kDa subunits. In recessive lectin genotypes (Le1), a 3.5 kb insertion element (Tgm1) was identified, which prevents transcription and, consequently, lectin accumulation in seeds. Furthermore, removal of Tgm1 from the mutant allele restores the gene’s ability to be transcribed and expressed during seed development (Goldberg et al., 1983; Vodkin et al., 1983; Okamuro and Goldberg, 1992). Therefore, Le1 is responsible for the accumulation of this antinutritional factor in seeds and represents a highly valued target for strategies aiming to reduce lectin content through genetic approaches.

Developing genotypes that do not produce lectin using genome editing technology offers advantages over conventional breeding, as it can be applied directly to elite cultivars without the need for crosses to fix the trait in the target genotype. Accordingly, the aim of this study is to reduce lectin content in soybean seeds by knocking out the Le1 gene using CRISPR/Cas9 technology in an elite soybean variety adapted to tropical climates, aiming to improve digestibility mainly for monogastric animals.

2 Materials and methods

2.1 Prospection of the Le1 gene in soybean and in silico characterization

The Le1 gene sequence was retrieved from the Wm82.a2.v1 and Wm82.a4.v1 versions of the Glycine max reference genome, available on Phytozome. Orthologous and paralogous genes were identified based on amino acid sequences from the PF00139 family (true lectins with the exclusive Lectin_legB domain), also obtained from Phytozome v13. Proteins containing additional domains or belonging to the PF03388 family (putative lectins) were excluded. Sequences were aligned using MUSCLE (Edgar, 2004), and a phylogenetic tree was constructed with iTOL (Letunic and Bork, 2024). Physicochemical similarities among proteins were analyzed using Persephone (https://web.persephonesoft.com/). Gene expression was assessed via the RNA-Seq Atlas (https://soyatlas.venanciogroup.uenf.br/), with comparisons made between homologous genes.

2.2 CRISPR vector construction and soybean transformation

gRNAs targeting the Le1 gene (Glyma.02G012600) were designed using CRISPRDirect (Naito et al., 2015), and the most specific guide with no predicted off-targets in the guide + PAM region was selected. The modular vector C034p7ioR-35SCasWT (DNA Cloning Service, Hamburg, Germany), which contains two BsmBI sites for gRNA insertion, was employed. Cohesive ends for cloning were designed as follows: oligo 1 – 5’ TGATTG N(20) 3’ and oligo 2 – 5’ AAAC N(20) CA 3’, with an additional 5’ guanine to optimize transcription from the U6 promoter. The vector expresses Cas9 under the CaMV 35S promoter, the Bar gene conferring resistance to ammonium glufosinate, and the gRNA under the control of the Arabidopsis thaliana U6 promoter. Stable transformation of the soybean cultivar BRS 537 was conducted via Agrobacterium tumefaciens following Paz et al., 2005with minor modifications. Cotyledonary node explants from sterilized, germinated seeds were gently wounded with a sterile microbrush and infected with Agrobacterium cultures (OD600 ≈ 0.6). After a 5-day co-cultivation, explants were transferred to shoot induction and elongation media supplemented with glufosinate ammonium for selection. Regenerated shoots were rooted under selection, acclimated, and screened by PCR to confirm the presence of transgenic sequences.

2.3 Polymerase chain reaction for detection of transgenes

Genomic DNA was extracted from leaves using a modified CTAB method (Doyle and Doyle, 1987). DNA quality and concentration were assessed by spectrophotometry (260/280 nm) and 1% agarose gel electrophoresis. The presence of transgenic sequences was verified by conventional PCR using primers specific for the Bar and Cas9 genes, and the AtU6 promoter. Amplified products were separated on 1.5% agarose gels containing ethidium bromide, visualized under a UV transilluminator, and documented by photodocumentation. The presence of bands at expected sizes was considered indicative of transgenesis. Primer sequences and the corresponding expected PCR product sizes are detailed in Supplementary Table 1.

2.4 Sanger sequencing

The Le1 gene was amplified by polymerase chain reaction (PCR) using primers flanking the target region of interest. Reactions were carried out with previously extracted genomic DNA in a mixture containing appropriate buffer, magnesium chloride, deoxynucleotide triphosphates (dNTPs), primers, and a thermostable DNA polymerase. Amplification was performed in a thermocycler under the following conditions: initial denaturation at 94°C for 3 minutes, followed by 35 cycles of 94°C for 30 seconds, annealing at 58°C for 30 seconds, and extension at 72°C for 1 minute, with a final extension at 72°C for 5 minutes. The amplified product was verified by agarose gel electrophoresis and subsequently purified using ExoSAP-IT™ to remove residual primers and dNTPs. Sequencing was performed using the BigDye™ Terminator v3.1 kit, followed by purification via ethanol/EDTA precipitation. Samples were resuspended in Hi-Di™ formamide and subjected to capillary electrophoresis on an ABI Prism 3500 automated sequencer (Applied Biosystems). Electropherograms were analyzed using the ICE platform (Synthego).

2.5 In silico protein structure analysis

For in silico protein analysis, the wild-type and mutant Le1 gene sequences were translated to generate the corresponding protein sequences. Both protein sequences were assessed using the ExPASy platform (https://www.expasy.org/) to evaluate protein stability (Gasteiger et al., 2005). Subsequently, protein structure was analyzed based on crystallographic data, and Ramachandran plots were generated using Swiss-Model (Waterhouse et al., 2024). Finally, carbohydrate-binding capacity of the protein was evaluated using PyMOL (PyMOL, 2025) with the SBF1 crystallographic structure as reference (Olsen et al., 1997).

2.6 Polyacrylamide gel electrophoresis analysis

Sample preparation for SDS-PAGE followed Alves de Moraes et al. (2006), with adaptations. Protein analysis was conducted by denaturing polyacrylamide gel electrophoresis in the presence of sodium dodecyl sulfate (SDS-PAGE), according to the method described by Laemmli (1970). The separation gel was prepared as a gradient (20–8% acrylamide) at pH 8.8, with a 4% stacking gel at pH 6.8. Electrophoresis was performed in Tris-glycine-SDS buffer (25 mM Tris, 192 mM glycine, 0.1% SDS, pH 8.3) at a constant voltage of 90 V until complete band separation. Gels were stained with Coomassie Brilliant Blue R-250 (0.1% in methanol:acetic acid:water, 45:10:45 v/v/v) for 1 hour and subsequently destained in methanol:acetic acid:water (30:10:60 v/v/v) until the background was cleared. Bands were visualized on a white light table, and molecular weights were estimated by comparison to a pre-stained protein marker.

2.7 Hemagglutination assay

The hemagglutination assay was performed according to (Adamcová et al., 2021), with adaptations. Human blood type A, containing the GalNAc carbohydrate for lectin detection, was used. Blood was collected from João Matheus Kafer and Human blood type A was collected using sodium citrate as anticoagulant following standard procedures (WHO, 2010). To isolate erythrocytes, 2 mL of human blood was centrifuged at 500 g, and the lymphocyte and plasma fractions were carefully removed using a Pasteur pipette. The blood was then washed three times with 0.1% PBS to prepare a 2% erythrocyte suspension in PBS. This suspension was treated with 0.05% trypsin freshly prepared in PBS, then washed three times with PBS for use in serial dilutions with the samples.

Sample preparation involved the grinding seeds from the AF12-13–1 plant, BRS 537 (background), and black bean (positive control), which were stored in a refrigerator until use. To extract soluble lectin, 5 g of each sample was diluted in 30 mL PBS and agitated at 250 RPM for 1 hour, followed by overnight storage at 4°C. Subsequently, 5 mL of the protein extract was centrifuged at 2850 g for 5 minutes, and the supernatant was recovered and centrifuged again at the same speed. This extract was used for preparing serial dilutions of the samples.

The AF12-13-1, BRS 537, and black bean samples underwent 10 serial dilutions (1/2, 1/4, 1/8, 1/16, 1/32, 1/64, 1/128, 1/256, 1/512, and 1/1024). Dilutions were distributed into reaction plates with an equal proportion of trypsin-treated erythrocytes in duplicate. Hemagglutination was observed visually, with a magnifying glass, and under a microscope. Hemagglutination was characterized by deformation of erythrocytes, exhibiting irregular shapes, whereas absence of hemagglutination was indicated by erythrocytes precipitating as a dot at the bottom of the plate.

2.8 Evaluation of agronomic traits

To evaluate agronomic traits, field trials were conducted at the EMBRAPA headquarters (Rodovia Carlos João Strass, s/n°, Acesso Orlando Amaral, Distrito da Warta; Coordinates: 23°11’6”S, 51°10’32”W). Plants were grown under standard field conditions with appropriate fertilization. The experiment was arranged in a randomized block design (DBC) with four replications, and each plot had an area of 8 m².

The evaluated traits included plant height (PH), thousand-seed weight (TSW), and productivity (PROD). PH was measured from the soil surface to the apex at physiological maturity. TSW was obtained from the mass of 1000 dried seeds. Productivity (PROD) was expressed as grain yield in kg·ha−1, calculated from the total seed weight harvested per plot and normalized to planting density. Oil and protein contents were quantified using Fourier transform near-infrared spectroscopy (FT-NIR; Antaris II, ThermoFisher Scientific, Waltham, MA, USA), following the protocol described by Mertz-Henning et al. (2017).

2.9 Statistical analysis

Statistical analyses were performed using RStudio version 4.4.3 (RStudio, 2020). Tukey’s test was applied at a 5% significance level.

3 Results

3.1 Identification of the Le1 gene in soybean

Based on the in silico characterization of genes from the PF00139 family – which comprises the true lectins containing the Lectin_legB domain – 19 genes were identified as encoding true lectins with this single domain, distributed across 10 chromosomes (Figure 1A). Gene expression data from the Soybean Expression Atlas (https://soyatlas.venanciogroup.uenf.br/), using genome version Wm82.a4.v1, revealed their expression patterns across various tissues. The Le1 gene (Glyma.02G012600), which encodes soybean seed agglutinin, is expressed mainly in the embryo derived tissues (Figure 1B). The gene most closely related to Le1, Glyma.02G156800, shares 62% structural similarity, but shows no detectable expression in seed tissues, as its expression is restricted to flowers and hypocotyls. These results suggest that Le1 has a seed tissue-specific function and occurs as a single-copy gene.

Figure 1
Phylogenetic tree and heatmap analysis of gene expression. Left (A): Circular phylogenetic tree showing the relationships among various Glyma genes, with Glyma.02G012600 highlighted. Right (B): Heatmap illustrating expression levels of the same genes across different plant tissues, with color-coded intensity from purple to green indicating low to high expression.

Figure 1. Characterization of soybean lectin genes from the PF00139 family. (A) Phylogenetic tree based on amino acid sequences. (B) Heatmap showing the average expression levels of PF00139 soybean lectin genes across different tissues.

3.2 Development of the Le1 gene knockout vector

Vector cloning was performed following the protocol described by Koltun et al. (2023). The gRNAs were inserted into the modular vector C034p7ioR-35SCasWToi using type IIS restriction enzymes (BsmBI). Cas9 nuclease expression was driven by the CaMV promoter. The Bar gene, under the control of the CaMV 35S promoter, was used to select transformed explants by conferring resistance to the herbicide glufosinate ammonium (Figure 2A). The vector includes two gRNAs targeting Le1 gene sequence, as shown in Figure 2B.

Figure 2
Diagram showing a gene construct (A) and gRNA targeting (B). Part A shows various components from left to right: CaMV, BAR, E9-T, CaMV, Cas9-WT, NOS-T, At-U6, gRNA 1, gRNA 2. Part B displays the Glyma.02G012600-Le1 gene with two gRNA targets: gRNA 1 (sequence 5’ACTGGTGCTACTGACCAGCAAGG3’) and gRNA 2 (sequence 3’GGTGTTGGCTTAGTGTCAAT5’), showing positions of the PAM sequences. The section between gRNAs spans 858 base pairs.

Figure 2. Vector map used for CRISPR/Cas9-mediated editing of the Le1 gene (A). gRNA target sites within Le1 (B).

Insertion of the gRNAs was confirmed by PCR using one primer annealing to the AtU6 promoter and another complementary to the gRNA sequence. Additionally, the PCR products were validated by Sanger sequencing. The vector was then transformed into Agrobacterium tumefaciens for use in soybean stable transformation.

3.3 Plant transformation and gene editing

A total of 285 soybean cotyledons were transformed via Agrobacterium tumefaciens. Of these, 20 putative transformant plants regenerated under selection and were transferred to acclimatization, where they completed their life cycle. Among them, 10 were confirmed as successfully transformed by PCR, using primers that detect the presence of the vector through amplification of the U6 promoter region, generating a 564 bp fragment (Figure 3A). Positive events were sequenced at the target gene region to confirm successful gene edition. Editing of the Le1 gene was confirmed in two events, although only one of the two gRNAs was effective. Plant AF12-13, which showed the highest editing frequency, was selected for generation advancement (Figure 3B).

Figure 3
Gel electrophoresis image labeled “A” showing DNA bands with red arrowheads indicating specific bands and asterisks highlighting notable lanes. The gel includes lanes 1 to 20 and markers “+”, “WT”, “-”, and “M”. Below, a bar graph labeled “B” shows an indel spectrum with percentages of sequences on the vertical axis and deletions/insertions on the horizontal axis. Red and black bars represent different statistical significances, with total efficiency marked as 47.1%.

Figure 3. PCR detection of transformation plants confirming the presence of the CRISPR/Cas machinery using a primer annealing to the U6 promoter region (A). Red arrows indicated positive transgenic plants; (*) indicate edited plants; (+) positive control; (WT) wild type; (–) negative control. Sanger sequencing of event AF12–13 analyzed with TIDE software to confirm Le1 gene editing. (B) Target sites of gRNA1 and gRNA2 within the Le1 gene. The schematic representation shows the exon–intron structure and the relative positions of the two CRISPR/Cas9 cleavage sites, including the PAM sequences. Both gRNAs target exon regions to promote a frameshift-inducing deletion in the coding sequence of Le1.

Based on sequence analysis using the TIDE software, the editing frequency in the T0 plant was 47.1%. Detected mutations ranged from –7 to +1 base pairs at the Cas9 cleavage site, with the most frequent mutation being a 4-bp deletion, accounting for 20.2% of all observed edits (Figure 3B). These results indicate transformation and editing efficiencies of 7.02% and 10%, respectively.

3.4 Editing and confirmation of mutants

Plant AF12-13 (T0) produced a total of 93 T1 seeds, which were planted in the greenhouse to advance generation and identify stable plants. T1 segregating plants were screened for the presence of the transgenic construct to identify edited plants free of exogenous DNA. Eight plants lacking the 564 bp vector fragment – detected using a primer annealing to the U6 promoter – were identified, five of them were selected for further confirmation (Figure 4).

Figure 4
Gel electrophoresis image showing multiple lanes with DNA bands. Red arrows highlight specific bands. Labels include “+”, “WT”, “-”, “M”, and “564 bp”, indicating controls and size marker.

Figure 4. PCR amplification of fragments used for transgene detection in event AF12-13. Red arrows indicate T1 generation plants testing negative for the vector fragment.

Plants considered transgene-free, as they showed no presence of the vector, were selected for sequencing. Sanger sequencing revealed fixation of a 4-bp deletion (GACA) at the guide RNA target site in T1 plant AF12-13-1 (Figures 5A–C). A double verification was performed via PCR using primers amplifying different regions of the vector (U6 promoter, Cas9 nuclease, and Bar gene) (Figure 6) to confirm the absence of the CRISPR machinery. This plant was advanced to the next generation, and the mutation was confirmed in the T2 generation (Figure 5D). Sequencing of T2 AF12–13 plants confirmed that the 4-nucleotide deletion (GACA) at positions 388–391 of the Le1 gene was stably inherited.

Figure 5
Genomic analysis results showing a CRISPR experiment. Section A indicates a successful status with specific guide and PAM sequences. Sections B, C, and D display chromatograms of DNA sequences, showing colored peaks corresponding to nucleotides. Section B highlights an indel event and PAM site. Data is from Synthego's ICE tool.

Figure 5. Sanger sequencing of events analyzed after the T0 generation, processed with ICE Synthego software (A). Electropherogram of the wild-type plant highlighting the gRNA and PAM sequences (B). Sequencing of T1 event AF12-13-1 (C) and T2 event AF12-13-1 (D).

Figure 6
Gel electrophoresis results displayed in three panels with labeled DNA bands. Top: Bands marked U6-564 base pairs, Bar-190 base pairs, and Le1-858 base pairs. Middle: Bands marked Cas9-F1 at 845 base pairs and Cas9-F2 at 629 base pairs. Bottom: Bands marked Bar-Cas9-F1 at 554 and 776 base pairs, with U6-564 base pairs indicated. Ladder markers are present in each panel.

Figure 6. PCR amplification of fragments from DNA of 5 plants from event AF12-13. The empty vector was used as a positive control for the reaction, and amplification of the Le1 gene served as a DNA quality control (+). Red arrows indicate fragments amplified in the positive control (+).

3.5 In silico characterization of mutants

The effects of the deletion were analyzed using the Muscle tool (https://www.ebi.ac.uk/Tools/msa/muscle/), which revealed a frameshift starting at the mutation site (Figure 6C). This frameshift introduced a premature stop codon, reducing the total protein length from 286 to 191 amino acids, including the 32-amino-acid signal peptide (Figure 7). Excluding the signal peptide, the wild-type protein contains 253 amino acids, whereas the mutated protein comprises 159 amino acids. This change results in protein instability, as indicated by ProtParam analysis (https://web.expasy.org/protparam/), which showed an increase in the instability index from 34.75 to 41.21, classifying the mutated protein as unstable (Guruprasad et al., 1990; Gasteiger et al., 2005).

Figure 7
Diagram showing a sequence alignment and secondary structure prediction of a protein. Panel A displays the sequence alignment with a color-coded indication of insertions, deletions, and contributions. Panels B and C depict the secondary structure prediction with amino acids labeled in circles. Green-shaded N indicates specific residues, and red arrows point to significant sites.

Figure 7. Comparison of Le1 sequences between the BRS 537 wild type and genotype AF12-13-1. (A) Sanger sequencing of the Le1 gene in the AF12-13–1 genotype. (B) Amino acid sequence of the Le1 protein in the wild type. (C) Amino acid sequence of the Le1 protein in the AF12-13–1 genotype.

Consequently, the molecular weight of the mutated protein decreased to 17.4 kDa, compared to 30 kDa in the wild type. The composition of charged residues also changed: the mutated protein contains 10 negatively charged and 16 positively charged amino acids, whereas the wild-type protein has 26 negatively and 21 positively charged residues. Both proteins share an identical amino acid sequence up to position 97.

Using the Swiss-Model prediction tool, the mutated protein exhibited 43.17% identity with the proposed model, corresponding to the bean phytohemagglutinin. This divergence is likely due to the frameshift starting at amino acid 97. The effect is further illustrated in the Ramachandran plot, which shows the torsion angles of amino acids and their suitability for forming secondary and tertiary structures (Figure 8).

Figure 8
Panel A shows a Ramachandran plot with clusters of blue dots within green shaded regions, representing dihedral angles. Panel B presents a three-dimensional protein structure in blue ribbons, displaying symmetry. Panel C features another Ramachandran plot with red, orange, and purple dots clustered in shaded green areas. Panel D shows two protein structures in a linear arrangement with orange-to-purple gradient ribbons, indicating structural alignment.

Figure 8. Structural comparison between the wild-type agglutinin and the mutated protein of AF12-13-1. (A) Ramachandran plot of the wild-type protein. (B) Crystallographic structure of the wild-type soybean agglutinin protein. (C) Ramachandran plot of the mutated protein. (D) Predicted crystallographic structure of the mutated protein from genotype AF12-13-1.

In Figures 8A, B, the wild-type protein structure shows a significant clustering of points around ϕ ≈ −60° and ψ ≈ −50°, indicating amino acids in an α-helix conformation. In Figures 8C, D, the mutated protein displays more residues outside this region, suggesting lower protein stability. Likewise, in the β-sheet region between ϕ ≈ −120° to −150° and ψ ≈ +100° to +150°, the mutated protein exhibits greater dispersion of points. Additionally, the mutated protein has a higher number of amino acids outside allowed regions, indicating reduced conformational stability.

One of the main features of SBA as a lectin is its specific and reversible binding to carbohydrates, classifying it as a glycoprotein. Glycosylation enhances protein stability and ensures proper function. Therefore, we evaluated event AF12-13–1 based on its carbohydrate-binding capacity. SBA binds to the carbohydrate GalNAc through amino acids Ala-86, Asp-88, Ala-105, Phe-128, Asn-130, Leu-214, Asp-215, and Ile-216 (Figure 9A). The mutated protein from plant AF12-13–1 shows significant changes in the carbohydrate-binding site. While Ala-86 and Asp-88 remain, Ala-105, Phe-128, and Asn-130 are replaced by Val, Thr, and Gly, respectively. Additionally, Leu-214, Asp-215, and Ile-216 are absent due to the premature stop codon at position 191 (Figure 9B).

Figure 9
Protein-ligand interaction diagrams with four panels labeled A to D. Panels A and C display the three-dimensional structure of a protein in green ribbon form, with purple and green spheres representing ligands. Panels B and D show close-ups of interactions between the protein and ligand, highlighting amino acid residues such as ASP-215, ILE-216, PHE-128, and GLY-130. Pink structures indicate the ligand binding location with yellow dotted lines showing interaction points.

Figure 9. Structural model of the wild-type Le1 protein compared to the mutated protein. (A) Protomer of the wild-type protein from cultivar BRS 537. (B) Interactions with N-acetylglucosamine carbohydrate in the wild-type protein. (C) Protomer of the AF12-13–1 Le1. (D) Interactions with Nacetylglucosamine carbohydrate in the protein from AF12-13-1.

3.6 Denaturing protein gel analysis

To verify the presence of the protein in plant AF12-13-1, protein extracts were prepared from the BRS 537 (wild type) and AF12-13–1 genotypes, using black bean as a control. The results are shown in Figure 10: both the black bean and the wild-type BRS 537 genotype displayed a 30 kDa band corresponding to the lectin, while this band was absent in the AF12-13–1 genotype. This demonstrates that the frameshift mutation at positions 388–391 reduced the protein size.

Figure 10
SDS-PAGE gel showing three lanes labeled ‘Black bean,’ ‘BRS 537,’ and ‘AF12-13-1.’ In the black bean lane, three protein bands labeled A, B, and C serve as natural molecular weight markers. A fourth band labeled D, corresponding to soybean agglutinin (~30 kDa), is visible in the black bean and BRS 537 lanes. The AF12-13-1 lane lacks band D, indicating the absence of the full-length lectin.

Figure 10. Polyacrylamide gel (SDS-PAGE) using protein extracts from genotypes BRS 537, AF12-13-1, and black bean as control. (A) 47 kDa Phaseolin (α-type). (B) 31 kDa PHA-E and PHA-L (bean lectins). (C) 25 kDa Phaseolin (β-type). (D) 30 kDa soybean agglutinin, with the band absent in genotype AF12-13-1.

3.7 Hemagglutination assay

To assess the biological activity of SBA in carbohydrate binding and quaternary structure formation, a hemagglutination test was performed using event AF12-13-1, the wild-type cultivar BRS 537, and black bean as a positive control. Human blood type A, which contains the SBA-binding carbohydrate GalNAc, was used. In Figure 11, the absence of hemagglutination is observed for plant AF12-13–1 at all dilutions, demonstrating that the protein does not form the quaternary structure.

Figure 11
Photograph of a 96-well hemagglutination plate comparing serial dilutions of seed extracts from three samples: black bean (positive control), BRS 537 (wild type), and AF12-13-1 (edited genotype). Wells for the black bean and wild type show visible hemagglutination (red cell clumping) at low dilutions, appearing as diffuse, settled material across the well. Wells for AF12-13-1 show no hemagglutination at any dilution, appearing as a compact red blood cell button at the center of each well, indicating absence of lectin activity. Plate labels and dilution series are visible at the edges for reference.

Figure 11. Hemagglutination analysis of protein extracts from soybean genotypes AF12-13-1 (edited genotype), BRS 537 (wild type), and Black Bean (positive control). (A) Naked-eye view of microplate wells at 1/16 and 1/32 dilutions. (B) Naked-eye view of microplate wells at 1/512 and 1/1024 dilutions. (C–E) Magnified microplate views at 1/16 and 1/32 dilutions for genotypes AF12-13-1, BRS 537, and Black Bean, respectively. (F–H) Magnified microplate views at 1/512 and 1/1024 dilutions for genotypes AF12-13-1, BRS 537, and Black Bean, respectively. (I) Edited genotype and (J) wild type: microscope views at 1/32 dilution.

3.8 Agronomic evaluation

For the agronomic traits evaluated, including plant height (PH), thousand-seed weight (TSW) and productivity (PROD), no significant differences were observed between the edited genotype and the wild-type cultivar (Figure 12). PH was measured at physiological maturity, TSW was obtained from the mass of 1000 dried seeds, and PROD was calculated as grain yield per plant under field conditions. Similarly, protein and oil contents showed no significant variation between genotypes. Overall, the edited line displayed agronomic performance comparable to the non-edited cultivar, with only minor differences.

Figure 12
Bar graphs comparing two variables: AF12-13-1 and BRS 537. Section A shows mean values for plant height, production, and TSW. BRS 537 has slightly higher values in all metrics with 109.5 cm, 3.5 x 1000 production, and 164.8 g TSW compared to AF12-13-1. Section B depicts oil and protein percentages, with BRS 537 having 22.14% oil and 38.39% protein, slightly lower and higher respectively than AF12-13-1. Bars are purple for AF12-13-1 and green for BRS 537. Error bars indicate data variability.

Figure 12. Agronomical performance of the genotypes AF12-13–1 and BRS 537 for the variables. (A) height (PH), productivity (PROD), thousand-seed weight (TSW). (B) Protein (%) and Oil (%). Different letters indicate significant differences between genotypes AF12-13–1 and BRS 537 at a 5% error probability according to Tukey’s test.

4 Discussion

The gene Le1 (Glyma.02G012600) is responsible for lectin accumulation in soybean seeds. This trait is governed by the dominant allele (LE), whereas the recessive allele (le) carries an insertion element (Tgm1) that disrupts the gene’s reading frame, preventing lectin accumulation (Padalkar et al., 2023; Alves de Moraes et al., 2006; Vodkin et al., 1983). Expression analyses across different plant tissues indicate that Le1 is predominantly expressed in seeds (Figure 4). Moreover, its promoter harbors a higher number of seed-specific motifs compared to other members of the PF03388 family (Chragh et al., 2015). Transcriptome profiling at various seed developmental stages further shows that Le1 is the only family member expressed in this tissue, with peak mRNA levels occurring during mid-development (cotyledons of 100–200 mg) (Shamimuzzaman and Vodkin, 2018). Although Le1 is specifically associated with seed lectin accumulation, other members of the lectin gene family remain expressed in vegetative tissues, such as leaves and roots, where they contribute to defense against herbivores and pathogens (Chrispeels and Raikhel, 1991; Lannoo and Van Damme, 2014; De Coninck and Van Damme, 2022).These observations suggest that introducing mutations in Le1 is unlikely to affect other plant traits, which is corroborated by agronomic performance data (Figure 13), showing no differences between the edited plant and the non-edited cultivar in either plant height or grain yield in field conditions.

Figure 13
Top image shows a detailed molecular structure with green strands representing protein chains, labeled with various amino acids connected by yellow dotted lines. Bottom image displays a complex multicolored protein structure with red circles highlighting specific areas.

Figure 13. In silico representation of amino acid interactions maintaining the quaternary structure of soybean agglutinin, predicted using pyMOL.

Regarding genetic transformation and generation of edited plants, soybean is considered a recalcitrant species. Despite adaptations that optimized transformation protocols, the efficiency remains low compared to species like rice, tobacco, and maize (Xu et al., 2020). In our work, the transformation efficiency was above to the average reported in the literature, which is around 3% (Xu et al., 2020). As for editing efficiency, two gRNAs were designed to target Le1, but only one showed editing in the events derived from stable transformation. Several factors might have contributed to this result, including the fact that the gRNAs were expressed under the AtU6 promoter from Arabidopsis thaliana in tandem, which may lead to competition for the transcription machinery, Additionally, the structure of the guide RNAs influences editing efficiency. By evaluating the structure of both guides, it is possible to identify that gRNA2 presented a more suitable configuration for Cas9 binding (Supplementary Figure 1). We observed that most edited events were obtained with gRNA 2, with an editing efficiency close to 10%. This result is similar to those reported in other studies using the AtU6 promoter to drive gRNA expression (Sun et al., 2015; Du et al., 2016). Literature also indicates a significant increase in editing efficiency when the GmU6–10 promoter replaced AtU6-26 (Sun et al., 2015; Du et al., 2016; Di et al., 2019). Additionally, optimizing the Cas9 enzyme sequence can significantly affect editing efficiency (Michno et al., 2015, 2020; Cai et al., 2018).

Among the transgenic plants obtained, the AF12-13–1 plant exhibited a deletion of four amino acids at positions 388-391 (Figure 7). This mutation caused a frameshift and introduced a premature stop codon, resulting in a nonsense mutation. Bioinformatic analysis revealed that the truncated protein in AF12–13 consists of 159 amino acids and is classified as unstable. Soybean agglutinin is made up of four monomeric units that bind together to form a tetramer. The quaternary structure is stabilized by ionic interactions among the residues Asp-192, Arg-185, Asp-169, and Lys-163, which, positioned at different interfaces of the tetramer, collectively maintain the structural stability of the protein (Figure 13) (Sinha and Surolia, 2005; Sinha et al., 2007). The structural modification in the AF12-13–1 protein led to the loss of amino acids essential for maintaining the quaternary structure (Figure 8). Moreover, soybean agglutinin forms two dimers, which associate to form the tetrameric structure (dimers of dimers) (Halder et al., 2016). In the plant AF12-13-1, the amino acids mediating these dimer–dimer interactions are missing.

To perform its carbohydrate-binding activity and fulfill its biological function, the lectin must be properly glycosylated. The interaction between the carbohydrate and the protein requires the amino acids aspartate, asparagine, and an aromatic amino acid (Sharon and Lis, 2001). Mirkov and Chrispeels, 1993 showed that a mutation in Asn-128 of the leucoagglutinin (Pha-LEU) from Phaseolus vulgaris disrupts its carbohydrate-binding ability and abolishes the protein’s biological activity. Similarly, in Pisum sativum, van Eijsden et al., 1992 demonstrated that a mutation in Asn-125 of Psl eliminates the protein’s capacity to bind the monosaccharides glucose and mannose, which are essential for binding blood carbohydrates and hemagglutinating activity (Ma et al., 2011; Nabila et al., 2019; Tang et al., 2024).

The mutations introduced in the edited plant limited the lectin’s hemagglutination activity due to the absence of the protein’s quaternary structure in the AF12-13–1 plant (Figure 13). Therefore, the CRISPR/Cas9-induced mutation in the Le1 gene of AF12-13–1 prevents full translation of the gene, resulting in a truncated protein. SDS-PAGE results support this finding, showing the protein’s absence in the mutated plant. In recessive Le1 genotypes, the presence of the Tgm1 insertion element disrupts the protein’s reading frame and prevents its accumulation in the seed (Goldberg et al., 1983; Vodkin et al., 1983; Okamuro and Goldberg, 1992). The inactivation of soybean lectin, either using lectin-free genotypes, thermal processing, fermentation, or high-pressure treatment, has been consistently associated with improved animal performance and reduced intestinal damage caused by the deleterious effects of this soybean agglutinin (Pan et al., 2018; Han et al., 2023; Obeidat et al., 2025).

The generation advancement, monitored through molecular characterization by conventional PCR and sequencing, enabled the development of an plant free of the CRISPR/Cas machinery, which can be considered non-GMO under the Brazilian legislation. Overall, one of the major goals of soybean breeding is to eliminate or reduce antinutritional factors that impair animal growth performance, particularly in monogastric species (Douglas et al., 1999; Palacios et al., 2004; Palliyeguru et al., 2011; Hoffmann et al., 2019; Buzek et al., 2023; Nisley et al., 2025).

Recent studies have demonstrated that CRISPR-based genome editing can effectively inactivate genes responsible for these compounds, including the Kunitz trypsin inhibitor (KTI), the Bowman–Birk inhibitor (BBI), the raffinose family oligosaccharides, and phytic acid (Le et al., 2020; Cao et al., 2022; Song et al., 2022; Lin et al., 2023; Wang et al., 2023; Liu et al., 2025). In the near future, soybean lines carrying individual edits for these traits could be combined through conventional breeding, generating plants with reduced levels of multiple antinutritional factors. Conventional breeding efforts have already demonstrated the feasibility of pyramiding alleles associated with the natural absence of several antinutritional proteins. Classical crossing and marker-assisted selection have produced Triple Null and Penta Null soybean genotypes that combine recessive alleles for Kunitz trypsin inhibitor, soybean agglutinin, lipoxygenases, the 7S α′ subunit, and stachyose synthase (Choi et al., 2022; Schmidt et al., 2015). Together with our results, these genotypes confirm that eliminating multiple antinutritional traits does not compromise yield or seed composition. However, conventional approaches are time-consuming, require multiple generations, and depend on existing natural variation, highlighting the value of genome editing for achieving similar or broader outcomes with greater precision and efficiency.

In this context, CRISPR-based approaches have already achieved multigene improvements within a single generation. Multiplex editing of raffinose synthase genes (RS2 and RS3) has effectively reduced raffinose-family oligosaccharides without affecting seed viability or productivity (Cao et al., 2022; Lin et al., 2023), while simultaneous knockouts of Kunitz trypsin inhibitor and Bowman–Birk inhibitor genes have generated low-protease-inhibitor soybean lines with improved digestibility and feed performance (Wang et al., 2023; Liu et al., 2025). Beyond the specific knockout of Le1 achieved in this study, these advances illustrate a broader paradigm shift in crop biotechnology, transitioning from single-gene modification toward coordinated optimization of complex, nutritionally relevant traits.

Emerging CRISPR-derived tools, such as base and prime editing, further expand the potential for precise modulation of multiple loci related to nutritional composition, stress resilience, and yield stability (Li et al., 2023). Similar integrated strategies have been successfully applied in Brassica crops to balance glucosinolate biosynthesis and optimize both defense and nutritional quality (Cheng et al., 2024). Combining multiplex editing with genomic selection and metabolomic profiling is expected to accelerate the development of soybean cultivars that integrate enhanced feed value, sustainability, and robust agronomic performance (Chen et al., 2024).

Ultimately, soybean quality improvement should not focus solely on eliminating individual antinutritional factors, such as lectin, KTI, or oligosaccharides, but rather on achieving an optimized equilibrium between nutritional quality, resilience, and yield. This integrated perspective defines a forward-looking pathway for next-generation soybean breeding.

5 Conclusion

The Le1 gene has been characterized as being exclusively expressed in seeds, and its knockout did not affect plant growth or yield;

Gene editing via CRISPR/Cas9 proved to be an efficient strategy for generating low-lectin varieties, producing a truncated protein that lost both its biological activity and the ability to form a quaternary structure;

Taken together, these results demonstrate that CRISPR/Cas9-mediated editing of seed antinutritional proteins, such as lectins, is an effective strategy to improve soybean grain quality for animal feed.

Data availability statement

The plasmid vector generated and used in this study has been deposited in the GenBank database (Accession number PX725410). The complete nucleotide sequence of the vector is also available in the Supplementary Materials of this article.

Author contributions

JK: Conceptualization, Methodology, Validation, Writing – original draft, Writing – review & editing. AK: Conceptualization, Writing – review & editing. RH: Methodology, Validation, Writing – review & editing. LG: Methodology, Writing – review & editing. CS: Methodology, Writing – review & editing. SM: Methodology, Validation, Writing – review & editing. ER: Funding acquisition, Writing – review & editing. AN: Funding acquisition, Writing – review & editing. LM-H: Conceptualization, Funding acquisition, Project administration, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was funded by The Brazilian Agricultural Research Corporation (Embrapa) Project Number (20.22.03.028.00.00) and by The National Institute of S&T in Engineering Systems Biology, EngBio (processo: 408411/2024-4).

Acknowledgments

We thank the Coordination for the Improvement of Higher Education Personnel (CAPES) for granting a doctoral scholarship to JMK, and the National Council for Scientific and Technological Development (CNPq) for granting a Productivity in Research Scholarship (PQ Scholarship - 303753/2022-6) to LM-H.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was used in the creation of this manuscript. The author(s) declare that Generative AI (ChatGPT, OpenAI, GPT 4, 2025) was used in the creation of this manuscript to improve language and translation.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1737513/full#supplementary-material

References

Adamcová, A., Laursen, K. H., and Ballin, N. Z. (2021). Lectin activity in commonly consumed plant-based foods: Calling for method harmonization and risk assessment. Foods 10, 2796. doi: 10.3390/foods10112796

PubMed Abstract | Crossref Full Text | Google Scholar

Alves de Moraes, R. M., Soares, T. C. B., Colombo, L. R., Spegiorin Salla, M. F., Barros, J. G. D. A., Piovesan, N. D., et al. (2006). Assisted selection by specific DNA markers for genetic elimination of the kunitz trypsin inhibitor and lectin in soybean seeds. Euphytica 149, 221–226. doi: 10.1007/s10681-005-9069-0

Crossref Full Text | Google Scholar

Buzek, A., Zaworska-Zakrzewska, A., Muzolf-Panek, M., and Kasprowicz-Potocka, M. (2023). Microbial phytase in a diet with lupine and extruded full-fat soya seeds affects the performance, carcass characteristics, meat quality, and bone mineralization of fatteners. Anim. (Basel) 13, 1655. doi: 10.3390/ANI13101655

PubMed Abstract | Crossref Full Text | Google Scholar

Cai, Y., Chen, L., Liu, X., Guo, C., Sun, S., Wu, C., et al. (2018). CRISPR/Cas9-mediated targeted mutagenesis of GmFT2a delays flowering time in soya bean. Plant Biotechnol. J. 16, 176–185. doi: 10.1111/PBI.12758

PubMed Abstract | Crossref Full Text | Google Scholar

Cao, L., Wang, Z., Ma, H., Liu, T., Ji, J., and Duan, K. (2022). Multiplex CRISPR/Cas9-mediated raffinose synthase gene editing reduces raffinose family oligosaccharides in soybean. Front. Plant Sci. 13. doi: 10.3389/FPLS.2022.1048967/BIBTEX

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, F., Chen, L., Yan, Z., Xu, J., Feng, L., He, N., et al. (2024). Recent advances of CRISPR-based genome editing for enhancing staple crops. Front. Plant Sci. 15. doi: 10.3389/FPLS.2024.1478398/FULL

PubMed Abstract | Crossref Full Text | Google Scholar

Cheng, B., Ran, R., Qu, Y., Verkerk, R., Henry, R., Dekker, M., et al. (2024). Advancements in balancing glucosinolate production in plants to deliver effective defense and promote human health. Agric. Commun. 2, 100040. doi: 10.1016/J.AGRCOM.2024.100040

Crossref Full Text | Google Scholar

Choi, S. W., Ly, S., Lee, J. H., Oh, H. S., Kim, S. Y., Kim, N. H., et al. (2022). Breeding of penta null soybean [Glycine max (L.) merr.] for five antinutritional and allergenic components of lipoxygenase, KTI, lectin, 7S α′ Subunit, and stachyose. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.910249

PubMed Abstract | Crossref Full Text | Google Scholar

Chragh, M., Zolotarov, Y., Saeed, H., and Strömvik, M. V. (2015). Le4 is an epicotyl preferential homologue of the soybean seed-specific Le1 lectin and the vegetative Le3 lectin genes. Plant Mol. Biol. Rep. 33, 1779–1789. doi: 10.1007/s11105-015-0873-y

Crossref Full Text | Google Scholar

Chrispeels, M. J. and Raikhel, N. V. (1991). Lectins, lectin genes, and their role in plant defense. Plant Cell 3, 1–9. doi: 10.1105/TPC.3.1.1

PubMed Abstract | Crossref Full Text | Google Scholar

De Coninck, T. and Van Damme, E. J. M. (2021). Review: The multiple roles of plant lectins. Plant Sci. 313, 111096. doi: 10.1016/j.plantsci.2021.111096

PubMed Abstract | Crossref Full Text | Google Scholar

De Coninck, T. and Van Damme, E. J. M. (2022). Plant lectins: Handymen at the cell surface. Cell Surface 8, 100091. doi: 10.1016/J.TCSW.2022.100091

PubMed Abstract | Crossref Full Text | Google Scholar

Di, Y. H., Sun, X. J., Hu, Z., Jiang, Q. Y., Song, G. H., Zhang, B., et al. (2019). Enhancing the CRISPR/Cas9 system based on multiple GmU6 promoters in soybean. Biochem. Biophys. Res. Commun. 519, 819–823. doi: 10.1016/J.BBRC.2019.09.074

PubMed Abstract | Crossref Full Text | Google Scholar

Douglas, M. W., Parsons, C. M., and Hymowitz, T. (1999). Nutritional evaluation of lectin-free soybeans for poultry. Poult Sci. 78, 91–95. doi: 10.1093/ps/78.1.91

PubMed Abstract | Crossref Full Text | Google Scholar

Doyle, J. J. and Doyle, J. L. (1987). A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bull. 19, 11–15.

Google Scholar

Du, H., Zeng, X., Zhao, M., Cui, X., Wang, Q., Yang, H., et al. (2016). Efficient targeted mutagenesis in soybean by TALENs and CRISPR/Cas9. J. Biotechnol. 217, 90–97. doi: 10.1016/J.JBIOTEC.2015.11.005

PubMed Abstract | Crossref Full Text | Google Scholar

Edgar, R. C. (2004). MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797. doi: 10.1093/NAR/GKH340

PubMed Abstract | Crossref Full Text | Google Scholar

Gasteiger, E., Hoogland, C., Gattiker, A., Duvaud, S., Wilkins, M. R., Appel, R. D., et al. (2005). Protein identification and analysis tools on the exPASy server. Proteomics Protoc. Handb. 112, 571–607. doi: 10.1385/1-59259-890-0:571

PubMed Abstract | Crossref Full Text | Google Scholar

George, M. A., Bhide, S. V., Thengane, R. J., Hosseini, G. H., and Manjaya, J. G. (2008). Identification of low lectin mutants in soybean. Plant Breed. 127, 150–153. doi: 10.1111/j.1439-0523.2007.01449.x

Crossref Full Text | Google Scholar

Goldberg, R. B., Hoschek, G., and Vodkin, L. O. (1983). An insertion sequence blocks the expression of a soybean lectin gene. Cell 33, 465–475. doi: 10.1016/0092-8674(83)90428-2

PubMed Abstract | Crossref Full Text | Google Scholar

Guruprasad, K., Reddy, B. V. B., and Pandit, M. W. (1990). Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. 4, 155–161. doi: 10.1093/PROTEIN/4.2.155

PubMed Abstract | Crossref Full Text | Google Scholar

Halder, S., Surolia, A., and Mukhopadhyay, C. (2016). Dynamics simulation of soybean agglutinin (SBA) dimer reveals the impact of glycosylation on its enhanced structural stability. Carbohydr Res. 428, 8–17. doi: 10.1016/J.CARRES.2016.04.009

PubMed Abstract | Crossref Full Text | Google Scholar

Han, X., Sun, Y., Huangfu, B., He, X., and Huang, K. (2023). Ultra-high-pressure passivation of soybean agglutinin and safety evaluations. Food Chem. X 18, 100726. doi: 10.1016/J.FOCHX.2023.100726

PubMed Abstract | Crossref Full Text | Google Scholar

Hoffmann, D., Thurner, S., Ankerst, D., Damme, K., Windisch, W., and Brugger, D. (2019). Chickens’ growth performance and pancreas development exposed to soy cake varying in trypsin inhibitor activity, heat-degraded lysine concentration, and protein solubility in potassium hydroxide. Poult Sci. 98, 2489–2499. doi: 10.3382/ps/pey592

PubMed Abstract | Crossref Full Text | Google Scholar

Koltun, A., e Silva, N. V., Angelotti-Mendonça, J., Marin, S. R. R., Gonçalves, L. S. A., Nepomuceno, A. L., et al. (2023). CRISPR-transient expression in soybean for simplified gRNA screening in planta. Pesqui Agropecu Bras. 58, e03000. doi: 10.1590/S1678-3921.PAB2023.V58.03000

Crossref Full Text | Google Scholar

Laemmli, U. K. (1970). Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227, 680–685. doi: 10.1038/227680A0

PubMed Abstract | Crossref Full Text | Google Scholar

Lannoo, N. and Van Damme, E. J. M. (2014). Lectin domains at the frontiers of plant defense. Front. Plant Sci. 5. doi: 10.3389/FPLS.2014.00397

PubMed Abstract | Crossref Full Text | Google Scholar

Le, H., Nguyen, N. H., Ta, D. T., Le, T. N. T., Bui, T. P., Le, N. T., et al. (2020). CRISPR/cas9-mediated knockout of galactinol synthase-encoding genes reduces raffinose family oligosaccharide levels in soybean seeds. Front. Plant Sci. 11. doi: 10.3389/FPLS.2020.612942/BIBTEX

PubMed Abstract | Crossref Full Text | Google Scholar

Letunic, I. and Bork, P. (2024). Interactive Tree of Life (iTOL) v6: Recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 52, W78–W82. doi: 10.1093/NAR/GKAE268

PubMed Abstract | Crossref Full Text | Google Scholar

Li, J., Zhang, C., He, Y., Li, S., Yan, L., Li, Y., et al. (2023). Plant base editing and prime editing: The current status and future perspectives. J. Integr. Plant Biol. 65, 444–467. doi: 10.1111/JIPB.13425

PubMed Abstract | Crossref Full Text | Google Scholar

Lin, W., Kuang, H., Bai, M., Jiang, X., Zhou, P., Li, Y., et al. (2023). Multiplex genome editing targeting soybean with ultra-low anti-nutritive oligosaccharides. Crop J. 11, 825–831. doi: 10.1016/J.CJ.2023.01.001

Crossref Full Text | Google Scholar

Liu, K., Wang, Z., and An, Y. Q. (2025). Developing and evaluating CRISPR-Cas9 edited transgene-free soybeans with dramatic reduction in trypsin and chymotrypsin inhibition based on selfing and phenotyping T1 to T3 seeds. J. Agric. Food Res. 22, 101811. doi: 10.1016/J.JAFR.2025.101811

Crossref Full Text | Google Scholar

Ma, Y., Wang, T., Faris, R., and Spurlock, M. (2011). In vitro hemagglutination activity of β-conglycinin and glycinin fractions and feeding study of non-thermal treated soy protein. JAOCS J. Am. Oil Chemists’ Soc. 88, 983–992. doi: 10.1007/s11746-010-1752-1

Crossref Full Text | Google Scholar

Mertz-Henning, L. M., Ferreira, L. C., Henning, F. A., Mandarino, J. M. G., Santos, E. D., Oliveira, M. C. N. D., et al (2017). Effect of Water Deficit-Induced at Vegetative and Reproductive Stages on Protein and Oil Content in Soybean Grains. Agronomy 3. doi: 10.3390/agronomy8010003

Crossref Full Text | Google Scholar

Messina, M. (2016). Soy and health update: evaluation of the clinical and epidemiologic literature. Nutrients 8, 754. doi: 10.3390/NU8120754

PubMed Abstract | Crossref Full Text | Google Scholar

Michno, J. M., Virdi, K., Stec, A. O., Liu, J., Wang, X., Xiong, Y., et al. (2020). Integration, abundance, and transmission of mutations and transgenes in a series of CRISPR/Cas9 soybean lines. BMC Biotechnol. 20, 1–13. doi: 10.1186/S12896-020-00604-3/FIGURES/5

PubMed Abstract | Crossref Full Text | Google Scholar

Michno, J. M., Wang, X., Liu, J., Curtin, S. J., Kono, T. J., and Stupar, R. M. (2015). CRISPR/Cas mutagenesis of soybean and Medicago truncatula using a new web-tool and a modified Cas9 enzyme. GM Crops Food 6, 243–252. doi: 10.1080/21645698.2015.1106063

PubMed Abstract | Crossref Full Text | Google Scholar

Mirkov, T. E. and Chrispeels, M. J. (1993). Mutation of Asn to Asp of Phaseolus vulgaris leucoagglutinin (PHA-L) eliminates carbohydrate-binding and biological activity. Glycobiology 3, 581–587. doi: 10.1093/glycob/3.6.581

PubMed Abstract | Crossref Full Text | Google Scholar

Nabila, N., Meidianto, V. F., Tjandrawinata, R. R., Rachmawati, H., and Ismaya, W. T. (2019). Agaricus bisporus mannose binding protein is not an agglutinating protein. Biochem. Biophys. Res. Commun. 519, 773–776. doi: 10.1016/j.bbrc.2019.09.071

PubMed Abstract | Crossref Full Text | Google Scholar

Naito, Y., Hino, K., Bono, H., and Ui-Tei, K. (2015). CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites. Bioinformatics 31, 1120–1123. doi: 10.1093/BIOINFORMATICS/BTU743

PubMed Abstract | Crossref Full Text | Google Scholar

Nisley, M. J., Miller, K. A., Spencer, J. D., Mendoza, O. F., Krishnan, H. B., and Gabler, N. K. (2025). Increasing dietary soybean-derived trypsin inhibitor protein compromises nursery pig performance, nitrogen digestibility, and retention. J. Anim. Sci. 103, skaf253. doi: 10.1093/JAS/SKAF253

PubMed Abstract | Crossref Full Text | Google Scholar

Obeidat, M. D., Alzoubi, S. Q., Nusairat, B. M., Obeidat, B. S., and Riley, D. G. (2025). Effects of fermented soybean meal supplementation on growth, carcass quality, and intestinal morphology in Ross 308 and Indian River broilers. Anim. (Basel) 15, 2659. doi: 10.3390/ANI15182659

PubMed Abstract | Crossref Full Text | Google Scholar

Okamuro, J. K. and Goldberg, R. B. (1992). A mutant lectin gene is rescued from an insertion element that blocks its expression. Plant Cell 4, 1141–1146. doi: 10.1105/tpc.4.9.1141

PubMed Abstract | Crossref Full Text | Google Scholar

Olsen, L. R., Dessen, A., Gupta, D., Sabesan, S., Sacchettini, J. C., and Brewer, C. F. (1997). X-ray crystallographic studies of unique cross-linked lattices between four isomeric biantennary oligosaccharides and soybean agglutinin. Biochemistry 36, 15073–15080. doi: 10.1021/BI971828+

PubMed Abstract | Crossref Full Text | Google Scholar

Padalkar, G., Mandlik, R., Sudhakaran, S., Vats, S., Kumawat, S., Kumar, V., et al. (2023). Necessity and challenges for exploration of nutritional potential of staple-food grade soybean. J. Food Composition Anal. 117, 105093. doi: 10.1016/j.jfca.2022.105093

Crossref Full Text | Google Scholar

Palacios, M. F., Easter, R. A., Soltwedel, K. T., Parsons, C. M., Douglas, M. W., Hymowitz, T., et al. (2004). Effect of soybean variety and processing on growth performance of young chicks and pigs. J. Anim. Sci. 82, 1108–1114. doi: 10.2527/2004.8241108X

PubMed Abstract | Crossref Full Text | Google Scholar

Palliyeguru, M. W. C. D., Rose, S. P., and Mackenzie, A. M. (2011). Effect of trypsin inhibitor activity in soya bean on growth performance, protein digestibility and incidence of sub-clinical necrotic enteritis in broiler chicken flocks. Br. Poult Sci. 52, 359–367. doi: 10.1080/00071668.2011.577054

PubMed Abstract | Crossref Full Text | Google Scholar

Pan, L., Farouk, M. H., Qin, G., Zhao, Y., and Bao, N. (2018). The influences of soybean agglutinin and functional oligosaccharides on the intestinal tract of monogastric animals. Int. J. Mol. Sci. 19, 554. doi: 10.3390/IJMS19020554

PubMed Abstract | Crossref Full Text | Google Scholar

Paz, M. M., Martinez, J. C., Kalvig, A. B., Fonger, T. M., and Wang, K. (2005). Improved cotyledonary node method using an alternative explant derived from mature seed for efficient Agrobacterium-mediated soybean transformation. Plant Cell Rep. 25, 206–213. doi: 10.1007/S00299-005-0048-7

PubMed Abstract | Crossref Full Text | Google Scholar

PyMOL (2025). The PyMOL Molecular Graphics System. Available online at: https://www.pymol.org/support.html (Accessed October 11, 2025).

Google Scholar

Rizzo, G., Baroni, L., Rizzo, G., and Baroni, L. (2018). Soy, soy foods and their role in vegetarian diets. Nutrients 10, 43. doi: 10.3390/NU10010043

PubMed Abstract | Crossref Full Text | Google Scholar

RStudio Team. (2020). RStudio: Integrated Development Environment for R. RStudio, PBC, Boston, MA, USA. Available online at: https://www.rstudio.com/

Google Scholar

Schmidt, M. A., Hymowitz, T., and Herman, E. M. (2015). Breeding and characterization of soybean Triple Null; a stack of recessive alleles of Kunitz Trypsin Inhibitor, Soybean Agglutinin, and P34 allergen nulls. Plant Breed. 134, 310–315. doi: 10.1111/pbr.12265

Crossref Full Text | Google Scholar

Shamimuzzaman, M. and Vodkin, L. (2018). Ribosome profiling reveals changes in translational status of soybean transcripts during immature cotyledon development. PloS One 13, 1–28. doi: 10.1371/journal.pone.0194596

PubMed Abstract | Crossref Full Text | Google Scholar

Sharon, N. and Lis, H. (2001). The structural basis for carbohydrate recognition by lectins. Adv. Exp. Med. Biol. 491, 1–16. doi: 10.1007/978-1-4615-1267-7_1

PubMed Abstract | Crossref Full Text | Google Scholar

Singh, P., Kumar, R., Sabapathy, S. N., and Bawa, A. S. (2008). Functional and edible uses of soy protein products. Compr. Rev. Food Sci. Food Saf. 7, 14–28. doi: 10.1111/J.1541-4337.2007.00025.X

Crossref Full Text | Google Scholar

Sinha, S., Gupta, G., Vijayan, M., and Surolia, A. (2007). Subunit assembly of plant lectins. Curr. Opin. Struct. Biol. 17, 498–505. doi: 10.1016/J.SBI.2007.06.007

PubMed Abstract | Crossref Full Text | Google Scholar

Sinha, S. and Surolia, A. (2005). Oligomerization endows enormous stability to soybean agglutinin: A comparison of the stability of monomer and tetramer of soybean agglutinin. Biophys. J. 88, 4243–4251. doi: 10.1529/biophysj.105.061309

PubMed Abstract | Crossref Full Text | Google Scholar

Song, J. H., Shin, G., Kim, H. J., Lee, S. B., Moon, J. Y., Jeong, J. C., et al. (2022). Mutation of gmIPK1 gene using CRISPR/cas9 reduced phytic acid content in soybean seeds. Int. J. Mol. Sci. 23, 10583. doi: 10.3390/IJMS231810583/S1

PubMed Abstract | Crossref Full Text | Google Scholar

Sun, X., Hu, Z., Chen, R., Jiang, Q., Song, G., Zhang, H., et al. (2015). Targeted mutagenesis in soybean using the CRISPR-Cas9 system. Sci. Rep. 5, 1–10. doi: 10.1038/srep10342

PubMed Abstract | Crossref Full Text | Google Scholar

Tang, C., Jiang, X., Liu, C., Washburn, B. K., Sathe, S. K., and Rao, Q. (2024). Effect of temperature on structural configuration and immunoreactivity of pH-stressed soybean (Glycine max) agglutinin. Food Chem. 442, 138376. doi: 10.1016/j.foodchem.2024.138376

PubMed Abstract | Crossref Full Text | Google Scholar

van Eijsden, R. R., Hoedemaeker, F. J., Díaz, C. L., Lugtenberg, B. J. J., Sylvia de Pater, B., and Kijne, J. W. (1992). Mutational analysis of pea lectin. Substitution of Asn125 for Asp in the monosaccharide-binding site eliminates mannose/glucose-binding activity. Plant Mol. Biol. 20, 1049–1058. doi: 10.1007/BF00028892

PubMed Abstract | Crossref Full Text | Google Scholar

Vodkin, L. O., Rhodes, P. R., and Goldberg, R. B. (1983). Ca lectin gene insertion has the structural features of a transposable element. Cell 34, 1023–1031. doi: 10.1016/0092-8674(83)90560-3

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, Z., Shea, Z., Rosso, L., Shang, C., Li, J., Bewick, P., et al. (2023). Development of new mutant alleles and markers for KTI1 and KTI3 via CRISPR/Cas9-mediated mutagenesis to reduce trypsin inhibitor content and activity in soybean seeds. Front. Plant Sci. 14. doi: 10.3389/FPLS.2023.1111680/BIBTEX

PubMed Abstract | Crossref Full Text | Google Scholar

Waterhouse, A. M., Studer, G., Robin, X., Bienert, S., Tauriello, G., and Schwede, T. (2024). The structure assessment web server: For proteins, complexes and more. Nucleic Acids Res. 52, W318–W323. doi: 10.1093/NAR/GKAE270

PubMed Abstract | Crossref Full Text | Google Scholar

WHO (2010). “Capillary sampling,” in WHO Guidelines on Drawing Blood: Best Practices in Phlebotomy.

Google Scholar

Xu, H., Zhang, L., Zhang, K., and Ran, Y. (2020). Progresses, challenges, and prospects of genome editing in soybean (Glycine max). Front. Plant Sci. 11. doi: 10.3389/fpls.2020.571138

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: agglutinin, antinutritional factor, Glycine max, knockout, gene editing

Citation: Kafer JM, Koltun A, Hoshino RT, Girotto L, Silveira CA, Marin SRR, Rech EL, Nepomuceno AL and Mertz-Henning LM (2026) CRISPR/Cas9-mediated inactivation of the soybean agglutinin Le1 gene to improve grain quality. Front. Plant Sci. 16:1737513. doi: 10.3389/fpls.2025.1737513

Received: 01 November 2025; Accepted: 28 November 2025; Revised: 25 November 2025;
Published: 09 January 2026.

Edited by:

Li Tian, University of California, Davis, United States

Reviewed by:

Taras P. Pasternak, Miguel Hernández University of Elche, Spain
Guanglong Hu, Beijing Academy of Agricultural and Forestry Sciences, China
Zachary Shea, Virginia Tech, United States

Copyright © 2026 Kafer, Koltun, Hoshino, Girotto, Silveira, Marin, Rech, Nepomuceno and Mertz-Henning. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Liliane Marcia Mertz-Henning, bGlsaWFuZS5oZW5uaW5nQGVtYnJhcGEuYnI=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.