Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Neurol., 05 February 2026

Sec. Neurogenetics

Volume 17 - 2026 | https://doi.org/10.3389/fneur.2026.1772239

This article is part of the Research TopicGenotype-Phenotype Correlations, Genetic Mechanisms of Phenotypic Heterogeneity, Optimized Diagnosis and Targeted Therapies in Epilepsy and Neurodevelopmental DisordersView all 15 articles

Toward precision medicine in SCN3A variants-associated encephalopathies and epilepsy: optimizing genetic diagnosis and molecular subregional effects

  • 1The Affiliated Nanhua Hospital, Department of Neurology, Hengyang Medical School, University of South China, Hengyang, China
  • 2Department of Neurology, Institute of Neuroscience, Key Laboratory of Neurogenetics and Channelopathies of Guangdong Province and the Ministry of Education of China, The Second Affiliated Hospital, Guangzhou Medical University, Guangzhou, China
  • 3Department of Neurology, The First Affiliated Hospital & Clinical Neuroscience Institute of Jinan University, Guangzhou, China
  • 4The First Affiliated Hospital, Department of Neurology, Hengyang Medical School, University of South China, Hengyang, China

Background: Variants in SCN3A gene encoding the voltage-gated sodium channel Nav1. 3 have been associated with severe developmental and/or epileptic encephalopathies, characterized by early-onset, drug-resistant seizures, malformations of cortical development, and profound neurodevelopmental impairment. Rapid clinical interpretation of SCN3A missense variants remains challenging. This study aimed to explore potentially reliable indicators in reflecting the pathogenicity of SCN3A variants, thereby promoting genetic diagnosis.

Methods: The disease-associated and benign/likely benign SCN3A missense variants were systematically collected via two independent epilepsy geneticists to curate high-confidence dataset. The molecular subregional effects were analyzed to explore possible genotype-phenotype correlation. The diagnostic performance of nineteen commonly used algorithms was systematically evaluated using ROC analysis and confusion matrices metrics such as accuracy, sensitivity, specificity, and Matthews correlation coefficient (MCC).

Results: A total of 20 pathogenic, affecting 37 patients, and 45 benign/likely benign SCN3A variants were included. Pathogenic SCN3A variants were statistically more located in transmembrane regions than in other regions, suggesting possible subregional effects. Deep-learning-based tools incorporating structural data, AlphaMissense, demonstrated superior balanced accuracy (>90%) and robust discrimination (AUC > 0.96). Meta-predictors, such as BayesDel_addAF and ClinPred, also showed high sensitivity but lower specificity at default thresholds. Notably, applying gene-specific optimal thresholds significantly improved performance across multiple tools.

Conclusion: This study provides systematic benchmarks for algorithms in SCN3A-related DEEs. Integration of reliable algorithms with gene-specific thresholds into clinical variant interpretation pipelines could possibly refine the pathogenicity assessment of missense variants, subsequently informing timely risk stratification and personalized therapeutic strategies for affected patients.

1 Introduction

Developmental and epileptic encephalopathies (DEEs) constitute a group of severe neurodevelopmental disorders characterized by early-onset, drug-resistant seizures, malformations of cortical development, and profound neurodevelopmental impairment (1). These conditions typically manifest in infancy or early childhood, and in some cases, clinical or electrographic abnormalities are evident at birth or even prenatally. The phenotypic spectrum is broad, ranging from isolated developmental delay to profound neurological deficits accompanied by cortical malformations and movement disorders. Critically, the developmental dysfunction in DEEs is not solely attributable to seizure burden but also stems from the underlying pathogenic mechanism itself, which disrupts normal brain maturation. The profound impact of DEEs extends beyond the affected individuals, imposing substantial emotional, financial, and societal burdens on families and healthcare systems.

Recent advances in high-throughput genomic sequencing have accelerated the discovery of monogenic causes of DEEs. A growing number of genes have been associated with DEEs, including KCNQ2 (2, 3), SCN1A (4), and SCN2A (5). Notably, many of the most frequently implicated genes encode voltage-gated ion channels, underscoring the critical dependence of early brain development on the precise regulation of neuronal excitability and network activity (6).

The SCN3A gene, located on chromosome 2q24, encodes sodium voltage-gated channel alpha subunit 3 (Nav1.3), which is mainly expressed in the brain and predominantly during the embryonic stage (7, 8). Nav1.3 proteins play crucial roles in the generation and propagation of action potentials or network maturation in the brain. In decades, pathogenic variants in SCN3A have been associated with an expanding spectrum of neurodevelopmental disorders, including familial focal epilepsy with variable foci-4 (FFEVF4 [MIM: 617935]) (7), epileptic encephalopathy-62 (DEE62 [MIM: 617938]) (8), and cortical malformations such as polymicrogyria and schizencephaly (9). Affected infants typically present with drug-resistant seizures, global developmental delay, and movement abnormalities. In severe patients, structural brain anomalies are detectable by prenatal or postnatal MRI. Most patients presented with early symptoms that resemble other forms of epilepsy. Even with genetic tests, pathogenicity cannot be assessed rapidly, especially for missense variants. Clinicians urgently require rapid and reliable methods for distinguishing pathogenic SCN3A variants from benign ones to support early diagnosis, risk stratification, and personalized therapy.

Among the various types of SCN3A variants, missense variants are difficult to interpret because the functional phenotypes cannot be inferred from gene dosage. To assist variant classification, numerous algorithms, such as SIFT (10, 11), PolyPhen-2 (12), MutationTaster (13), CADD (14), and REVEL (15), combine evolutionary conservation, physicochemical properties, structural modeling, and machine learning to estimate the likelihood that a given substitution is deleterious. Although these tools are embedded in most clinical sequencing pipelines, these algorithms are trained on genome-wide data and used by default thresholds that may not be optimal for SCN3A variants. Their predictive performance varies widely across genes and disease contexts (16, 17). Previous studies on SCN1A and SCN2A have demonstrated that prediction accuracy can be enhanced by integrating clinical phenotypes or protein structural constraints (18, 19). However, no systematic benchmark currently exists for SCN3A, which limits the optimizing genetic diagnosis of SCN3A-related disease.

In this study, we collected a curated set of pathogenic and benign SCN3A missense variants. The subregional effects of SCN3A were analyzed to screen the association between pathogenicity and the structure of the Nav1.3. Subsequently, A total of nineteen algorithms was analyzed including accuracy, sensitivity, specificity, and area under the receiver-operating characteristic curve (AUC). By providing vertical domain performance metrics and threshold recommendations, we aim to supply a systematic benchmark and framework that improves the clinical interpretation of SCN3A variants and patient care.

2 Materials and methods

2.1 Collection of patients and variants

To systematically assess the performance of algorithms in predicting the pathogenicity of SCN3A missense variants, we first initially classified variants into two categories: pathogenic and benign variants. Pathogenic variants were curated from the PubMed and HGMD database using the query: (SCN3A OR EIEE62 OR FFEVF4 OR NAC3 OR Nav1.3) AND (variant OR mutation). Only variants supported by detailed clinical evidence and explainable origins were included (Supplementary Table S1). The benign variants were from the Genome Aggregation Database (gnomAD), which is denoted by the ClinVar database. Previously reported SCN3A variants and related phenotypes were systematically reviewed up to June 2025. All included variants met the criteria of the ACMG guidelines and were reviewed by a panel of neurogenetics experts and neurologists. These variants from real-world patients were deduplicated for further analysis.

2.2 Sub-molecular effects analysis

To investigate the sub-molecular determinants of pathogenicity, we used structural domain annotations from previous literature and mapped all curated missense variants onto the regions, including four domains with six transmembrane segments, the loops regions, and linker regions. The distribution of variants across these domains was then stratified by clinical interpretation. To assess whether specific domains/regions were enriched for pathogenic variants, we performed Fisher's exact test comparing the observed frequency of pathogenic variants within each domain against the population missense variants from gnomAD.

2.3 Algorithms prediction

Nineteen widely used and well-validated algorithms (Supplementary Table S1) were chosen for their proven effectiveness in assessing the functional consequences of missense variants. These tools included: AlphaMissense (20), BayesDel_addAF (21), BayesDel_noAF (21), ClinPred (22), ESM1b, Fathmm-XF_coding (23), LIST-S2 (24), M-CAP (25), MetaLR, MetaRNN, MetaSVM, MutationAssessor (26), MutationTaster (13, 27), Polyphen2_HDIV (28), Polyphen2_HVAR (28), PROVEAN (29), PrimateAI (30), SIFT (10), and SIFT4G (11). Scores were retrieved via precomputed database.

2.4 Performance evaluation by confusion matrices

To assess algorithms at its recommended thresholds, we built confusion matrices using a deduplicated dataset for the nineteen algorithms that provide classified variables. For binary outputs, “D” was defined as predicted pathogenic and “T” as predicted benign. Multi-class outputs were defined as follows: PolyPhen-2 (HDIV and HVAR)—“D” and “P” as pathogenic, “B” as benign; MutationTaster—“A” or “D” as pathogenic, “N” or “P” as benign; MutationAssessor—“H” or “M” as pathogenic, “N” or “L” as benign; AlphaMissense—“P” as pathogenic, “B” or “A” as benign; Fathmm-XF_coding—“D” as pathogenic, “N” as benign; PROVEA—“D” as pathogenic, “N” as benign; SCN3A missense variants were then categorized into true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN).

Based on these basic metrics, we calculated additional performance indicators, including accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Furthermore, we evaluated the F-score, a harmonic mean of precision and recall, and the Matthews correlation coefficient (MCC), a robust measure of binary classification quality. The F score was computed as 2PRP + R, where precision (P) = TPTP + FP and recall (R)= TPTP + FN. The MCC was determined using the formula: TP × TN - FP × FN(TP + FP)(TP + FN)(TN + FP)(TN + FN). This coefficient ranges from −1 to 1, with −1 representing a completely incorrect prediction, 0 indicating random performance, and 1 denoting a perfectly accurate classifier.

2.5 Statistical analysis and ROC curve

Score distributions were first compared between pathogenic and benign variants. Normality in each group was checked with the Shapiro–Wilk test; data that were normal with equal variances were analyzed with Student's t-test, while data with unequal variances were analyzed with Welch's t-test, and non-normal data with the Wilcoxon rank-sum test. A p-value < 0.05 was considered statistically significant for all comparisons.

To compare algorithms of thresholds independently, algorithms with significant differences were included in the ROC analysis. The area under the curve was calculated with the DeLong method and is reported with its 95% confidence interval and the p-value vs. 0.5. The threshold that maximized Youden's index defined the optimal operating point; at this point, we extracted the full confusion matrix and derived accuracy, sensitivity, specificity, positive predictive value, negative predictive value, Matthews correlation coefficient, and F1-score. Higher AUC values indicated better discriminatory power for distinguishing pathogenic from benign SCN3A variants. All statistical analyses and visualizations were carried out in R (version 4.5.2) using the packages dplyr, purrr, pROC, janitor, and ggplot2.

3 Results

3.1 Molecular subregional effects analysis of SCN3A variants

We systemically reviewed the location of previously reported SCN3A variants and their pathogenicity and associated phenotypes (including whether they presented with seizures or polymicrogyria) (8, 9, 3141). To date, a total of 20 pathogenic variants from 37 patients and 45 benign/likely benign SCN3A variants have been identified (Table 1), which were associated with varied pathogenicity. To analyze the molecular sub-regional effects, the variants were classified as pathogenic and benign, which were consistent with the above test set. It is shown that pathogenic SCN3A variants were statistically more located in transmembrane regions than in other regions (Figures 1A, B). Considering the phenotypic heterogeneity, we tested the association between variant location and clinical presentation. No significant correlations were observed between positional features and phenotype, including seizures or polymicrogyria (Supplementary Figure S1). Different regions or domains showed varied toleration to missense variants, which served as a useful indicator for predicting missense pathogenicity. These results suggest that structural annotations helped identify pathogenicity but provide limited value for classifying clinical subtypes. This finding warrants the need to integrate clinical covariates when annotating SCN3A variants to develop gene-specific, vertical models with greater clinical utility.

Table 1
www.frontiersin.org

Table 1. Characteristics of previously reported cases with putative disease-causing SCN3A variants.

Figure 1
Panel A showed a transmembrane protein’s structure with domains I to IV labeled DI to DIV. Pathogenic and benign variants are marked, color-coded by region: black circles for pathogenic, blank squares for benign. Panel B demonstrates that benign variants are predominantly located in non-transmembrane regions, while pathogenic variants show a significantly higher proportion located within transmembrane regions, with a p-value less than 0.0001.

Figure 1. Molecular subregional effects analysis of SCN3A variants. (A) The locations of SCN3A variants were associated with the pathogenicity of variants. (B) Patients with SCN3A variants in the transmembrane regions showed a significantly higher prevalence of pathogenic variants compared to those with variants in other regions.

3.2 Performance evaluation of algorithms tools

The predictive performance of nineteen widely used algorithms were assessed at recommended thresholds, to establish the systematic benchmark and evaluation framework for SCN3A variants. Variants were carefully curated: pathogenic variants were obtained from a trusted source and supported by a confirmed clinical diagnosis reviewed by our panel, while benign variants were retrieved from the gnomAD database, confirming their non-deleterious nature.

The nineteen algorithms that provide classified output showed differences in performance (Figure 2 and Supplementary Table S2). AlphaMissense and PrimateAI, two deep-learning models trained by structure information, achieved the most robust result. AlphaMissense achieved the highest balanced accuracy of 90.6% and a superior MCC of 0.7911, followed closely by PrimateAI with a balanced accuracy of 89.4% and a MCC of 0.7615. Both maintained a balance between sensitivity (>0.90) and specificity (>0.88), suggesting that structural information effectively minimizes false positives. Two meta-predictors, BayesDel_addAF (90.0%) and ClinPred (88.9%), were also showed high balanced accuracy, yet their performance profile differed; While both achieved a perfect sensitivity of 1.0, their lower specificities (0.80 and 0.7778) reflect a strategic bias toward “over-calling” pathogenicity to ensure no deleterious variants are missed. This trend continued in MetaRNN and BayesDel_noAF, where balanced accuracy remained above 81% but MCC values were below 0.62, indicating a loss in overall predictive reliability. Among tools relying primarily on sequence conservation and traditional machine learning methods, SIFT4G performed with a balanced accuracy of 88.06%. Polyphen2_HVAR and Polyphen2_HDIV exhibited intermediate results, with balanced accuracy of 78.6% and 77.8%, respectively, both showing sensitivities around 0.75–0.8 but higher false negatives. PROVEAN and MutationTaster showed balanced accuracy decline toward 72% due to specificities falling below 0.47. The lowest performance was observed in M-CAP, which yielded a balanced accuracy of only 59.3%, with a specificity of 0.186, offering limited utility for filtering benign variants.

Figure 2
Heatmap and bar chart displayed variant prediction models’ performance. Models are ranked by balanced accuracy, ranging from 59.3% to 90.6%. Heatmap scales showed true positive/false negative density and true negative/false positive density. Bar chart indicates variant effect (deleterious in red and tolerated in blue) with counts for each model.

Figure 2. Algorithms performance evaluation. The left heatmap showed the classification outcomes: true positives (TP) and false negatives (FN) among pathogenic variants, and true negatives (TN) and false positives (FP) among benign variants. The central column showed balanced accuracy. The balanced accuracy of each algorithm served as the primary metric to rank the algorithms. The distribution of predictions made by each algorithm at its recommended threshold, showing the proportion of variants classified as deleterious (DEL, red) vs. tolerated (TOL, blue).

3.3 Difference analysis and ROC curves for the algorithms tools

Difference analysis of scores among algorithms (including derived algorithms) showed clear separation between pathogenic and benign (control) variants (Figure 3). All comparisons were significant (p < 0.0001).

Figure 3
Scatter plots compared various scores between control and pathogenic groups, showed statistical differences with p-values less than 0.0001.

Figure 3. Difference analysis of algorithms prediction scores for SCN3A variants. For each algorithm (including AlphaMissense, BayesDel_addAF, BayesDel_noAF, ClinPred, ESM1b, Fathmm-XF_coding, LIST-S2, M-CAP, MetaLR, MetaRNN, MetaSVM, MutationAssessor, MutationTaster, Polyphen2_HDIV, Polyphen2_HVAR, PROVEAN, PrimateAI, SIFT, and SIFT4), scores are shown for benign (gray) and pathogenic (red) variants. Violin plots show how the scores are distributed; The difference between groups was tested with a Wilcoxon rank-sum test; all were highly significant (p < 0.0001).

ROC analysis further showed performance differences among algorithms (Figure 4A and Supplementary Table S3). AUCs ranged from 0.81 to 0.98 with a median of 0.95. The top-performance algorithms by ROC analysis were MetaRNN (AUC [0.982], 95% CI [0.958–1]), BayesDel_addAF (AUC [0.981], 95% CI [0.956–1]), ClinPred (AUC [0.979], 95% CI [0.95–1]), and AlphaMissense (AUC [0.968], 95% CI [0.932–1]; Figure 5) Best thresholds were calculated by Youden's J statistic; Most algorithms achieved performance improvements at the optimal threshold (Figure 4B and Supplementary Table S3). Compared with published/recommended cutoffs, majority of algorithms achieved better performance than the others at the recommended threshold. For example, AlphaMissense showed best thresholds differed at the threshold of 0.699 and improved accuracy with sensitivity of 0.9, specificity of 0.933, MCC of 0.822, and F-score of 0.878. BayesDel_addAF showed best thresholds differed at the threshold of 0.401 and improved accuracy with sensitivity of 0.9, specificity of 0.978, MCC of 0.891, and F-score of 0.923. Notably, M-CAP, MetaLR, and fathmm-XF_coding largely improved accuracy after adjusting the threshold.

Figure 4
Panel A showed a receiver operating characteristic curves comparing the top 10 predictors by AUC, indicating true positive rate vs. false positive rate. Panel B presents a bar chart of predictor thresholds and accuracies, with accuracies ranging from 78.5% to 95.4% across different predictors.

Figure 4. Receiver operating characteristic curve (ROC) performance of top ten algorithms and optimal thresholds of algorithms for SCN3A variants. (A) Each algorithm is represented by different colors, and the dashed line indicates random distribution. The area under each curve (AUC), along with its 95% confidence interval (CI), is shown, higher AUC values reflect better discrimination between pathogenic and benign variants. Full ROC curves for all tools are provided in Figure 5. (B) Bar plot showed the data-driven best threshold for each tool (top) and the accuracy at that threshold (bottom). Best thresholds were defined by maximizing Youden's J. Supplementary Table S3 listed sensitivity, specificity, balanced accuracy, F1-score, and MCC, and optimal thresholds.

Figure 5
Receiver operating characteristic curves displayed the performance of various models for differentiating true positives from false positives. Area under the curve values range from approximately 0.806 to 0.982, with 95% confidence intervals.

Figure 5. All ROC curves across nineteen algorithms. ROC curves were presented with AUC and 95% CI, including AlphaMissense, BayesDel_addAF, BayesDel_noAF, ClinPred, ESM1b, Fathmm-XF_coding, LIST-S2, M-CAP, MetaLR, MetaRNN, MetaSVM, MutationAssessor, MutationTaster, Polyphen2_HDIV, Polyphen2_HVAR, PROVEAN, PrimateAI, SIFT, and SIFT4.

4 Discussion

In this study, we collected a curated set of pathogenic and benign SCN3A missense variants. Pathogenic SCN3A variants were statistically more located in transmembrane regions than in other regions, suggesting possible subregional effects. Systematic benchmarks at their recommended thresholds were provided by building confusion matrices and comparing the balanced accuracy, sensitivity, specificity, MCC, and F1-score. AlphaMissense achieved the highest performance. Statistical analysis showed significant differences between pathogenic and benign variants across all algorithms, and ROC curves were then used to assess threshold-independent predictive performance of algorithms.

Features considered include evolutionary conservation, characteristics of the protein structure, properties of the amino acids involved, underlying nucleotide-level attributes, and patterns of variation observed in clinical and population databases. In benchmarks at recommended thresholds, AlphaMissense and PrimateAI, which include structural information, achieved high balanced accuracy. SIFT4G, driven almost exclusively by conservation scores, already reached a generally acceptable level, and BayesDel_addAF outperformed BayesDel_noAF, confirming the added value of allele-frequency data. These findings underscore that conservation and population frequency are core indicators of variant impact, while structural information can further raise predictive performance. This study warrants further attribution analysis for these tools, such as SHapley Additive exPlanations (SHAP), which quantifies the contribution of each feature group and thus informs rational feature selection and algorithm refinement.

With the recommended threshold, multiple prediction algorithms achieved high accuracy in distinguishing between pathogenic and benign SCN3A variants, including, AlphaMissense, BayesDel_addAF, PrimateAI, and ClinPred. However, the optimal tool may vary depending on the specific clinical or research. For scenarios in which sensitivity is paramount, such as initial screening or when the primary aim is to minimize false negatives, ClinPred is particularly suitable with a sensitivity of 1.00 at both thresholds, ensuring that nearly all pathogenic variants are identified. In contrast, when specificity is the primary consideration, for example, in confirmatory analyses where minimizing false positives is crucial, AlphaMissense demonstrated outstanding performance, with a specificity of 0.91 at recommend threshold. For routine clinical use, where balanced performance between sensitivity and specificity is most desirable, AlphaMissense, PrimateAI, BayesDel_addAF, and ClinPred provided high accuracy and balanced accuracy, making them strong candidates for general application. Notably, the performance metrics observed here represent a significant improvement over earlier benchmarks for sodium channels, which typically reported accuracies in the range of 0.50–0.80. This improvement reflects the development of machine learning algorithms. In addition, previous studies on SCN1A, SCN2A, and SCN8A have already demonstrated that gene-specific optimization enhanced the performance of traditional methods. Our findings on SCN3A corroborate this pattern, confirming that tailored calibration is as critical as algorithm selection for accurate variant interpretation. These findings suggest that algorithm selection should be tailored to the practical demands of different use cases in variant interpretation.

ROC analysis demonstrated that multiple algorithms, such as MetaRNN, BayesDel_addAF, ClinPred, and AlphaMissense, exhibited a favorable balance between predictive efficiency, sensitivity, and specificity. Notably, M-CAP, MetaLR, and fathmm-XF_coding demonstrated a remarkable improvement in prediction performance when gene-specific optimal thresholds were applied, differing from previous benchmarking results based on thresholds without gene stratification. These findings highlight a general improvement in predictive performance across most algorithms when gene-level thresholds are adopted. This study underscored the value of refining classification criteria at the gene level and emphasized the importance of establishing gene-specific vertical evaluation metrics.

Previous studies showed that the damaging effects of variants were associated with gene-specific sub-molecular effects (4248). Variants may lead to heterogeneous phenotypes related to domain, residue burial or depth from the protein surface, proximity to functional sites, and biophysical changes of ion channels. In this study, missense variants within the transmembrane domain were more often pathogenic than those in the non-transmembrane domain, which indicated regional heterogeneity of missense intolerance within SCN3A. Domain or structural location may serve as a predictor of pathogenicity. A key advantage of a vertical, gene-specific framework is that such sub-molecular features or other functional features may be explicitly integrated into prediction models to improve performance.

Several aspects of this study warrant consideration. First, given the inherent rarity of SCN3A-related diseases, the dataset size is relatively constrained, which may influence the precision of performance estimates. Consequently, the current benchmarking results should be viewed as a foundational framework. Second, while gene-specific thresholds demonstrated improved performance, future validation in independent, large-scale cohorts would further strengthen their generalizability and mitigate potential overfitting risks. Finally, this study contrasted clearly pathogenic variants with population controls; extending this analysis to variants with intermediate phenotypes or lower penetrance in future studies may help in characterizing algorithmic performance across the phenotypic spectrum.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author/s.

Ethics statement

The studies involving humans were approved by the Ethics Committee of The Affiliated Nanhua Hospital of the University of South China (Approval number: 2022-Ky-268). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants' legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

P-YW: Conceptualization, Formal analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. J-XZ: Formal analysis, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. W-HL: Data curation, Investigation, Methodology, Supervision, Validation, Writing – review & editing. Y-JC: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. H-WW: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research received financial support by the “4310” program of clinical medical research of the University of South China, Research on genetic diagnosis and precise treatment strategies of hereditary epilepsy (20224310NHYCG11), the Key Projects of Hunan Provincial Health and Health Commission in 2020 (20201910) titled Genetic Causes and Prevention Strategies of Early Epileptic Encephalopathy, and Innovation Projects for Postgraduates of Hunan Province in 2025(CX20251490). The funders had no role in the study design, data collection and analysis, decision to publish, or manuscript preparation.

Acknowledgments

We would like to express our gratitude to the Gen AI technology (ChatGPT 5.2 and Gemini 3 Pro) for its assistance in refining the language and enhancing the clarity of this paper.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Gen AI was used in the creation of this manuscript. Generative AI was used to assist with language editing and to improve the clarity and readability of the manuscript. It was not used for data generation, analysis, or interpretation of the scientific results. The author(s) reviewed and edited all AI-generated text and took full responsibility for the final content of the manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2026.1772239/full#supplementary-material

Supplementary Figure S1 | Molecular subregional effects analysis of SCN3A variants. (A) Schematic representation of the Nav1.3 proteins structure highlighting the positions of variants associated with developmental and epileptic encephalopathy (DEE) and neurodevelopmental disorders (NDD) without epilepsy phenotype (EP). (B) Bar chart comparing the number of variants in S4-S6 segments vs. other segments for patients with NDD with EP and DEE. No significant was shown. (C) The location of variants associated with polymicrogyria (PMG), normal, and other phenotypes. (D) Bar chart comparing the number of variants in transmembrane (TM) segments vs. non-transmembrane (non-TM) segments for patients with PMG and normal individuals. No significant was shown.

Supplementary Table S1 | Detailed description of algorithms.

Supplementary Table S2 | Performance valuation of nineteen algorithms at the recommended threshold.

Supplementary Table S3 | Performance valuation of nineteen algorithms at the best threshold.

References

1. Berg AT, Berkovic SF, Brodie MJ, Buchhalter J, Cross JH, van Emde Boas W, et al. Revised terminology and concepts for organization of seizures and epilepsies: report of the ILAE Commission on Classification and Terminology, 2005-2009. Epilepsia. (2010) 51:676–85. doi: 10.1111/j.1528-1167.2010.02522.x

PubMed Abstract | Crossref Full Text | Google Scholar

2. Borgatti R, Zucca C, Cavallini A, Ferrario M, Panzeri C, Castaldo P, et al. A novel mutation in KCNQ2 associated with BFNC, drug resistant epilepsy, and mental retardation. Neurology. (2004) 63:57–65. doi: 10.1212/01.WNL.0000132979.08394.6D

PubMed Abstract | Crossref Full Text | Google Scholar

3. Weckhuysen S, Mandelstam S, Suls A, Audenaert D, Deconinck T, Claes LR, et al. KCNQ2 encephalopathy: emerging phenotype of a neonatal epileptic encephalopathy. Ann Neurol. (2012) 71:15–25. doi: 10.1002/ana.22644

PubMed Abstract | Crossref Full Text | Google Scholar

4. Dravet C, Bureau M, Dalla Bernardina B, Guerrini R. Severe myoclonic epilepsy in infancy (Dravet syndrome) 30 years later. Epilepsia (2011) 52:1–2. doi: 10.1111/j.1528-1167.2011.02993.x

PubMed Abstract | Crossref Full Text | Google Scholar

5. Wolff M, Johannesen KM, Hedrich UBS, Masnada S, Rubboli G, Gardella E, et al. Genetic and phenotypic heterogeneity suggest therapeutic implications in SCN2A-related disorders. Brain (2017) 140:1316–1336. doi: 10.1093/brain/awx054

PubMed Abstract | Crossref Full Text | Google Scholar

6. Davila-Velderrain J, van Giesen L. Voltage-gated ion channel diversity underlies neuronal excitability and nervous system evolution. Nat Commun. (2025). doi: 10.1101/2025.02.15.638427

PubMed Abstract | Crossref Full Text | Google Scholar

7. Vanoye CG, Gurnett CA, Holland KD, George AL Jr, Kearney, JA. Novel SCN3A variants associated with focal epilepsy in children. Neurobiol Dis. (2014) 62:313–22. doi: 10.1016/j.nbd.2013.10.015

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zaman T, Helbig I, BoŽović IB, DeBrosse SD, Bergqvist AC, Wallis K, et al. Mutations in SCN3A cause early infantile epileptic encephalopathy. Ann Neurol. (2018) 83:703–717. doi: 10.1002/ana.25188

PubMed Abstract | Crossref Full Text | Google Scholar

9. Smith RS, Kenny CJ, Ganesh V, Jang A, Borges-Monroy R, Partlow JN, et al. Sodium Channel SCN3A (Na(V)1.3) Regulation of human cerebral cortical folding and oral motor development. Neuron (2018) 99:905–13.e7. doi: 10.1016/j.neuron.2018.07.052

Crossref Full Text | Google Scholar

10. Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. (2009) 4:1073–81. doi: 10.1038/nprot.2009.86

PubMed Abstract | Crossref Full Text | Google Scholar

11. Vaser R, Adusumalli S, Leng SN, Sikic M, Ng PC. SIFT missense predictions for genomes. Nat Protoc. (2016) 11:1–9. doi: 10.1038/nprot.2015.123

PubMed Abstract | Crossref Full Text | Google Scholar

12. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat Methods. (2010) 7:248–9. doi: 10.1038/nmeth0410-248

PubMed Abstract | Crossref Full Text | Google Scholar

13. Schwarz JM, Rödelsperger C, Schuelke M, Seelow D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods. (2010) 7:575–6. doi: 10.1038/nmeth0810-575

PubMed Abstract | Crossref Full Text | Google Scholar

14. Schubach M, Maass T, Nazaretyan L, Röner S, Kircher M. CADD v1.7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions. Nucleic Acids Res. (2024) 52:D1143–54. doi: 10.1093/nar/gkad989

Crossref Full Text | Google Scholar

15. Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am J Hum Genet. (2016) 99:877–85. doi: 10.1016/j.ajhg.2016.08.016

PubMed Abstract | Crossref Full Text | Google Scholar

16. Tejura M, Fayer S, McEwen AE, Flynn J, Starita LM, Fowler DM. Calibration of variant effect predictors on genome-wide data masks heterogeneous performance across genes. Am J Hum Genet. (2024) 111:2031–43. doi: 10.1016/j.ajhg.2024.07.018

PubMed Abstract | Crossref Full Text | Google Scholar

17. Gu YJ, Wang PY, Fu QQ, Lai JH, Chen X, Liu XH, et al. Epilepsy-associated CHD2 missense variants and optimization strategies for genetic diagnosis: a comparative analysis of algorithms. Front Neurol. (2025) 16:1729387. doi: 10.3389/fneur.2025.1729387

PubMed Abstract | Crossref Full Text | Google Scholar

18. Tang B, Li B, Gao LD, He N, Liu XR, Long YS, et al. Optimization of in silico tools for predicting genetic variants: individualizing for genes with molecular sub-regional stratification. Brief Bioinform (2020) 21:1776–86. doi: 10.1093/bib/bbz115

PubMed Abstract | Crossref Full Text | Google Scholar

19. Brunklaus A, Pérez-Palma E, Ghanty I, Xinge J, Brilstra E, Ceulemans B, et al. Development and validation of a prediction model for early diagnosis of SCN1A-related epilepsies. Neurology (2022) 98:e1163–74. doi: 10.1212/WNL.0000000000200028

PubMed Abstract | Crossref Full Text | Google Scholar

20. Cheng J, Novati G, Pan J, Bycroft C, Žemgulyte A, Applebaum T, et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science (2023) 381:eadg7492. doi: 10.1126/science.adg7492

PubMed Abstract | Crossref Full Text | Google Scholar

21. Feng BJ. PERCH: a unified framework for disease gene prioritization. Hum Mutat. (2017) 38:243–51. doi: 10.1002/humu.23158

PubMed Abstract | Crossref Full Text | Google Scholar

22. Alirezaie N, Kernohan KD, Hartley T, Majewski J, Hocking TD. ClinPred: prediction tool to identify disease-relevant nonsynonymous single-nucleotide variants. Am J Hum Genet. (2018) 103:474–83. doi: 10.1016/j.ajhg.2018.08.005

PubMed Abstract | Crossref Full Text | Google Scholar

23. Shihab HA, Gough J, Cooper DN, Stenson PD, Barker GL, Edwards KJ, et al. Predicting the functional, molecular, and phenotypic consequences of amino acid substitutions using hidden Markov models. Hum Mutat. (2013) 34:57–65. doi: 10.1002/humu.22225

PubMed Abstract | Crossref Full Text | Google Scholar

24. Malhis N, Jacobson M, Jones SJM, Gsponer J. LIST-S2: taxonomy based sorting of deleterious missense mutations across species. Nucleic Acids Res. (2020) 48:W154–61. doi: 10.1093/nar/gkaa288

PubMed Abstract | Crossref Full Text | Google Scholar

25. Jagadeesh KA, Wenger AM, Berger MJ, Guturu H, Stenson PD, Cooper DN, et al. M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity. Nat Genet. (2016) 48:1581–6. doi: 10.1038/ng.3703

PubMed Abstract | Crossref Full Text | Google Scholar

26. Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. (2011) 39:e118. doi: 10.1093/nar/gkr407

PubMed Abstract | Crossref Full Text | Google Scholar

27. Steinhaus R, Proft S, Schuelke M, Cooper DN, Schwarz JM, Seelow D. MutationTaster2021. Nucleic Acids Res. (2021) 49:W446–51. doi: 10.1093/nar/gkab266

PubMed Abstract | Crossref Full Text | Google Scholar

28. Adzhubei I, Jordan DM, Sunyaev SR. Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet. (2013) 76:7–20. doi: 10.1002/0471142905.hg0720s76

PubMed Abstract | Crossref Full Text | Google Scholar

29. Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS ONE (2012) 7:e46688. doi: 10.1371/journal.pone.0046688

PubMed Abstract | Crossref Full Text | Google Scholar

30. Sundaram L, Gao H, Padigepati SR, McRae JF, Li Y, Kosmicki JA, et al. Predicting the clinical impact of human mutation with deep neural networks. Nat Genet. (2018) 50:1161–70. doi: 10.1038/s41588-018-0167-z

PubMed Abstract | Crossref Full Text | Google Scholar

31. Matthews AM, Blydt-Hansen I, Al-Jabri B, Andersen J, Tarailo-Graovac M, Price M, et al. Atypical cerebral palsy: genomics analysis enables precision medicine. Genet Med. (2019) 21:1621–8. doi: 10.1038/s41436-018-0376-y

PubMed Abstract | Crossref Full Text | Google Scholar

32. Sun D, Liu Y, Cai W, Ma J, Ni K, Chen M, et al. Detection of disease-causing SNVs/indels and CNVs in single test based on whole exome sequencing: a retrospective case study in epileptic encephalopathies. Front Pediatr. (2021) 9:635703. doi: 10.3389/fped.2021.635703

PubMed Abstract | Crossref Full Text | Google Scholar

33. Lamar T, Vanoye CG, Calhoun J, Wong JC, Dutton SBB, Jorge BS, et al. SCN3A deficiency associated with increased seizure susceptibility. Neurobiol Dis. (2017) 102:38–48. doi: 10.1016/j.nbd.2017.02.006

PubMed Abstract | Crossref Full Text | Google Scholar

34. Zou D, Wang L, Liao J, Xiao H, Duan J, Zhang T, et al. Genome sequencing of 320 Chinese children with epilepsy: a clinical and molecular study. Brain. (2021) 144:3623–34. doi: 10.1093/brain/awab233

PubMed Abstract | Crossref Full Text | Google Scholar

35. van der Ven AT, Johannsen J, Kortüm F, Wagner M, Tsiakas K, Bierhals T, et al. Prevalence and clinical prediction of mitochondrial disorders in a large neuropediatric cohort. Clin Genet. (2021) 100:766–70. doi: 10.1111/cge.14061

PubMed Abstract | Crossref Full Text | Google Scholar

36. Zaman T, Helbig KL, Clatot J, Thompson CH, Kang SK, Stouffs K, et al. SCN3A-related neurodevelopmental disorder: a spectrum of epilepsy and brain malformation. Ann Neurol. (2020) 88:348–62. doi: 10.1002/ana.25809

PubMed Abstract | Crossref Full Text | Google Scholar

37. Yao R, Zhou Y, Tang J, Li N, Yu T, He Y, et al. Genetic diagnosis spectrum and multigenic burden of exome-level rare variants in a childhood epilepsy cohort. Front Genet. (2021) 12:782419. doi: 10.3389/fgene.2021.782419

PubMed Abstract | Crossref Full Text | Google Scholar

38. Trujillano D, Bertoli-Avella AM, Kumar Kandaswamy K, Weiss ME, Köster J, Marais A, et al. Clinical exome sequencing: results from 2819 samples reflecting 1000 families. Eur J Hum Genet. (2017) 25:176–82. doi: 10.1038/ejhg.2016.146

PubMed Abstract | Crossref Full Text | Google Scholar

39. Ziats MN, Ahmad A, Bernat JA, Fisher R, Glassford M, Hannibal MC, et al. Genotype-phenotype analysis of 523 patients by genetics evaluation and clinical exome sequencing. Pediatr Res. (2020) 87:735–39. doi: 10.1038/s41390-019-0611-5

PubMed Abstract | Crossref Full Text | Google Scholar

40. Ma H, Guo Y, Chen Z, Wang L, Tang Z, Zhang J, et al. Mutations in the sodium channel genes SCN1A, SCN3A, and SCN9A in children with epilepsy with febrile seizures plus(EFS+). Seizure. (2021) 88:146–52. doi: 10.1016/j.seizure.2021.04.006

PubMed Abstract | Crossref Full Text | Google Scholar

41. Inuzuka LM, Macedo-Souza LI, Della-Ripa B, Cabral KSS, Monteiro F, Kitajima JP, et al. Neurodevelopmental disorder associated with de novo SCN3A pathogenic variants: two new cases and review of the literature. Brain Dev. (2020) 42:211–6. doi: 10.1016/j.braindev.2019.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

42. He YY, Luo S, Jin L, Wang PY, Xu J, Jiao HL, et al. DLG3 variants caused X-linked epilepsy with/without neurodevelopmental disorders and the genotype-phenotype correlation. Front Mol Neurosci. (2023) 16:1290919. doi: 10.3389/fnmol.2023.1290919

PubMed Abstract | Crossref Full Text | Google Scholar

43. Luo S, Ye XG, Jin L, Li H, He YY, Guan BZ, et al. SZT2 variants associated with partial epilepsy or epileptic encephalopathy and the genotype-phenotype correlation. Front Mol Neurosci. (2023) 16:1162408. doi: 10.3389/fnmol.2023.1162408

PubMed Abstract | Crossref Full Text | Google Scholar

44. He MF, Liu LH, Luo S, Wang J, Guo JJ, Wang PY, et al. ZFHX3 variants cause childhood partial epilepsy and infantile spasms with favourable outcomes. J Med Genet. (2024) 61:652–60. doi: 10.1136/jmg-2023-109725

PubMed Abstract | Crossref Full Text | Google Scholar

45. He YL, Ye YC, Wang PY, Liang XY, Gu YJ, Zhang SQ, et al. CCDC22 variants caused X-linked focal epilepsy and focal cortical dysplasia. Seizure. (2024) 123:1–8. doi: 10.1016/j.seizure.2024.10.007

PubMed Abstract | Crossref Full Text | Google Scholar

46. Luo S, Wang PY, Zhou P, Zhang WJ, Gu YJ, Liang XY, et al. Variants in EP400, encoding a chromatin remodeler, cause epilepsy with neurodevelopmental disorders. Am J. Hum Genet. (2025) 112:87–105. doi: 10.1016/j.ajhg.2024.11.010

PubMed Abstract | Crossref Full Text | Google Scholar

47. Li X, Wang L, Liang XY, Zhang HW, Shi JG, Guo J, et al. Variants in CSMD2 and CSMD3, genes involved in synaptogenesis, are associated with epilepsies. Epilepsia. (2025) 66:4394–410. doi: 10.1111/epi.18539

PubMed Abstract | Crossref Full Text | Google Scholar

48. Luo S, Zhang WJ, Jiang M, Ren RN, Liu L, Li YL, et al. De novo TANC2 variants caused developmental and epileptic encephalopathy and epilepsy. Epilepsia. (2025) 66:2365–78. doi: 10.1111/epi.18358

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: algorithms, DEE, developmental and/or epileptic encephalopathies, ion channel, molecular subregional effects, optimized diagnostics, SCN3A

Citation: Wang P-Y, Zhao J-X, Liu W-H, Chen Y-J and Wang H-W (2026) Toward precision medicine in SCN3A variants-associated encephalopathies and epilepsy: optimizing genetic diagnosis and molecular subregional effects. Front. Neurol. 17:1772239. doi: 10.3389/fneur.2026.1772239

Received: 20 December 2025; Revised: 09 January 2026;
Accepted: 16 January 2026; Published: 05 February 2026.

Edited by:

Miriam Kessi, Central South University, China

Reviewed by:

Song Su, Children's Hospital Affiliated to Shandong University, China
Siyi Wang, Capital Medical University, China

Copyright © 2026 Wang, Zhao, Liu, Chen and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hong-Wei Wang, d2FuZ3JlZGd1YXJkQGZveG1haWwuY29t; Yong-Jun Chen, Y2hlbnlqLXVzY0Bmb3htYWlsLmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.