ORIGINAL RESEARCH article

Front. Endocrinol., 11 April 2025

Sec. Pediatric Endocrinology

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1560698

Genetic variants and molecular profiling of 46,XY gonadal dysgenesis using whole-exome sequencing

Ning Zhang,,&#x;Ning Zhang1,2,3†Shuoming You,&#x;Shuoming You1,2†Jingjing Guo,&#x;Jingjing Guo1,2†Xingyu Chang,Xingyu Chang1,2Junjun Qiu,*&#x;Junjun Qiu1,2*‡Keqin Hua,*&#x;Keqin Hua1,2*‡
  • 1Department of Gynecology, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
  • 2Shanghai Key Laboratory of Female Reproductive Endocrine-Related Diseases, Shanghai, China
  • 3Department of Gynecology, Obstetrics and Gynecology Hospital of Fudan University, Yangtze River Delta Integration Demonstration Zone (QingPu), Shanghai, China

Background: More than 60% of cases of 46,XY gonadal dysgenesis (GD), a condition classified as a disorder of sex development (DSD), remain unexplained, which is due to high genetic and clinical heterogeneity. Whole-exome sequencing (WES) is an efficient primary genetic diagnostic method; specifically, the use of WES in patients with 46,XY GD to explore the underlying genetic variants of the disorder may help us gain a deeper understanding of the pathogenesis and phenotype–genotype correlation of 46,XY GD.

Methods: We performed WES and pedigree studies to investigate the underlying genetic etiology of patients with 46,XY GD (six patients and six familial controls). The variants were confirmed via Sanger sequencing, and detailed functional prediction of the discovered genetic variants was conducted. Furthermore, we performed in-silico protein structural analysis and protein thermodynamic analysis to further explore the pathogenicity of these variants. GATA4 variants in patients with 46,XY GD with/without CHD and patients with cardiac disease alone were also analyzed.

Results: We identified three novel pathogenic variants in GATA4:c.725G>C(p.Cys242Ser), NR5A1:c.370_380del(p.Pro124Glyfs*21), and DHX37:c.2020C>T(p.Arg674Trp), as well as one previously reported MAP3K1:c.1016G>A(p.Arg339Gln) variant. These variant sites are conserved among species and were predicted to be damaging according to functional algorithms and protein analyses. Additionally, 71.4% of the GATA4 amino acid changes in 46,XY GD were located in or close to the N-terminal zinc finger (N-ZF) domain. However, most GATA4 pathogenic variants (31.82%) in patients with isolated cardiac diseases were located in transactivation domain 1 (TAD1), and only 9.09% of the variants were located in the N-ZF domain.

Conclusion: The N-ZF domain may play an exclusive role in the mechanism of GATA4 in the pathogenesis of 46,XY GD; therefore, this domain may be an interesting topic for future investigation. This study enhances our understanding of the genetic etiology and pathogenesis of 46,XY GD, which may aid in the diagnosis and intervention of this disorder.

1 Introduction

46,XY gonadal dysgenesis (GD) is a congenital gonadal development disorder in individuals with a 46,XY karyotype that belongs to the spectrum of disorders/differences of sex development (DSD) (1). GD can be classified as either complete gonadal dysgenesis (CGD, also known as Swyer syndrome) or partial gonadal dysgenesis (PGD) (2). The characteristics of 46,XY CGD include female external genitalia, a well-developed Müllerian structure, and streak gonads without testicular differentiation, whereas 46,XY PGD is characterized by varying degrees of Wolffian (seminal vesicle, epididymis, and vas deferens) and Müllerian (uterus, fallopian tube, and upper third of the vagina) duct development, as well as virilization of external genitalia. The prevalence of 46,XY GD is approximately one per 8,000 births (3). An important characteristic of 46,XY GD is an increased lifetime risk of gonadal tumors, ranging from 15% to 35% (4). Early diagnosis is crucial to identify possible gonadal malignancies and excise ectopic gonads (if necessary), as well as to enable timely hormone treatments, obtain adequate peak bone mass, and provide timely psychosexual consultations. Genetic variants are the most important and common cause of 46,XY GD (5). An understanding of the genetic etiology of 46,XY GD is beneficial for pathogenesis studies, genetic counseling, clinical diagnosis, and treatment. Variants in several genes (including GATA4, SRY, NR5A1, WNT4, and MAP3K1, among others) are related to the pathogenesis of 46,XY GD (1). However, these variants explain only <40% of cases (6, 7).

Recently, whole-exome sequencing (WES) has become an important method for identifying new genes and loci related to genetic disease; currently, this method is a commonly used approach that exhibits a diagnostic rate of 35% (8). Therefore, we recruited six patients with 46,XY GD for this study. WES analysis was performed on these patients and their family members to explore genetic variants and phenotype–genotype correlations. Consequently, we discovered three novel pathogenic variants in GATA4, NR5A1, and DHX37 and one previously reported MAP3K1 variant in patients with 46,XY GD. The pathogenicity of these variants was demonstrated through in-silico studies. Combined with data from the published literature, the importance of the N-terminal zinc finger (N-ZF) domain of GATA4 in 46,XY GD was highlighted. Our research expands on the knowledge of the genetic mechanism underlying 46,XY GD.

2 Materials and methods

2.1 Patients

This study was approved by the Institutional Review Board Committee of the Obstetrics and Gynecology Hospital (2021–223), Fudan University (FE21192), and written informed consent was obtained from the participants and their parents. We recruited six patients who were diagnosed with 46,XY GD at the Obstetrics and Gynecology Hospital of Fudan University. Except for patient 3, blood samples and clinical data were collected from patients and their immediate family members (with the family members serving as six familial controls). Notably, patients 1 and 2, as well as patients 5 and 6, were identified as sibling sisters. The detailed clinical, laboratory, and molecular characteristics of the patients and their families were recorded.

2.2 Whole-exome sequencing

Genomic DNA was extracted from peripheral venous blood using a Blood Genome Column Medium Extraction Kit (9). The xGen Exome Research Panel v1.0 was used to analyze the genomic DNA and identify the exonic regions and flanking splice junctions of the genome (IDT, Coralville, IA, United States). The libraries were subsequently sequenced on a BGI T7 sequencer using PE150, with a minimum of 11 million reads. The paired-end reads were compared against the Ensembl GRCh37/hg19 human reference genome with the Burrows-Wheeler Aligner (BWA). Both single-nucleotide variants (SNVs) and small insertions/deletions (InDels) were identified using the Genomic Analysis Toolkit (GATK) program (10). The Exome Depth algorithm served as the basis for copy number variant (CNV) calling. The total read counts were mapped to each exon in the same batch as previously described (11). Variants were annotated using a Chigene online system that references 35 public databases. We categorized the candidate variants according to international standards established by the Sequence Variant Interpretation Working Group and the American College of Medical Genetics and Genomics (ACMG) (12).

2.3 Functional prediction of genetic results

The pathogenic effects of the identified variants were predicted in silico using four software algorithms: SIFT (http://provean.jcvi.org/index.php) (13), PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/) (14), MutationTaster (http://www.mutationtaster.org/) (15), and REVEL (https://sites.google.com/site/revelgenomics/) (16).

2.4 In silico protein structure analysis and protein thermodynamic analysis

In-silico protein structures were predicted for the NR5A1, DHX37, MAP3K1, and GATA4 variants. First, AlphaFold2 (17) was used to predict the three-dimensional structures of the wild-type proteins encoded by these genes. Based on the predicted wild-type protein structures, we used SWISS-MODEL2 (18) to predict the structures of the variant proteins. The resulting crystal structures were visualized and annotated using PyMOL (http://www.pymol.org). LigPlot (19) was used to explore the hydrogen bond functions of the proteins at the two-dimensional level with and without alterations. Molecular docking analysis of the Zn atom in GATA4 was supplemented with AlphaFill (20). Kyte and Doolittle hydropathy plots were used to investigate the hydrophobicity of the protein region (https://fasta.bioch.virginia.edu/fasta_www2/fasta_www.cgi?rm=misc1&pgm=pkd).

Amino acid conservation between the different species was assessed using UniProt (https://www.uniprot.org), and disruption of the folding energy (ΔΔG) caused by missense variants in the domains was modeled using DUET (21), SDM (22), mCSM (23), and DeepDDG (24).

3 Results

3.1 Clinical and hormonal characteristics of patients with 46,XY GD and their families

Six patients diagnosed with 46,XY GD were included in this study. Patients 1 and 2 were diagnosed with PGD, whereas patients 3, 4, 5, and 6 were diagnosed with CGD. Notably, patients 1 and 2 were siblings, as were patients 5 and 6. The average age at diagnosis was 13.2 years. Basal hormonal evaluation results were available for all of the patients. Specifically, all of the patients exhibited increased gonadotropin levels, whereas patient 1 also had increased testosterone levels, which was consistent with the diagnosis of PGD. Patient 2 (the sister of patient 1) had lower testosterone levels, which may have been related to her young age and the fact that the gonads had not yet developed. None of the patients exhibited symptoms or signs of adrenal insufficiency, and normal basal adrenal function was observed in all of the patients. Moreover, all of the patients (except for patient 2) underwent bilateral gonadectomy; additionally, bilateral gonadoblastoma was observed in patient 3, who underwent three cycles of bleomycin–etoposide–cisplatin (BEP) chemotherapy. Notably, patients 1 and 2 also had congenital heart disease (CHD); specifically, patient 1 had a ventricular septal defect and a congenital diaphragm hernia, whereas patient 2 was diagnosed with a ventricular septal defect, an atrial septal defect, pulmonary arterial hypertension, and heart failure. The clinical features of these patients are presented in Table 1.

Table 1
www.frontiersin.org

Table 1. Phenotypic descriptions, previous clinical findings, and genetic findings of the six patients.

3.2 WES analysis revealed four novel variants and one previously reported variant in five genes

We performed WES analysis on the patients and their family members (blood samples were available from the family members of all the patients except for patient 3) and identified four novel heterozygous variants and one previously reported heterozygous variant in the following five genes: GATA4, LHCGR, MAP3K1, DHX37, and NR5A1 (detailed in Table 2). Variants of GATA4, MAP3K1, and DHX37 were identified via Sanger sequencing (Supplementary Figure S1). Among these variants, the c.370_380del variant in NR5A1 was a frameshift mutation, the c.52_c.53insTGC variant in LHCGR caused lysine duplication, and the remaining three variants were missense variants. None of the variants exhibited a minor allele frequency (MAF) (Table 2) in the dbSNP, 1000 Genomes, and gnomAD databases. Moreover, three variants exhibited a sex-specific, dominant inheritance pattern, which is consistent with the published literature (25). The family pedigrees are shown in Figure 1.

Table 2
www.frontiersin.org

Table 2. Variants identified in patients with 46,XY GD and their frequencies in population databases.

Figure 1
www.frontiersin.org

Figure 1. Family pedigrees associated with 46,XY GD. Genotypes are indicated on the chart. Squares and circles represent male and female family members, respectively. The black circles in the squares represent affected patients with 46,XY GD, who were raised as female. The symbols containing black dots represent heterozygous carriers.

3.3 In silico functional analysis confirmed the pathogenicity of the missense variants

To obtain additional information about the possible functional consequences of the identified variants, we conducted in-silico conformational analysis. We analyzed the missense variants in the GATA4, MAP3K1, DHX37, and LHCGR genes via SIFT, PolyPhen-2, MutationTaster, and REVEL. Among the identified variants, the GATA4, MAP3K1, and DHX37 variants were predicted to be damaging via all four algorithms, whereas the LHCGR variant was predicted to be polymorphic via MutationTaster (Table 3). According to the ACMG criteria (26), the NR5A1 variant was classified as pathogenic, whereas the GATA4, MAP3K1, and DHX37 variants were classified as variants of uncertain significance.

Table 3
www.frontiersin.org

Table 3. In-silico prediction analysis of variants identified in patients with 46,XY GD.

3.4 Protein conformational changes caused by the variants

To obtain additional information about the possible functional consequences of the identified variants, we conducted in-silico conformational analysis on the variants according to at least one of the following conditions: 1) predicted to be pathogenic according to the ACMG criteria or 2) predicted to be damaging or disease-causing by three out of four algorithms (SIFT, PolyPhen-2, MutationTaster, and REVEL). The predicted 3D protein structures are shown in Figure 2.

Figure 2
www.frontiersin.org

Figure 2. In-silico 3D protein structural prediction of GATA4, MAP3K1, DHX37, and NR5A1 variants using AlphaFold2 and SWISS-MODEL2. The amino acid residue is marked in orange, and interacting residues are marked in green. Hydrogen bonds are depicted as yellow dashed lines, with the bond length values shown in (E–H). (A) Close-up view of wild-type Cys242 in GATA4. Cys242 is predicted to form a 3.1-Å hydrogen bond with His246 and a 3.2-Å hydrogen bond with Cys239. (B) The Ser242 amino acid change is predicted to influence the original hydrogen bond with Cys239 and form a new hydrogen bond with Gly243. (C) Close-up view of wild-type Arg339 in MAP3K1. This residue is predicted to form two 2.9-Å hydrogen bonds with Ser348 and one 3.3-Å hydrogen bond with Gly1423. (D) In the MAP3K1 amino acid change, the previously mentioned hydrogen bond with Ser348 and Gly1423 was changed. Moreover, a new hydrogen bond was formed with Arg1406. (E) Close-up view of wild-type Arg674 in DHX37 and its interaction with peripheral domains. The residue of interest is depicted as a green sphere. (F) The Trp674 amino acid change is depicted as a red sphere. No evident interactional or structural changes were observed before or after the alteration. (G) Wild-type protein structure of NR5A1. (H) Protein structure of NR5A1:p.Pro124Glyfs*21. The alteration causes the loss of a large fragment of the protein. (I) Multiple alignments of parts of the GATA4, MAP3K1, and DHX37 protein sequences across various species. The locations of the newly identified human amino acid change are shown as dotted red rectangles with inverted red triangles at the top. Kyte and Doolittle hydropathy plots of the GATA4 protein (J) before and (K) after the p.Cys242Ser alteration. A score >0 indicates hydrophobicity, and a score <0 indicates hydrophilicity. Lower positive values indicate greater hydrophilicity. The alteration caused increased hydrophilicity in the region between codons 230 and 250, as outlined by the red boxes.

In wild-type GATA4, the cysteine at position 242 was predicted to form a 3.1-Å hydrogen bond with the histidine at position 246 and a 3.2-Å bond with the cysteine at position 239 (Figure 2A). In patient 1, cysteine 242 was replaced by serine, which was predicted to form a new dihydrogen bond structure with Cys 239 (3.1 Å and 3.2 Å) by 3D and 2D modeling (Figure 2B; Supplementary Figures S2A, B). Due to the fact that serine is a hydrophilic polar amino acid, we also investigated the hydrophilicity of the protein region via Kyte and Doolittle hydropathy plots. We observed that the p.Cys242Ser amino acid change caused increased hydrophilicity in the region between codons 230 and 250 (Figures 2J, K). Furthermore, the ΔΔG results confirmed the pathogenicity of the amino acid change (Supplementary Table S1).

In wild-type MAP3K1, the arginine at position 339 was predicted to form two 2.9-Å hydrogen bonds with the serine at position 348 and one 3.3-Å hydrogen bond with the glycine at position 1423 (Figure 2C). In patient 3, the arginine residue was replaced with glutamine, which was predicted to interfere with the dihydrogen bond structure with serine 348 (3.0 Å and 2.8 Å), thereby causing a loss of the hydrogen bond with glycine 1423 and the formation of a new hydrogen bond with glutamine 1406 (3.2 Å) (Figure 2D). Combined with the 2D modeling (Supplementary Figures S2C, D) and ΔΔG results (Supplementary Table S1), these findings indicated possibly damaging structural or functional consequences of this amino acid change.

Compared with the wild-type protein, the DHX37 amino acid change, which results in tryptophan being located at position 674, was not predicted to cause any evident structural or functional changes in the 3D structure (Figures 2E, F). However, 2D modeling predicted that this change resulted in the loss of dihydrogen bonds with residues 670 (2.74 Å and 3.35 Å) and 277 (2.69 Å and 3.11 Å), as well as a loss of a hydrogen bond with residue 436 (2.92 Å) (Supplementary Figures S2E, F). The loss of these hydrogen bonds could cause conformational instability in the local domain, which was supported by the ΔΔG results (Supplementary Table S1).

The 3D structural change caused by the in-frame deletion from position 370 to position 380 in steroidogenic factor-1 (SF-1) is shown in Figure 2H. This 10-residue deletion caused codon 144 in the mRNA to become a stop codon, which resulted in the loss of a large fragment of the protein and most likely disrupted protein function.

3.5 Decreased Zn-binding efficiency due to GATA4 alterations was predicted and may play an important role in patients with 46,XY GD

As a Zn transcription factor, the Zn-binding domain of GATA4 is essential for its role in both types of gonadal development. Therefore, we performed a molecular docking analysis of Zn with the GATA4 amino acid changes of patients 1 and 2. We discovered that the GATA4 amino acid change site was located in the Zn-binding zone. This amino acid change decreased the number of Zn-binding sites within the protein domain, which reduced the strength and efficiency of binding and impacted normal biochemical function (Figure 3). This decreased Zn-binding efficiency was also indicated via 2D structure modeling (Supplementary Figure S2B). To further explore the role of different domains of GATA4 in 46,XY GD and CHD, we summarized the locations of GATA4 variants in patients diagnosed with 46,XY GD with/without CHD from the published literature. We found that 71.4% of the GATA4 variants in 46,XY GD were located in/close to the N-ZF domain (Table 4, Figure 4A). Additionally, we analyzed the data of GATA4 variants associated with isolated cardiac diseases from ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/) and determined that only 9.09% of pathogenic/likely pathogenic variants were found to be located in the N-ZF domain; moreover, most of the variants were located in transactivation domain 1 (TAD1, 31.82%), followed by nuclear localization signal (NLS, 22.73%), transactivation domain 2 (TAD2, 13.64%), no conserved domain (13.64%), and N-ZF (9.09%) (Figure 4B). These results revealed that different domains of GATA4 play different roles and the N-ZF domain of GATA4 plays an important role for 46,XY GD.

Figure 3
www.frontiersin.org

Figure 3. Molecular docking analysis of Zn atoms in wild-type and mutant GATA4 using AlphaFill. (A) Structure of WT GATA4 showing the Zn coordination and binding of DNA. Two Zn atoms are indicated by black arrows. (B) The p.Cys242Ser amino acid change was predicted to disrupt Zn binding and protein stability. (C) Close-up image of the Zn-binding site in wild-type GATA4. The Zn-binding site is marked by a red circle and an arrow. (D) Close-up of the p.Cys242Ser amino acid change showing the reduced Zn-binding site with the protein domain (shown by a red circle and arrow). The alteration was predicted to decrease the binding efficacy, which would affect the normal biochemical function of the protein.

Table 4
www.frontiersin.org

Table 4. GATA4 variants in 46,XY GD patients with/without CHD.

Figure 4
www.frontiersin.org

Figure 4. Analysis of the examined GATA4 amino acid change. (A) Schemes of the structure of the GATA4 protein and location of GATA4 amino acid changes in patients with 46,XY GD, as well as other patients with 46,XY GD with/without CHD. GATA4 contains two distinct zinc finger domains (N-ZF and C-ZF) and a C-terminal nuclear localization sequence (NLS), which consists of a DNA-binding domain and a protein–protein interaction domain. Transcriptional activation domains (TADs) are located in the N-terminus. Most GATA4 amino acid changes in patients with 46,XY GD with/without CHD were concentrated in the N-ZF domain. (B) Analysis of the germline classification of GATA4 variants in patients with cardiac diseases alone and the location of pathogenic/likely pathogenic GATA4 variants, as shown in the data from ClinVar.

4 Discussion

An understanding of the genetic etiology of 46,XY GD is beneficial for pathogenesis studies, genetic counseling, clinical diagnosis, and treatment. However, the genetic etiology for most patients with 46,XY GD, as well as the exact genetic mechanism and phenotype–genotype correlation, remains unknown. In this study, we identified three novel pathogenic variants, namely, GATA4, NR5A1, and DHX37 variants, as well as one previously reported MAP3K1 variant, in six patients with 46,XY GD by WES. All of these variants were predicted to be damaging using both functional algorithms and protein modeling. To further explore the GATA4 variants involved in 46,XY GD with/without CHD and their possible phenotype–genotype correlation, we performed molecular docking analysis of Zn in the GATA4 variants. Furthermore, we summarized the locations of GATA4 variants in patients with 46,XY GD with/without CHD and in patients with cardiac disease alone.

GATA4 is expressed in the somatic cells of both mice and humans. This gene has been demonstrated to play important roles in transcriptional activity (31). GATA4 variants can interfere with the transcriptional activation of gonadal promoters such as NR5A1, thereby potentially affecting Sertoli cell development in the testis and causing 46,XY DSD (32). In our study, patients 1 and 2 [who were sisters bearing the novel pathogenic GATA4 variant known as the c.725G>C(p.Cys242Ser) variant] were both diagnosed with 46,XY PGD. This novel missense c.725G>C variant in GATA4 has been shown to occur at a site that is highly conserved among species.GATA4 proteins have several conserved domains, including an N-ZF domain, a C-terminal zinc finger domain (C-ZF), TAD1, TAD2, and NLS. Our GATA4 variant was observed to be located in the N-ZF domain. Notably, 71.4% of the GATA4 variants in 46,XY GD were located in or close to the N-ZF domain, which strengthens the importance of this area in sex development. This domain plays an essential role in protein stability and is necessary for the protein to interact with other transcription factors (27). In our study, the in-silico analysis revealed that the p.Cys242Ser amino acid change of GATA4 in the N-ZF domain decreased the Zn-binding efficacy. This change may affect interactions with important factors involved in gonadal development, such as NR5A1, FOG2, and AMH, thereby leading to the 46,XY GD phenotype (33). Based on these findings, we suggest the implementation of clinical recommendations to prioritize screening for variants in the N-ZF domain of the GATA4 among individuals with 46,XY GD.

GATA4 is not only important for gonadal development but also well known for its role in cardiac development. GATA4 variants can disrupt the formation of the cardiac tube, separation and development of the outflow tract, formation of the atrioventricular canal, and development of the cardiac conduction system, which can contribute to CHD (34). Intriguingly, patients 1 and 2 with the GATA4 variant also demonstrated co-occurrence with CHD. Similarly, Martinez de LaPiscina et al. (27) reported a p.Cys238Arg variant in GATA4 in a patient with 46,XY GD with complex CHD, which was also located in the N-ZF domain. In humans, more than 120 GATA4 variants have been found to be associated with cardiac disease, whereas only 14 variants have been observed to be related to 46,XY DSD (27), thereby suggesting that GATA4 variants are more likely to result in CHD than DSD; however, the underlying reason for this effect remains unclear. A possible explanation is that some GATA4 variants retain partial DNA-binding activity, thus exhibiting different degrees of transcriptional activation of gonadal promoters, such as NR5A1 (28).

Furthermore, we analyzed the data concerning GATA4 variants in patients with cardiac diseases alone, which were obtained from ClinVar. Unlike the GATA4 variant loci identified in patients with 46,XY GD, only 9.09% of the GATA4 variants in patients with cardiac diseases alone were located in the N-ZF domain; moreover, most of the variants (31.82%) were located in TAD1. These results underscore the critical importance of the N-ZF domain in cases of 46,XY GD from a different perspective. Interestingly, some patients with 46,XY GD possessing GATA4 variants also exhibited the CHD phenotype, whereas others did not exhibit this phenotype, and the underlying mechanism for this difference is still unknown. These findings indicate that little is known about the specific underlying genetic mechanisms and phenotype–genotype correlations of 46,XY GD and CHD (28), which warrant further explorations.

We also identified two novel heterozygous variants in DHX37 and NR5A1. Pathogenic variants in DHX37 and NR5A1 are prevalent causes of 46,XY GD and explain approximately 11%–20% of cases (2, 3537). DHX37 is a newly reported 46,XY GD-related gene that plays critical roles in both early human testis determination and the maintenance of testicular tissue during the early phase of testis development (7, 38, 39). The novel c.2020C>T variant detected in patient 4 is located in the VI region of the functional RecA2 domain near the C-terminus; additionally, this site is highly conserved among various species, including yeast, and is involved in ATP binding, hydrolysis, and cross-talk with the unwinding machinery (38). NR5A1, which encodes SF-1, is a key gene involved in testis differentiation (40). We identified a novel c.370_380del variant in the DNA-binding domain of NR5A1 that caused a frameshift variant. This variant has not been previously reported; additionally, it was classified as strongly pathogenic (PVS1-null variant) according to the ACMG criteria (26) and was predicted via 3D modeling to cause the loss of a large fragment of the protein. This maternally inherited variant was also detected in the patient’s older sister (patient 6), who was also diagnosed with 46,XY GD, whereas the patient’s mother had a normal phenotype, which was consistent with a sex-limited, dominant pattern of inheritance. Notably, the risk of germ cell tumors related to NR5A1 variants may be associated with the severity of the phenotype and variant type (41), and DHX37 variants may increase cancer risk when considering their function in ribosomal biogenesis (42), which warrants further investigation. Therefore, cancer screening may be needed for patients with 46,XY GD who harbor DHX37 or NR5A1 variants. Finally, we also identified one previously reported variant in MAP3K1, which plays an important role in gonadal sex determination. Specific variants in MAP3K1 cause domination of the female sex-determining pathway, and variants have been observed in 15%–20% of sporadic and familial cases of 46,XY DSD (43). The variant that was observed in our study is located in the SWI2/SNF2 and MuDR (SWIM) domains (residues Glu303 to His393) (44) and may affect the interaction between MAP3K1 and c-Jun, which is consistent with previous reports (45).

In summary, via WES in six patients with 46,XY GD, we identified three novel pathogenic variants in GATA4, NR5A1, and DHX37 and one previously reported pathogenic variant in MAP3K1. All of these variants are conserved among various species and were predicted to be pathogenic by functional algorithms and in-silico protein analyses. We also discovered that the N-ZF domain may play an exclusive role in the mechanism of GATA4 in 46,XY GD patients. These results advance our understanding of the genetic etiology and phenotypical variability of 46,XY GD, which may improve diagnosis and intervention therapies. However, in conjunction with previous studies, the currently reported variants can only explain a limited number of cases of 46,XY GD. More genetic and functional studies need to be performed in larger samples of patients with 46,XY GD in the future.

Data availability statement

The data presented in the study are deposited in the NCBI repository https://www.ncbi.nlm.nih.gov/sra, accession number: PRJNA1243706.

Ethics statement

The studies involving humans were approved by the Institutional Review Board Committee of the Obstetrics and Gynecology Hospital (2021–223), Fudan University (FE21192). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

NZ: Conceptualization, Funding acquisition, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing. SY: Data curation, Formal Analysis, Investigation, Writing – original draft. JG: Data curation, Resources, Writing – original draft. XC: Methodology, Writing – original draft. JQ: Conceptualization, Methodology, Project administration, Supervision, Writing – review & editing. KH: Conceptualization, Funding acquisition, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by The National Key R&D Program of China (2021YFC2701400 and 2021YFC2701404), the Medical Innovation Research Project of the Science and Technology Commission of Shanghai Municipality (22Y11906500), and the Shanghai Municipal Health Commission (20214Y0220).

Acknowledgments

We would like to acknowledge the patients and their families for their participation in the study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1560698/full#supplementary-material

References

1. Elzaiat M, McElreavey K, Bashamboo A. Genetics of 46,XY gonadal dysgenesis. Best Pract Res Clin Endocrinol Metab. (2022) 36:101633. doi: 10.1016/j.beem.2022.101633

PubMed Abstract | Crossref Full Text | Google Scholar

2. de Oliveira FR, Mazzola TN, de Mello MP, Francese-Santos AP, Lemos-Marini SHV, Maciel-Guerra AT, et al. DHX37 and NR5A1 variants identified in patients with 46,XY partial gonadal dysgenesis. Life (Basel). (2023) 13. doi: 10.3390/life13051093

PubMed Abstract | Crossref Full Text | Google Scholar

3. Michala L, Goswami D, Creighton SM, Conway GS. Swyer syndrome: presentation and outcomes. BJOG. (2008) 115:737–41. doi: 10.1111/j.1471-0528.2008.01703.x

PubMed Abstract | Crossref Full Text | Google Scholar

4. Tang R, Liu X, Pan L, Chen R. Novel mutation in FTHL17 gene in pedigree with 46,XY pure gonadal dysgenesis. Fertil Steril. (2019) 111:1226-1235 e1221. doi: 10.1016/j.fertnstert.2019.01.027

PubMed Abstract | Crossref Full Text | Google Scholar

5. Xie QG, Luo P, Xia K, Li ZQ, Xu Z, Su C, et al. 46,XY disorders of sex development: the use of NGS for prevalent variants. Hum Genet. (2022) 141:1863–73. doi: 10.1007/s00439-022-02465-6

PubMed Abstract | Crossref Full Text | Google Scholar

6. Eggers S, Sadedin S, van den Bergen JA, Robevska G, Ohnesorg T, Hewitt J, et al. Disorders of sex development: insights from targeted gene sequencing of a large international patient cohort. Genome Biol. (2016) 17:243. doi: 10.1186/s13059-016-1105-y

PubMed Abstract | Crossref Full Text | Google Scholar

7. Buonocore F, Clifford-Mobley O, King TFJ, Striglioni N, Man E, Suntharalingham JP, et al. Next-generation sequencing reveals novel genetic variants (SRY, DMRT1, NR5A1, DHH, DHX37) in adults with 46,XY DSD. J Endocrine Soc. (2019) 3:2341–60. doi: 10.1210/js.2019-00306

PubMed Abstract | Crossref Full Text | Google Scholar

8. Baxter RM, Arboleda VA, Lee H, Barseghyan H, Adam MP, Fechner PY, et al. Exome sequencing for the diagnosis of 46,XY disorders of sex development. J Clin Endocrinol Metab. (2015) 100:E333–44. doi: 10.1210/jc.2014-2605

PubMed Abstract | Crossref Full Text | Google Scholar

9. Shi Z, Chen S, Han X, Peng R, Luo J, Yang L, et al. The rare mutation in the endosome-associated recycling protein gene VPS50 is associated with human neural tube defects. Mol Cytogenet. (2019) 12:1–8. doi: 10.1186/s13039-019-0421-9

PubMed Abstract | Crossref Full Text | Google Scholar

10. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. (2010) 20:1297–303. doi: 10.1101/gr.107524.110

PubMed Abstract | Crossref Full Text | Google Scholar

11. Shi Z, Yang X, Li BB, Chen S, Yang L, Cheng L, et al. Novel mutation of LRP6 identified in Chinese Han population links canonical WNT signaling to neural tube defects. Birth Defects Res. (2018) 110:63–71. doi: 10.1002/bdr2.v110.1

PubMed Abstract | Crossref Full Text | Google Scholar

12. Nykamp K, Anderson M, Powers M, Garcia J, Herrera B, Ho Y-Y, et al. Sherloc: a comprehensive refinement of the ACMG–AMP variant classification criteria. Genet Med. (2017) 19:1105–17. doi: 10.1038/gim.2017.37

PubMed Abstract | Crossref Full Text | Google Scholar

13. Vaser R, Adusumalli S, Leng SN, Sikic M, Ng PC. SIFT missense predictions for genomes. Nat Protoc. (2016) 11:1–9. doi: 10.1038/nprot.2015.123

PubMed Abstract | Crossref Full Text | Google Scholar

14. Adzhubei I, Jordan DM, Sunyaev SR. Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet. (2013). Chapter 7:Unit7.20. doi: 10.1002/0471142905.hg0720s76.

PubMed Abstract | Crossref Full Text | Google Scholar

15. Schwarz JM, Rodelsperger C, Schuelke M, Seelow D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods. (2010) 7:575–6. doi: 10.1038/nmeth0810-575

PubMed Abstract | Crossref Full Text | Google Scholar

16. Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am J Hum Genet. (2016) 99:877–85. doi: 10.1016/j.ajhg.2016.08.016

PubMed Abstract | Crossref Full Text | Google Scholar

17. Tunyasuvunakool K, Adler J, Wu Z, Green T, Zielinski M, Zidek A, et al. Highly accurate protein structure prediction for the human proteome. Nature. (2021) 596:590–6. doi: 10.1038/s41586-021-03828-1

PubMed Abstract | Crossref Full Text | Google Scholar

18. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. (2018) 46:W296–303. doi: 10.1093/nar/gky427

PubMed Abstract | Crossref Full Text | Google Scholar

19. Laskowski RA, Swindells MB. LigPlot+: multiple ligand-protein interaction diagrams for drug discovery. J Chem Inf Model. (2011) 51:2778–86. doi: 10.1021/ci200227u

PubMed Abstract | Crossref Full Text | Google Scholar

20. Hekkelman ML, de Vries I, Joosten RP, Perrakis A. AlphaFill: enriching AlphaFold models with ligands and cofactors. Nat Methods. (2023) 20:205–13. doi: 10.1038/s41592-022-01685-y

PubMed Abstract | Crossref Full Text | Google Scholar

21. Pires DE, Ascher DB, Blundell TL. DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach. Nucleic Acids Res. (2014) 42:W314–319. doi: 10.1093/nar/gku411

PubMed Abstract | Crossref Full Text | Google Scholar

22. Worth CL, Preissner R, Blundell TL. SDM–a server for predicting effects of mutations on protein stability and malfunction. Nucleic Acids Res. (2011) 39:W215–222. doi: 10.1093/nar/gkr363

PubMed Abstract | Crossref Full Text | Google Scholar

23. Topham CM, Srinivasan N, Blundell TL. Prediction of the stability of protein mutants based on structural environment-dependent amino acid substitution and propensity tables. Protein Eng. (1997) 10:7–21. doi: 10.1093/protein/10.1.7

PubMed Abstract | Crossref Full Text | Google Scholar

24. Cao H, Wang J, He L, Qi Y, Zhang JZ. DeepDDG: predicting the stability change of protein point mutations using neural networks. J Chem Inf Model. (2019) 59:1508–14. doi: 10.1021/acs.jcim.8b00697

PubMed Abstract | Crossref Full Text | Google Scholar

25. Fechner PY, Marcantonio SM, Ogata T, Rosales TO, Smith KD, Goodfellow PN, et al. Report of a kindred with X-linked (or autosomal dominant sex-limited) 46,XY partial gonadal dysgenesis. J Clin Endocrinol Metab. (1993) 76(5):1248–53. doi: 10.1210/jcem.76.5.8496317

PubMed Abstract | Crossref Full Text | Google Scholar

26. Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet In Med. (2015) 17:405–24. doi: 10.1038/gim.2015.30

PubMed Abstract | Crossref Full Text | Google Scholar

27. Martinez de LaPiscina I, de Mingo C, Riedl S, Rodriguez A, Pandey AV, Fernández-Cancio M, et al. GATA4 variants in individuals with a 46,XY disorder of sex development (DSD) may or may not be associated with cardiac defects depending on second hits in other DSD genes. Front In Endocrinol. (2018) 9:142. doi: 10.3389/fendo.2018.00142

PubMed Abstract | Crossref Full Text | Google Scholar

28. Choi J-H, Lee Y, Oh A, Kim G-H, Yoo H-W. Molecular characteristics of sequence variants in GATA4 in patients with 46,XY disorders of sex development without cardiac defects. Sexual Dev. (2019) 13:240–5. doi: 10.1159/000511258

PubMed Abstract | Crossref Full Text | Google Scholar

29. Igarashi M, Mizuno K, Kon M, Narumi S, Kojima Y, Hayashi Y, et al. GATA4 mutations are uncommon in patients with 46,XY disorders of sex development without heart anomaly. Asian J Androl. (2018) 20:629–31. doi: 10.4103/aja.aja_20_18

PubMed Abstract | Crossref Full Text | Google Scholar

30. Globa E, Zelinska N, Shcherbak Y, Bignon-Topalovic J, Bashamboo A, MсElreavey K. Disorders of sex development in a large ukrainian cohort: clinical diversity and genetic findings. Front In Endocrinol. (2022) 13:810782. doi: 10.3389/fendo.2022.810782

PubMed Abstract | Crossref Full Text | Google Scholar

31. Malek Mohammadi M, Kattih B, Grund A, Froese N, Korf-Klingebiel M, Gigina A, et al. The transcription factor GATA4 promotes myocardial regeneration in neonatal mice. EMBO Mol Med. (2017) 9:265–79. doi: 10.15252/emmm.201606602

PubMed Abstract | Crossref Full Text | Google Scholar

32. Li RG, Xu YJ, Wang J, Liu XY, Yuan F, Huang RT, et al. GATA4 loss-of-function mutation and the congenitally bicuspid aortic valve. Am J Cardiol. (2018) 121:469–74. doi: 10.1016/j.amjcard.2017.11.012

PubMed Abstract | Crossref Full Text | Google Scholar

33. Lourenço D, Brauner R, Rybczynska M, Nihoul-Fékété C, McElreavey K, Bashamboo A. Loss-of-function mutation in GATA4 causes anomalies of human testicular development. Proc Natl Acad Sci United States America. (2011) 108:1597–602. doi: 10.1073/pnas.1010257108

PubMed Abstract | Crossref Full Text | Google Scholar

34. Grueso Cerón AL, Arturo-Terranova D, Satizábal Soto JM. Characterization of genomic variants associated with congenital heart disease in patients from southwestern Colombian. Heliyon. (2024) 10:e23678. doi: 10.1016/j.heliyon.2023.e23678

PubMed Abstract | Crossref Full Text | Google Scholar

35. Pearlman A, Loke J, Le Caignec C, White S, Chin L, Friedman A, et al. Mutations in MAP3K1 cause 46,XY disorders of sex development and implicate a common signal transduction pathway in human testis determination. Am J Hum Genet. (2010) 87:898–904. doi: 10.1016/j.ajhg.2010.11.003

PubMed Abstract | Crossref Full Text | Google Scholar

36. Ostrer H. Pathogenic variants in MAP3K1 cause 46,XY gonadal dysgenesis: A review. Sex Dev. (2022) 16:92–7. doi: 10.1159/000522428

PubMed Abstract | Crossref Full Text | Google Scholar

37. Houzelstein D, Eozenou C, Lagos CF, Elzaiat M, Bignon-Topalovic J, Gonzalez I, et al. A conserved NR5A1-responsive enhancer regulates SRY in testis-determination. Nat Commun. (2024) 15:2796. doi: 10.1038/s41467-024-47162-2

PubMed Abstract | Crossref Full Text | Google Scholar

38. McElreavey K, Jorgensen A, Eozenou C, Merel T, Bignon-Topalovic J, Tan DS, et al. Pathogenic variants in the DEAH-box RNA helicase DHX37 are a frequent cause of 46,XY gonadal dysgenesis and 46,XY testicular regression syndrome. Genet In Med. (2020) 22:150–9. doi: 10.1038/s41436-019-0606-y

PubMed Abstract | Crossref Full Text | Google Scholar

39. Zidoune H, Martinerie L, Tan DS, Askari M, Rezgoune D, Ladjouze A, et al. Expanding DSD phenotypes associated with variants in the DEAH-box RNA helicase DHX37. Sexual Dev. (2021) 15:244–52. doi: 10.1159/000515924

PubMed Abstract | Crossref Full Text | Google Scholar

40. Lin L, Achermann JC. Steroidogenic factor-1 (SF-1, Ad4BP, NR5A1) and disorders of testis development. Sexual Dev. (2008) 2:200–9. doi: 10.1159/000152036

PubMed Abstract | Crossref Full Text | Google Scholar

41. Robevska G, van den Bergen JA, Ohnesorg T, Eggers S, Hanna C, Hersmus R, et al. Functional characterization of novel NR5A1 variants reveals multiple complex roles in disorders of sex development. Hum Mutat. (2018) 39:124–39. doi: 10.1002/humu.23354

PubMed Abstract | Crossref Full Text | Google Scholar

42. Sulima SO, Kampen KR, De Keersmaecker K. Cancer biogenesis in ribosomopathies. Cells. (2019) 8. doi: 10.3390/cells8030229

PubMed Abstract | Crossref Full Text | Google Scholar

43. Cheng Y, Xu C, Yang J, Zhou X, Chen N. Identification of a novel MAP3K1 variant in a family with 46, XY DSD and partial growth hormone deficiency. Mol Med Rep. (2022) 26. doi: 10.3892/mmr.2022.12854

PubMed Abstract | Crossref Full Text | Google Scholar

44. Chamberlin A, Huether R, MaChado AZ, Groden M, Liu H-M, Upadhyay K, et al. Mutations in MAP3K1 that cause 46,XY disorders of sex development disrupt distinct structural domains in the protein. Hum Mol Genet. (2019) 28:1620–8. doi: 10.1093/hmg/ddz002

PubMed Abstract | Crossref Full Text | Google Scholar

45. Rieger MA, Duellman T, Hooper C, Ameka M, Bakowska JC, Cuevas BD. The MEKK1 SWIM domain is a novel substrate receptor for c-Jun ubiquitylation. Biochem J. (2012) 445:431–9. doi: 10.1042/BJ20120406

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: 46,XY gonadal dysgenesis, genetic etiology, whole exome sequencing, zinc finger, congenital heart disease

Citation: Zhang N, You S, Guo J, Chang X, Qiu J and Hua K (2025) Genetic variants and molecular profiling of 46,XY gonadal dysgenesis using whole-exome sequencing. Front. Endocrinol. 16:1560698. doi: 10.3389/fendo.2025.1560698

Received: 16 January 2025; Accepted: 13 March 2025;
Published: 11 April 2025.

Edited by:

Núria Camats-Tarruella, Vall d’Hebron Research Institute (VHIR), Spain

Reviewed by:

Idoia Martinez De LaPiscina, Biocruces Bizkaia Health Research Institute, Spain
Helena Fabbri Scallet, State University of Campinas, Brazil

Copyright © 2025 Zhang, You, Guo, Chang, Qiu and Hua. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Keqin Hua, aHVha2VxaW5AZnVkYW4uZWR1LmNu; Junjun Qiu, cWl1anVuanVuMTExM0AxNjMuY29t

These authors have contributed equally to this work and share first authorship

These authors have contributed equally to this work and share senior authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.