Application and discoveries of metabolomics and proteomics in the study of female infertility

Introduction Female infertility is defined as the absence of clinical pregnancy after 12 months of regular unprotected sexual intercourse. Methods This study employed metabolomics and proteomics approaches to investigate the relationship between metabolites and proteins and female infertility. The study used metabolomics and proteomics data from the UK Biobank to identify metabolites and proteins linked to infertility. Results The results showed that GRAM domain-containing protein 1C and metabolites fibrinogen cleavage peptides ADpSGEGDFXAEGGGVR and 3-Hydroxybutyrate had a positive correlation with infertility, whereas proteins such as Interleukin-3 receptor subunit alpha, Thrombospondin type-1 domain-containing protein 1, Intestinal-type alkaline phosphatase, and platelet and endothelial cell adhesion molecule 1 exhibited a negative correlation. These findings provide new clues and targets for infertility diagnosis and treatment. However, further research is required to validate these results and gain a deeper understanding of the specific roles of these metabolites and proteins in infertility pathogenesis. Discussion In conclusion, metabolomics and proteomics techniques have significant application value in the study of infertility, allowing for a better understanding of the biological mechanisms underlying infertility and providing new insights and strategies for its diagnosis and treatment. These research findings provide a crucial biological mechanistic basis for early infertility screening, prevention, and treatment.


Introduction
Female infertility, also known as female sterility, is when clinical pregnancy cannot be established after 12 months of regular unprotected intercourse (1).Infertility affects approximately 8%-12% of couples of reproductive age worldwide (2).While male factors are the sole cause of infertility in 20%-30% of cases, they contribute to 50% of overall infertility cases.The most prevalent type of female infertility worldwide is secondary infertility caused by reproductive tract infections.Many women around the world are infertile, which is a worldwide problem (3).Three main factors that influence the natural conception rate include the period of intention to conceive, the age of the female partner, and infertility related to medical conditions (4).Factors affecting male and female fertility include gonadal dysfunction, hyperprolactinemia, ciliary dysfunction, cystic fibrosis, infections, systemic disorders, and lifestyle/disease-related factors.Meanwhile, premature ovarian insufficiency, polycystic ovary syndrome, endometriosis, uterine fibroids, and endometrial polyps can all lead to female infertility (5).In addition, endocrine-disrupting chemicals may also be linked to infertility by interfering with metabolism (6).
Metabolomics and proteomics are multidisciplinary sciences investigating the metabolites and proteins within organisms (7).Metabolomics is the study of metabolites within an organism to reveal the activity of metabolic pathways, changes in metabolite concentrations, and interactions between metabolites, ultimately revealing the organism's metabolic state and physiological functions (8,9).Proteomics, on the other hand, studies the composition, structure, and function of proteins within organisms to elucidate their roles and regulatory mechanisms in cells and organisms (10).Since metabolites are typically the end products of protein metabolism, and proteins are involved in regulating metabolic pathways and the generation of metabolites, these two techniques complement one other, providing a more comprehensive understanding of an organism's metabolic state and functionality (11,12).
Metabolomics technology can analyze the metabolic profiles of infertility patients and normal populations to identify specific metabolites associated with infertility and reveal potential infertility mechanisms.By identifying these metabolites, an indepth understanding of metabolic abnormalities in infertility is gained, laying the groundwork for diagnosis and treatment (13).Furthermore, metabolomics can examine alterations in metabolic pathways, revealing the mechanisms of metabolic dysfunction in infertility and potentially identifying new treatment targets (14).Any possible identified biomarkers of infertility can be used for diagnosis and prediction.Additionally, metabolomics can assist in evaluating the efficacy of infertility treatment drugs by analyzing their effects on the metabolic spectrum and predicting treatment outcomes (15).Overall, metabolomics has significant value in screening, identifying, diagnosing, and evaluating infertility treatment (13).Proteomics is also significant in infertility studies.By identifying proteins linked to infertility, a better understanding of these diseases can be gained, providing better guidelines for early diagnosis and treatment.Future research should identify specific proteins related to these diseases and embryonic development outcomes and establish their reproducibility and reliability (16).It is intended that by identifying common proteins among different races, ages, and regions, advances to women's overall health worldwide may be achieved.Therefore, metabolomics and proteomics play crucial roles in early screening, prevention, and understanding of the potential biological mechanisms of infertility treatment (17).
Mendelian randomization (MR) evaluates causality by utilizing natural genetic variation.This technique stimulates the effect of a randomized controlled trial by using human genetic variation as a natural random tool to examine whether a factor has a causal effect on a disease or other biological characteristics (18).In this study, we used metabolomics and proteomics data for MR analysis, comprehensively investigating the causal effects of 975 metabolites and 4490 proteins on the onset of infertility.Our goal was to study the biological mechanisms of infertility at the genetic and protein levels, focusing on revealing the etiology of metabolicrelated diseases and deepening understanding of their biological processes (19).

Metabolomics and proteomics GWAS data
Metabolite GWAS data used in this study were obtained from the open GWAS website (https://gwas.mrcieu.ac.uk/).The 975 metabolites retrieved contained 452 human blood metabolites, 150 metabolites of human immune system symptoms, 123 circulating metabolites, and 249 metabolic traits, which were identified in 115,078 participants with nearly 12 million SNPs (Single Nucleotide Polymorphism) from the UK Biobank using the Nightingale Health assay.Four continuous sources of overlapping data were eliminated, and 975 metabolites were retained (20)(21)(22).The proteomics data in this study contained a total of 4490 distinct data from various proteins from three separate sets of studies (23-25).

Outcome GWAS data
The Infertility GWAS data in this study came from the Finngen population database, a large-scale human genomics database incorporating genetic, health, and clinical data.This database serves as a valuable data resource and analytical tool for genetic research and precision medicine, which helps elucidate the genetic basis of diseases and develop individualized medicine(https://risteys.finregistry.fi/endpoints/N14_FEMALEINFERT).In this data representation, female infertility, defined as the inability of female to get pregnant after a specified period of unprotected intercourse, ICD10-N97.

Selection of instrumental variables
To meet the experimental hypothesis of Mendelian randomization [MR], the instrumental variable selection should have strong relevance in the following aspects: 1) Instrumental variables should have a strong correlation with the outcome variable [dependent variable], implying that instrumental variables should have a large explanatory power for the variation of the outcome variable.As a result, the SNPs effectively represent the variation in the outcome variable; 2) Unrelatedness: instrumental variables should be unrelated to other factors that influence variation in the outcome variable.This was done to prevent instrumental variables associated with other potential confounding variables that could generate endogeneity problems; 3) Exclusivity: instrumental variables should only affect the outcome variable through their influence on the dependent variable rather than via additional pathways.This ensures that the instrumental variables only impact the outcome variable via causal pathways without interference from other factors.Therefore, we defined strict standards for the inclusion of instrumental variables, with a whole-genome significant level [P-value < 5× 10 −8 ] and no linkage disequilibrium [LD] with other SNPs [r2< 0.01] as the instrumental variables for these metabolites and proteins.

Statistical analysis
This study used inverse variance-weighted [IVW] meta-analysis to investigate and assess the causal relationships between exposure, intermediate, and outcome (26).The IVW method is the primary MR analysis method used for combining the Wald ratios of individual SNPs (27).The estimates may be biased since the methodology assumes that all variants are effective instruments and that the instrumental variables may exhibit directional pleiotropy (28).Cochran's Q value was used to determine whether the analysis was heterogeneous.Moreover, we employed forest plots to demonstrate the correlations between SNP exposure association and outcome association, limit the effect of errors, increase experimental reliability and accuracy, and measure the contribution of each instrumental variable to the overall causal estimate.

Inclusion of study variables
After a rigorous screening of instrumental variables, 504 metabolites and 971 proteins were included in the study.In the MR study, 5 proteins and 2 metabolites showed potential causal relationships.fibrinogen cleavage peptides ADpSGEGDFXAEGGGVR and 3-Hydroxybutyrate were among the metabolites included, while the proteins were GRAM domain-containing protein 1C, Interleukin-3 receptor subunit alpha, Thrombospondin type-1 domain-containing protein 1, Intestinal-type alkaline phosphatase, and platelet and endothelial cell adhesion molecule 1. Figure 1 depicts the flowchart.
In the subsequent sensitivity analysis, the P-values and Cochran's Q P-values for all included metabolites and proteins were above 0.05, indicating no outliers in our data and the robustness of our results.The sensitivity analysis results can be found in the Supplementary Materials.In the subsequent leave-oneout analysis, our results remained stable even with gradual removal of instrumental variables.MR-Egger analysis could not be performed to test the pleiotropy of 4 proteins and 1 metabolites because the IVs were less than three SNPs.Detailed analysis information can be found in the Supplementary Table 1 and the relevant biological functions or pathways of these targeted proteins and metabolites were list in the Supplementary Table 2.

Discussion
Our study used large-scale metabolomics and proteomics data for MR analysis to discover potential relationships between metabolites, proteins, and female infertility.We identified several metabolites and proteins that may be linked to female infertility, providing insight into its biological mechanisms.Female infertility refers to a woman's inability to conceive under normal sexual conditions and within one year of attempting to do so (29).Ovulation disorders, uterine abnormalities, tubal obstruction, and ovarian dysfunction are the most common causes (30,31).The most prevalent cause is ovulation abnormalities, which can caused by polycystic ovary syndrome, ovarian dysfunction, thyroid dysfunction, and other factors (32).Infection, surgery, or congenital abnormalities can induce tubal obstruction.Ovarian dysfunction includes premature ovarian failure and ovarian cysts.Female infertility symptoms may include irregular, abnormally lengthy, or short menstrual cycles, and difficulty ovulating (33).Proteomics and metabolomics are two significant omics technologies for studying and diagnosing female infertility.Proteomics compares protein spectra between infertility patients and normal populations, detects protein expression differences associated with infertility, and identifies protein biomarkers for infertility diagnosis and treatment.Metabolomics examines the metabolite spectra of infertility patients and normal populations, identifies metabolites related to infertility, and employs them as biomarkers for infertility assessment and treatment.These two technologies can offer vital information on the mechanisms and individual characteristics of infertility, laying the groundwork for infertility research and treatment (34,35).By applying these omics technologies, we can gain a deeper understanding of the pathogenesis of infertility, discover new biomarkers, and provide new insights and strategies for infertility diagnosis and treatment.Furthermore, these omics technologies can also help evaluate the effectiveness of infertility treatments and offer a foundation for more personalized treatments.
3-hydroxybutyric acid (3-HB) is a metabolite commonly associated with fatty acid metabolism and the synthesis of ketone bodies (36).Although no conclusive data links 3-hydroxybutyric acid (3-HB) directly to infertility, it may be indirectly associated with certain aspects of infertility.As a metabolite, 3-HB may be indirectly linked to certain aspects of infertility.3-HB is a ketone body primarily synthesized by the liver, and its levels can rise in specific conditions such as fasting, prolonged exercise, and diabetes (37,38).Infertility can be linked to metabolic disorders and hormonal imbalances; variations in 3-HB levels can reflect these changes.For example, 3-HB levels are often elevated in diabetic patients, which is a factor contributing to infertility.Additionally, 3-HB may be related to ovarian function (39,40).The ovaries are essential organs in the female reproductive system responsible for ovulation and hormone secretion.Some studies have suggested a possible link between 3-HB and ovarian function (41).For instance, one study found that 3-HB levels were associated with ovarian reserve, with higher levels indicating better ovarian reserve.Ovarian reserve is a significant predictor of female fertility (42).Therefore, 3-HB may be related to ovarian dysfunction in female infertility.PECAM-1 (Platelet Endothelial Cell Adhesion Molecule-1) is a protein implicated in infertility.PECAM-1 is widely present in the reproductive system, including the ovaries, endometrium, and fallopian tubes.Some studies have found associations between PECAM-1 and infertility factors such as ovulation disorders, uterine abnormalities, and tubal obstruction (43).For example, PECAM-1 deficiency may lead to follicle development and ovulation disorders.Patients with endometriosis and endometrial cancer generally have an increased expression of PECAM-1.Additionally, elevated PECAM-1 expression has been identified in individuals with fallopian tube cancer and tubal tuberculosis (44).These findings suggest that PECAM-1 may play a role in the onset and progression of infertility.Further research on the relationship between PECAM-1 and infertility should aid in understanding the etiology and treatment of infertility.Unfortunately, research into the interaction between other metabolites and proteins and infertility is limited, and no clear link with infertility has been identified.Further investigation into the role of these substances in infertility is required.
This article has several strengths: 1.The article provides detailed information about the definition of female infertility, its global prevalence, and potential causes and influencing variables, offering readers a thorough grasp of the condition.
2. The article utilized Mendelian randomization (MR) analysis to investigate potential causal relationships between metabolites, proteins, and infertility using large-scale metabolomics and proteomics data.
3. The article provides detailed statistical analysis methods and results, including instrument variable selection, statistical analysis methods, and sensitivity analysis, enhancing the reliability and feasibility of the study.
However, there are some limitations to this article: 1.The article does not provide detailed information about other potential causes and influencing factors of infertility, such as genetic and environmental factors, which may limit a comprehensive understanding of the etiology of infertility.
2. The study results are based only on existing metabolomics and proteomics data, which may be limited by sample size and data quality, requiring further research to validate and replicate these findings.

Conclusion
Based on the integrated findings from metabolomics and proteomics research, several potential relationships have been identified between metabolites, proteins, and female infertility.These findings provide new clues and targets for infertility diagnosis and treatment.

Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material.Further inquiries can be directed to the corresponding author.

FIGURE 2
FIGURE 2Association results of MR analyses with 5 Protein and Infertility.

FIGURE 1 Flow
FIGURE 1Flow chart of this study.

FIGURE 3
FIGURE 3Association results of MR analyses with 2 Metabolites and Infertility.