Preliminary Pharmacogenomic-Based Predictive Models of Tamoxifen Response in Hormone-dependent Chilean Breast Cancer Patients

Tamoxifen (TAM), a selective oestrogen receptor modulator, is one of the most used treatments in oestrogen receptor-positive (ER+) early and metastatic breast cancer (BC) patients. The response to TAM has a high degree of inter-individual variability. This is mainly due to genetic variants in CYP2D6 gene, as well as other genes encoding proteins involved in the TAM pharmacokinetic and/or pharmacodynamic. Therefore, prediction of the TAM response using these genetic factors together with other non-genetic variables may be relevant to improve breast cancer treatment. Thus, in this work, we used genetic polymorphisms and clinical variables for TAM response modelling. One hundred sixty-two ER + BC patients with 2 years of TAM treatment were retrospectively recruited, and the genetic polymorphisms CYP2D6*4, CYP3A4*1B (CYP3A4*1.001), CYP3A5*3, UGT2B7*2, UGT2B15*2, SULT1A1*2, and ESRA V364E were analyzed by PCR-RFLP. Concomitantly, the therapeutic response was obtained from clinical records for association with genotypes using univariate and multivariate biostatistical models. Our results show that UGT2B15*1/*2 genotype protects against relapse (OR = 0.09; p = 0.02), CYP3A5*3/*3 genotype avoids endometrial hyperplasia (OR = 0.07; p = 0.01), SULT1A1*1/*2 genotype avoids vaginal bleeding (OR = 0.09; p = 0.03) and ESRA 364E/364E genotype increases the probability of vaginal bleeding (OR = 5.68; p = 0.02). Logistic regression models, including genomic and non-genomic variables, allowed us to obtain preliminary predictive models to explain relapse (p = 0.010), endometrial hyperplasia (p = 0.002) and vaginal bleeding (p = 0.014). Our results suggest that the response to TAM treatment in ER + BC patients might be associated with the presence of the studied genetic variants in UGT2B15, CYP3A5, SULT1A1 and ESRA genes. After clinical validation protocols, these models might be used to help to predict a percentage of BC relapse and adverse reactions, improving the individual response to TAM-based treatment.


INTRODUCTION
Breast cancer (BC) is the second leading cause of cancer death in women (American Cancer Society, 2017) and is caused by many factors, such as age, breast density, heredity, and exposure to oestrogens (Kumar et al., 2010). Breast cancer (BC) patients with oestrogen receptor positivity (ER+) are treated in addition to surgery, chemotherapy, and/or radiation therapy with hormone therapy (Maughan et al., 2010;(McDonnell and Wardell 2010;NCI 2012). Tamoxifen (TAM), a type of hormonal therapy known as a selective oestrogen receptor modulator, behaves as an oestrogen antagonist in the breast and as an agonist in the endometrium (Lewis and Jordan 2005;Terán and Teppa 2005;Hertz et al., 2009;NCI 2012), and despite the new hormonal agents, TAM remains the drug of choice for any BC stage (Davies et al., 2011) since it increases the disease-free time in pre-or postmenopausal women, reducing the annual death rate by 34% (MINSAL, 2005;Untch and Thomssen, 2010). However, the response among patients is variable in terms of efficacy and side effects.
Differences have been found between the response to TAM and the histological type, degree of differentiation of the breast tumor cells, age, and state of menopause in the patient. TAM has an overall response rate of almost 75% in patients with ER+ and PR + BC. In the case of adjuvant treatment, TAM reduces the risk of relapse by 25% and mortality by 17% (Fisher et al., 1998;Davies et al., 2011). On the other hand, the long-term safety of TAM is well elucidated, and possible harmful effects on bone metabolism, endometrial cancer, thromboembolic diseases, and cognitive disorders due to chronic oestrogen deprivation in brain tissue have been described (Fisher et al., 1998;Cuzick et al., 2002;Cuzick, 2005). A 1% increase in the incidence of endometrial cancer has been observed due to its oestrogen-inducing effect in the uterus and thromboembolic events in 1-2% of cases.
Despite several studies having been conducted, after 3 decades, there are still differences in the TAM response that have not been explained. It is known that these variations in the response to drugs are multifactorial, the result of the interaction between multiple and complex genetic, physiological and environmental factors (Ma and Lu 2011). A potential explanation is the presence of genetic variants in the genes encoding biotransformation enzymes affecting their efficacy and safety (Zhou et al., 2008). In this respect, undoubtedly genetic variation in the CYP2D6 driving to ultrarapid (XN), intermediate metabolizer or poor metabolizer phenotypes (e.g. *3, *4, *5, *9, *10, *17 haplotypes) may affects plasma concentrations of TAM and its metabolites, as CPIC has stablished in the updated guidelines (Goetz et al, 2018). CYP2D6*4 (rs3892097) is associated with decreased side effects, such as hot flashes, disease-free time, and survival rate (Goetz et al., 2005;Jin et al., 2005;Schroth et al., 2007;Bijl et al., 2009;Ramón y Cajal et al., 2010;Rae et al., 2012), and decreased serum levels of TAM metabolites (Gjerde et al., 2007;Kiyotani et al., 2010;Lim et al., 2011;Madlensky et al., 2011). In addition, CYP3A4*1B (currently named CYP3A4*1.001, rs2740574) is associated with a 3-fold higher risk of endometrial cancer in TAM-treated BC patients (Chu et al., 2007). CYP3A5*3 (rs776746) is associated with characteristics of the tumor in BC postmenopausal women treated with TAM (Tucker et al., 2005). On the other hand, CYP2C9 *2 and *3 variants reduce the concentration of TAM metabolites, affecting the therapeutic response (Mürdter et al., 2011).
Phase II enzymes also affect TAM response. SULT1A1*2 (rs9282861) increases the risk of relapse in TAM-treated BC patients (Wegman et al., 2005). UGT2B15*2 (rs1902023) decreases the risk of relapse of BC (Nowell et al., 2005). Furthermore, the presence of UGT2B15*2 and SULT1A1*2 is associated with a lower risk of relapse and a significant reduction in survival time in patients with BC treated with TAM (Nowell et al., 2005).
Finally, several mutations have been identified in the oestrogen receptor α gene (ESRA) in BC patients (Herynk et al., 2004); however, their effect on the efficacy and safety of TAM treatment has not been elucidated. In this respect, using SIFT and PolyPhen bioinformatic tools, it was established that the single nucleotide polymorphism (SNP) ESRA (OMIM* 133430) V364E (rs121913044; 1453T>A; Val364Glu) generates a deleterious change with possible damage to the N-terminus of the hormone-binding domain in the ER, giving rise to a 40 times lower affinity for oestrogen (Flicek et al., 2011). This SNP also exhibits a dependence on oestrogen for binding to an ERE, although it maintains its negative dominant activity entirely. Therefore, ESRA V364E is highly active and capable of repressing ER-mediated transcription, both when ESRA V364E and normal ESRA proteins are present together in cells and even without DNA binding (Wrenn and Katzenellenbogen 1993;Ince et al., 1995;McInerney et al., 1996).
Although in the last decade several studies have investigated genetic variants in TAM-metabolizing enzymes that might determine the differences in the response to treatment in patients with BC, there are still controversies about their relationship with the response (Brauch and Schwab, 2014;Brewer et al., 2014;Province et al., 2014;Binkhorst et al., 2015). To address this question, we aimed to study the association among response (relapse and ADR) and 7 genetic variants in genes encoding proteins involved in the pharmacokinetics and pharmacodynamics of TAM (CYP2D6p4, CYP3A4p1B (CYP3A4p1.001), CYP3A5p3, SULT1A1p2, UGT2B7p2, UGT2B15p2 and ESRA V364E) in women with hormone-dependent BC and under adjuvant treatment with TAM. The main goal is to generate predictive models to approximate the response of patients according to their genetic-metabolic characteristics.

Study Design
A retrospective case-control study was carried out from January 2014 to January 2015 at the Polyclinic of Oncology of the National Institute of Cancer (INC). The sample size was determined according to the frequency of carriers with the variant allele carriers in the population under study, using PS Power and Sample Size Calculations Version 3.0, January 2009, considering an 80% power; α 0.05; OR 2.0; and the less frequent CYP3A4 p1B (CYP3A4 p1.001 , rs2740574), according to the literature (Fuentes and Silveyra, 2019).

Patients
One hundred sixty-two (162) patients were enrolled. The inclusion criteria were as follows: 1) patients with histologically confirmed BC, 2) in adjuvant treatment with 20 mg of tamoxifen (Novadex ® ) daily for at least 24 months; 2) > 18 years-old, 3) positive ER and negative HER-2 status, 4) cell differentiation degree I to III. The exclusion criteria were as follows: 1) patients diagnosed with in situ BC; 2) negative ER, negative PR, triple-negative or Her-2 positive patients; 3) concomitant treatments with vitamin K antagonists (warfarin, acenocoumarol or dicoumarol), serotonin reuptake antagonist antidepressants (fluoxetine or paroxetine), mitomycin, nitonavir, primidone, or chemotherapy (fluorouracil, methotrexate or cyclophosphamide), 4) chronic unbalanced systemic pathology or other active cancers and, 5) aromatase inhibitor treatment or LHRH agonist. The events (relapses and ADR) were evaluated after 6 months of TAM treatment. All the patients signed a written consent form and agreed to be included in this study.
Toxicity was assessed based on the Common Terminology Criteria for Adverse Events, v4.03. Vaginal bleeding was defined as any vaginal bleeding in postmenopausal patients (more than 1 year without menstruation or oophorectomized) or any abnormal bleeding, such as unexpected bleeding or in quantity or habitual bleeding in premenopausal patients. Endometrial hyperplasia was defined as an endometrial thickness greater than 5 mm detected by transvaginal echotomography in postmenopausal patients or an abnormal thickness of the endometrium according to the phase of the menstrual cycle in premenopausal patients.
The ethnicity of the study group was approached by the percentage of Amerindian-Caucasian admixture (%M a-c ) based on the ABO system (Valenzuela et al., 1987;Acuña et al., 2000).
The patients were gathered into two groups: the control group include patients who do not present the analyzed ADRs (relapse, hot flashes, etc.), and the cases group include patients who have the ADRs.

Genotyping Analyses
Potentially functional SNPs encoding the proteins related to the TAM response were obtained from the PharmGKB database (Whirl-Carrillo et al., 2012), the NCBI dbSNP database (https://www.ncbi.nlm.nih.gov/snp/), the SNPinfo Web Server (https://snpinfo.niehs.nih.gov), the Ensembl ® genome database project (https://www.ensembl.org/index.html) and the level of evidence for each SNP (Supplementary Table S1). Genomic DNA was isolated from the peripheral blood samples of the subjects using a Genomic DNA Extraction Blood DNA Kit (FAVORGEN ® , BIOTECH CORP, Headquarters, Taiwan, China) and from buccal mucosa cells using a MasterAmp ™ Buccal Swab Kit (Epicentre ® , an Illumina company, Madison, USA). DNA samples were quantified at 260/280 nm using a Nanodrop spectrophotometer (model DS-11, FX Series Spectrophotometer, UGT2B15p2 (rs1902023), and ESRA V364E (rs121913044) were analyzed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) (Cavalli et al., 2001;Schur et al., 2001;Hajdinjak and Zagradišnik 2004;Lee et al., 2005;Kagaya et al., 2007;Arslan et al., 2011). The primers and restriction enzymes used are presented in Supplementary  Table S2. Each assay contained four controls: one sample for each genotype as a positive control and one negative sample with nuclease-free pure water to volume. For quality assurance purposes, we randomly chose 20% of the samples for repetition of the analysis. When analyses were not coincident, we excluded the samples.

Statistical Analyses
Statistical analyses were performed using GraphPad Prism 4.0 and STATA 11.1, with p < 0.05 considered statistically significant. The results are expressed as the mean ± standard deviation (SD), number, percentage, or frequency where appropriate. The Shapiro-Wilk test was used to determine quantitative variable distributions. Mean values between the two groups were compared using the unpaired t-test for variables with a normal distribution, while the Mann-Whitney U test was used for variables with non-normal distributions. Frequencies of qualitative variables were also calculated. The chi-square test with Fischer's exact test was used to investigate differences in genotypic and allelic frequencies between the two groups.
We checked for Hardy-Weinberg equilibrium (HWE) in our sample even though the conditions for HWE are not applicable because it is not a random sampling in a random-mating population, a control population or the general population (Namipashaki et al., 2015) and is a group with a selection bias based on the disease (i.e., SNPs can also be related to cancer). The polymorphic variants CYPD6 p4, CYP3A4p1B (CYP3A4p1.001), CYP3A5p3, SULT1A1p2, and accomplish HWE, while UGT2B15p2 and ESRA V364E do not (cut-off chi 2 : 3.84). The polymorphisms were evaluated using co-dominant (wild type vs heterozygote vs variant), dominant (wild type vs. heterozygote/variant), and recessive (wild type/heterozygote vs. variant) inheritance models. Bi-variable analyses were performed to determine the association between the events (relapses and ADR), considered the dependent variables, and the characteristics of the patients, considered the independent variables. Thus, the independent variables considered were BMI, blood group, smoking habit, socioeconomic status, treatment with oral contraceptives, hormone replacement therapies, age at menarche, menopausal status, family history of cancer, age at diagnosis, cancer stage, and presence of genetic variants (exposure variable).
Multivariate logistic regression models and multivariate linear regression analyses were employed to investigate associations between genetic or non-genetic characteristics and relapse or ADR. The logistic multivariate models were adjusted stepwise using a forward and backward procedure to include potentially relevant variables to derive statistical association models characterized by pseudo R 2 . All association studies were assayed by choosing parameters with a better statistical association for each analysis. The odds ratio (OR), or coefficients (Coef.), and 95% confidence intervals (CIs) are reported in the multivariate logistic regression models.

Ethics
The study was carried out under strict ethical procedures recommended by the Ethics Committee of the Faculty of Medicine of the University of Chile (July 24, 2013) and the

Polymorphisms N Frequency
Enzymes involved in the activation of TAM

RESULTS
The baseline characteristics of the patients (58 ± 13 years; BMI: 29 ± 6) are shown in Table 1, and the characteristics of the patients according to genotype are shown in Supplementary  Table S3. Sociogenetic gradients, risk factors, gynecological characteristics, and pathological features for descriptive analyses can also be observed. Of the total patients, 68 were premenopausal, and 94 were postmenopausal. In relation to the cancer stage at diagnosis, the majority of patients were in stage II (51.23%), the predominant histology type was invasive ductal carcinoma (86.88%), and the cell differentiation degree was predominantly G2 (54.17%). A 49.3% M a-c was found. Table 2 shows the therapeutic response characteristics of the patients. Relapse was found in 9 patients (5.73%). The most severely observed ADRs among patients were endometrial hyperplasia (9.20%) and vaginal bleeding (4.91%), and the most frequent were hot flashes (61.96%), bone pain (13.50%), and cramps (12.27%).
The genotypic and allelic frequencies for the analyzed polymorphisms are shown in Table 3.
We performed univariable logistic regression of the risk of relapse (Supplementary Tables S4, S5)   Frontiers in Pharmacology | www.frontiersin.org November 2021 | Volume 12 | Article 661443 7 and non-genetic variables. In Table 4, only statistically significant results for the univariable logistic regression analysis (risk of relapse, endometrial hyperplasia, and vaginal bleeding) are shown. The results show that UGT2B15p2 A/C (rs1902023) in a dominant model of inheritance was associated with relapse; CYP3A5 p3 A/G (rs776746) in a recessive model of inheritance was associated with endometrial hyperplasia; and SULT1A1 p2 G/ A (rs9282861) in a dominant model of inheritance and ESRA V364E (rs121913044) in a recessive model of inheritance were associated with vaginal bleeding.
After using stepwise forward and backward procedures, multivariate logistic regression analyses for the risk of relapse and ADRs, including genetic and non-genetic variables, were performed. We obtained significant models for endometrial hyperplasia, vaginal bleeding, phlebitis, headache, nausea, hot flash, cramps, bone pain, and urticaria ( Figure 1). Supplementary Tables S10-S12 show significant multivariate logistic regression analyses or logit models for relapse (Supplementary Associations among genetic variants with any adverse effect versus no adverse effect or with some grades of adverse effects were also assessed, but no statistical significance was observed (Supplementary Tables S13, S14).

DISCUSSION
Patient response to TAM has been investigated for a long time. In this respect, differences of approximately 25-50% in patients are observed (Fisher et al., 1998;Davies et al., 2011). This is an important issue due to the harmful effects on bone metabolism, cancer of the endometrium, thromboembolic diseases, and cognitive disorders caused by chronic oestrogen deprivation in brain tissue (Fisher et al., 1998;Cuzick et al., 2002;Cuzick, 2005). Moreover, various side effects similar to those seen in menopause may occur, such as hot flashes, weight gain, vaginal dryness, nausea, and elevated liver transaminases (Cuzick, 2005;Carpenter and Miller 2005).
Although it is well known that genetic variants in CYP2D6 could explain between 10-20% of these cases and its use as a biomarker for TAM is recommended by the FDA, there is still no agreement on its clinical utility for predicting results in BC, the studies reveal contradictory results and the role of other pharmacokinetic and pharmacodynamic proteins have been poorly studied (Brauch and Schwab, 2014;Brewer et al., 2014;Province et al., 2014;Binkhorst et al., 2015;Goetz et al., 2018). Therefore, the present research focused on studying the association between 7 genetic variants in several genes encoding proteins involved in the pharmacokinetics and pharmacodynamics of TAM and response (relapse) and ADRs in TAM-treated hormone-dependent BC women and contributing to the field, generating predictive models that approximate the response of patients according to their genetic-metabolic characteristics. CYP2D6p3 and p17 alleles were previously found to be absent in different sub-groups of the Chilean population (Roco et al., 2012;Varela et al., 2015) therefore we exclude them from this study and other potentially relevant variants could not be studied, unfortunately.
In this study, we recorded ADRs after 6 months of TAM treatment. This is because several other factors influence TAM ADRs, especially before TAM active metabolites reach steady state, which is after 2 months of treatment (Buckley and Goa, 1989;Lien et al., 1992;Kisanga et al., 2004). Therefore, we considered that after 6 months of treatment, the ADRs might be associated mainly with TAM. Similarly, after 2 years of treatment, relapses can be properly evaluated (Cheng et al., 2012).
The study patients have a 49.3% M a-c , which agrees with that described by Valenzuela et al. in 1987, in relation to admixture and socioeconomic stratum, because main of the enrolled population of patients (94%) belong to a low socioeconomic stratum with higher Mapuche ancestry; this was an expected result according to the place of recruitment of the patients.
The study of the association between genetic variants and clinical parameters showed that the presence of UGT2B15 p1/p2 protects against relapse (OR 0.09; p 0.008), CYP3A5p3/p3 protects against presenting endometrial hyperplasia (OR 0.07, p 0.01), and SULT1A1p1/p2 protects against vaginal bleeding (OR 0.09; p 0.03) and that ESRA 364E/364E is a risk factor for vaginal bleeding (OR 5.68; p 0.03). In this respect, it seems to be relevant to perform more studies regarding the ESRA V364E variant, a very poorly studied variant that can be determined in patients with ER + BC to prevent vaginal bleeding caused by TAM therapy, which is also a suspicious sign of endometrial cancer.
In our study, some analyses showed associations without statistical significance, which may be due to the low number of occurrences of the event in each subgroup, lack of clinical data in any of the contrasting groups, or because the variant explains a smaller part of the response, which could be clarified by increasing the sample size in future studies. Thus, in small groups, Fisher's exact test was used to obtain proper results.
The biostatistical tools used in this field have established that non-significant trends, but with p < 0.2 and physiological significance, can be included in predictive models and in multi-variable logistic regression analysis to help predict the response (relapses) and adverse reactions to treatment (Núñez et al., 2011). Therefore, preliminary predictive models to predict the response (efficacy and safety) in TAM-treated BC patients were generated. The efficacy result, analyzed as relapse, led to a model that explains 49.5% of the possibility of presenting a relapse (p 0.01) by including SULT1A1p2 and UGT2B15p2 and a series of relevant non-genomic variables. These results were expected since both the SULT1A1 and UGT2B15 enzymes have specificity for 4-hydroxyTAM, even though the genetic variant UGT2B15 p2 presents a nucleotide change located in a substrate binding site that is associated with a decrease in catalytic activity and alterations in the Km and Vmax values (Lévesque et al., 1997). Similarly, SULT1A1 p2 has a lower catalytic activity than the wild-type allele (Arslan et al., 2011). Thus, these variants would produce a decrease in the elimination of 4-hydroxyTAM. Moreover, our results are similar to those published by Nowell and his colleagues (Nowell et al., 2002 andNowell et al., 2005), who conducted a retrospective study in 162 TAM-treated BC patients from Arkansas who received TAM to determine whether genetic variability (CYP2D6 p4, SULT1A1p2, and UGT2B15 p2 ) in the TAM metabolic pathway was associated with recurrence. They found that high activity genotypes UGT2B15 p1/p1 and combined genotypes UGT2B15 p1/p1 and SULT1A1 p2/p2 lead to an increased risk of disease recurrence and that SULT1A1 p2/p2 causes an approximately 3 times higher risk of death than SULT1A1p1/p1 and p1/p2. Conversely, Wegman et al. (2005) found that the genetic variants CYP2D6p4/p4, and SULT1A1p1/ p1 cause an increase in the disease-free relapse time in patients with premenopausal and postmenopausal BC treated with 40 mg TAM using proportional hazards Cox regression adjusted for age, tumor size, and lymph nodes. However, Ahern et al. (2011) andLash et al. (2011) in homologous models found that the genetic variants CYP2D6 p4, UGT1A8p3, UGT2B7p2, and UGT2B15 p2 did not modify the relapse rate of the disease in patients with ER+/TAM + BC. On the other hand, Sensorn et al., 2013 investigated genetic variant CYP3A4 p1B (−392A>G, actually CYP3A4p1.001 G/A)/p18 (878T>C), CYP3A5p3 (6986 G>A), ABCB1 3435 C>T, ABCC2p1C (−24C > T) and ABCC2 68231 A>G to evaluate the risk of recurrence within 3 years among thirty Thai women after receiving tamoxifen adjuvant therapy but they could not find CYP3A4p1B/p18, and did not observe a statistical association between CYP3A5p3, ABCC2p1C, and ABCC2 68231 A>G with clinical outcome.
Additionally, preliminary predictive models were generated to predict ADRs. The endometrial hyperplasia model explained 41.6% of the possibility of presenting the event (p 0.002). The model includes the CYP2D6 p4, CYP3A4p1B (CYP3A4p1.001), CYP3A5p3, SULT1A1p2, UGT2B15p2 and ESRA V364E genotypes plus several relevant non-genomic factors. Likewise, a preliminary predictive model was generated that explains 33.7% of the possibility of presenting vaginal bleeding (p 0.014) by including the CYP2D6p4, CYP3A4p1B (CYP3A4p1.001), SULT1A1p2, and ESRA V364E genotypes and relevant non-genomic factors. Currently, there is no evidence regarding the association of genetic variants with TAM-induced vaginal bleeding or endometrial thickening. Chu et al. (2007) found that women with BC who carried CYP3A4 p1B (CYP3A4p1.001), had a 3-fold increased risk of developing endometrial cancer after treatment with TAM. The phlebitis model explained 66% of the possibility of presenting this ADR (p 0.036), a significant finding that has not been previously reported. In the same way, preliminary predictive models were generated to predict the appearance of headache (42.2%; p 0.028), nausea (36.3%; p 0.013), and hot flashes (12.1%; p 0.023) in patients. Our results differ from those reported by Goetz et al. (2005), where CYP2D6 p4/p4 tends to contribute to a higher risk of disease relapse and a lower incidence of hot flashes in a similar study. No other studies were found in the literature describing associations with headache and nausea. Finally, predictive models to predict the appearance of cramps (24.6%; p 0.053), bone pain (14.1%; p 0.017) and urticaria (34%; p 0.039) were obtained. We were not able to find any similar study in the scientific literature.
Altogether, our results show that some specific genetic variants in genes encoding proteins involved in the TAM pharmacokinetic and/or pharmacodynamic can influence the efficacy of TAM therapy in BC.
Since there are still discrepancies between the findings, which may be due to the genetic differences present in the subjects analyzed, it is crucial to continue with similar and larger studies that confirm the association of CYP2D6 p4, CYP2C9p2 and p3, CYP3A4p1B (CYP3A4p1.001), CYP3A5p3, SULT1A1p2, UGT2B15p2 and ESR1 V364E and response to treatment with TAM. Differences in allele frequencies of variants influencing drug's pharmacokinetics across populations are considered to result in the interethnic differences in pharmacokinetics observed. For example, the lower mean body weights of populations of East Asian in relation to European populations are well established to contribute to differences in the clearance or volume of distribution of some drugs (Lu et al., 2018). On the other hand, in relation to that scarce information is available in Latin American populations, but allele frequencies heterogeneously differ from Asian, Caucasian and African populations (Mendoza et al., 2001;López et al., 2005;Roco et al., 2012;Friedrich et al., 2014), giving rise an idea of the phenotypic response and patient outcome.
Despite our analysis, the present study has some limitations. Even though we had a good sample size for combinatorial analyses, a small number of patients examined in each subgroup comparison could mask potential associations, especially for low-frequency polymorphisms, particularly in the multivariate analyses. The small number of variants analyzed could restrict the statistical power and predictive potential of models, considering that some other candidate genes/ polymorphisms were not evaluated in this study (e.g. CYP2D6 p2, p5, p6, p10 and p41, CYP2C9p2, CYP2C19 p2 and p17, UGT1A10 139Lys, UGT1A8 173Gly/277Cys and ABCC2), which could still be relevant. In some cases, missing clinical data was relevant, giving rise to a possible differential misclassification bias affecting the estimated associations between potentially relevant combinations of risk factors and adverse reactions. Other environmental factors, such as contaminants and stress, or social determinants were not explored in this study, which could potentially affect the associations.
However, our findings and preliminary predictive models proposed can be used to prevent, in a good percentage of patients, the negative effects of TAM therapy. Additionally, they could help in the decision-making of a change in therapy to aromatase inhibitors (AIs) (Bernhard et al., 2015).
Certainly, for routine clinical application, the models must be validated in terms of their specificity, sensitivity, and predictive value, so it is necessary to carry out a second stage of research that involves treatment intervention in the patient according to their genotypic profile and relevant non-genomic variables derived from this study. It should be considered that the predictive models generated in this study after being validated could be used in the Chilean population. They could be used in another population with the same percentage of Caucasian aboriginal mix.

CONCLUSION
In our study, UGT2B15*2 A/C is likely to have a clinically significant impact on recurrence risk, CYP3A5 p3 A/G on endometrial hyperplasia, and SULT1A1p2 G/A, and ESRA V364E on vaginal bleeding in Chilean patients with breast cancer who receive tamoxifen adjuvant therapy. The obtained predictive models showed that the response to TAM treatment in BC patients is partly associated with some of the genetic variants studied. This, after a clinical validation protocol, might help to predict a percentage of disease relapse and negative effects of TAM therapy, improving the efficacy and safety of TAM pharmacotherapeutic treatment.

DATA AVAILABILITY STATEMENT
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation, to any qualified researcher.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by the Ethics Committee of the University of Chile (July 24, 2013), the Northern Metropolitan Health Service, National Cancer Institute (May 8, 2013), and Research Ethics Committee of the North Metropolitan Health Service (June 3, 2012. The patients/participants provided their written informed consent to participate in this study.

AUTHOR CONTRIBUTIONS
CM: The conception of the research, enrolment of patients, experimental analyses, analysis of data and, writing the manuscript. MG: Experimental analyses and analyses of data. RT: Enrolment of patients. KT: Enrolment of patients. DC: Analysis of data. KL: Analysis of data. MR: Analysis of data and writing the manuscript. NV: Analysis of data and writing the manuscript. LQ: The conception of the research, analysis of data and, writing the manuscript.