Star Allele-Based Haplotyping versus Gene-Wise Variant Burden Scoring for Predicting 6-Mercaptopurine Intolerance in Pediatric Acute Lymphoblastic Leukemia Patients

Park, Yoomi; Kim, Hyery; Choi, Jung Yoon; Yun, Sunmin; Min, Byung-Joo; Seo, Myung-Eui; Im, Ho Joon; Kang, Hyoung Jin; Kim, Ju Han

doi:10.3389/fphar.2019.00654

ORIGINAL RESEARCH article

Front. Pharmacol., 11 June 2019

Sec. Pharmacogenetics and Pharmacogenomics

Volume 10 - 2019 | https://doi.org/10.3389/fphar.2019.00654

Star Allele-Based Haplotyping versus Gene-Wise Variant Burden Scoring for Predicting 6-Mercaptopurine Intolerance in Pediatric Acute Lymphoblastic Leukemia Patients

YP
Yoomi Park ¹
HK
Hyery Kim ²
JY
Jung Yoon Choi ^3,4
SY
Sunmin Yun ¹
BM
Byung-Joo Min ¹
MS
Myung-Eui Seo ¹
HJ
Ho Joon Im ²
HJ
Hyoung Jin Kang ^3,4
JH
Ju Han Kim ^1,5^*

1. Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, South Korea
2. Department of Pediatrics, Asan Medical Center, University of Ulsan College of Medicine, Seoul, South Korea
3. Department of Pediatrics, Seoul National University College of Medicine, Seoul, South Korea
4. Seoul National University Cancer Research Institute, Seoul, South Korea
5. Center for Precision Medicine, Seoul National University Hospital, Seoul, South Korea

Article metrics

View details

Citations

4,5k

Views

1,6k

Downloads

Abstract

Nudix Hydrolase 15 (NUDT15) and Thiopurine S-Methyltransferase (TPMT) are strong genetic determinants of thiopurine toxicity in pediatric acute lymphoblastic leukemia (ALL) patients. Since patients with NUDT15 or TPMT deficiency suffer severe adverse drug reactions, star (*) allele-based haplotypes have been used to predict an optimal 6-mercaptopurine (6-MP) dosing. However, star allele haplotyping suffers from insufficient, inconsistent, and even conflicting designations with uncertain and/or unknown functional alleles. Gene-wise variant burden (GVB) scoring enables us to utilize next-generation sequencing (NGS) data to predict 6-MP intolerance in children with ALL. Whole exome sequencing was performed for 244 pediatric ALL patients under 6-MP treatments. We assigned star alleles with PharmGKB haplotype set translational table. GVB for NUDT15 and TPMT was computed by aggregating in silico deleteriousness scores of multiple coding variants for each gene. Poor last-cycle dose intensity percent (DIP < 25%) was considered as 6-MP intolerance, resulting therapeutic failure of ALL. DIPs showed significant differences ( p < 0.05) among NUDT15 poor (PM, n = 1), intermediate (IM, n = 48), and normal (NM, n = 195) metabolizers. TPMT exhibited no PM and only seven IMs. GVB showed significant differences among the different haplotype groups of both NUDT15 and TPMT ( p < 0.05). Kruskal–Wallis test for DIP values showed statistical significances for the seven different GVB score bins of NUDT15. GVB^NUDT15 outperformed the star allele-based haplotypes in predicting patients with reduced last-cycle DIPs at all DIP threshold levels (i.e., 5%, 10%, 15%, and 25%). In NUDT15-and-TPMT combined interaction analyses, GVB^NUDT15^,^TPMT outperformed star alleles [area under the receiver operating curve (AUROC) = 0.677 vs. 0.645] in specificity (0.813 vs. 0.796), sensitivity (0.526 vs. 0.474), and positive (0.192 vs. 0.164) and negative (0.953 vs. 0.947) predictive values. Overall, GVB correctly classified five more patients (i.e., one into below and four into above 25% DIP groups) than did star allele haplotypes. GVB analysis demonstrated that 6-MP intolerance in pediatric ALL can be reliably predicted by aggregating NGS-based common, rare, and novel variants together without hampering the predictive power of the conventional haplotype analysis.

Introduction

6-Mercaptopurine (6-MP) is a commonly used drug in the maintenance therapy of pediatric acute lymphoblastic leukemia (ALL). Since patients have a potential to experience medication-induced life-threatening side effects including bone marrow suppression and hepatotoxicity, providing a tailored drug dosing regimen is essential in clinical practice (Vogenberg et al., 2010).

One of the strongest ways to determine initial 6-MP dose is an experimental assessment of potential for drug adverse reactions, such as severe neutropenia by monitoring 6-MP metabolite concentration or using in vitro activity profiles (Dubinsky et al., 2000; Ansari et al., 2002;; Cuffari, 2005; Bradford, 2011; Supandi et al., 2018). However, applying such methods into routine clinical practice for predicting drug-induced toxicity is still challenging because it is extremely time-consuming, expensive, and inefficient (González-Lama and Gisbert, 2016).

As recent studies have demonstrated the strong association between genetic polymorphisms and inter-individual variability in 6-MP dose intensity, approaches to predict drug tolerance on the basis of individual genomic profiles have arisen. The primary genetic determinant of thiopurine toxicity is TPMT, which plays a crucial role in identifying patients in need of treatment modification with reduced enzyme activity (Lennard, 2014). However, this has not been applicable to East Asian populations since the frequency of TPMT polymorphisms varies by ethnicity (Relling et al., 2013). Recently, a novel pharmacogenetic marker, NUDT15, has clarified its role in predicting thiopurine toxicity in Asian populations (Yang et al., 2014; Yang et al., 2015; Zgheib et al., 2016; Kakuta et al., 2017). Clinical Pharmacogenetics Implementation Consortium (CPIC) published an updated guideline for thiopurine dosing based on both TPMT and NUDT15 genotypes using the star allele-based dose prediction method (Relling and Klein, 2009; Relling et al., 2018). This prevailing method provides therapeutic recommendations for dosing based on star allele genotypes. However, the utilization of star alleles in clinical practice has many obstacles that occur mainly due to 1) the extremely complex nomenclature system, 2) the limited resolution of phenotype prediction due to many unknown and uncertain function alleles, 3) ignorance of functional impacts of rare and/or novel variants, and 4) limited use in previously studied populations only (Robarge et al., 2007). Next-generation sequencing (NGS) challenges the conventional star alleles on the basis of genotyping technologies and clinical studies in case–control settings.

In the era of NGS, the comprehensive genotyping capabilities of NGS platform have enabled us to capture the true diversity of gene variation, and researchers propose alternative ways to predict individual intolerance towards a drug. One promising method is a gene-wise variant burden (GVB) scoring approach that can calculate gene-wise cumulative variant deleteriousness scores including common, rare, and even novel genetic variants for each gene (Lee et al., 2016). Here, we assessed the utility of GVB scoring method in quantifying the potential contributing effect of variants on enzymatic activity. By combining the clinically proven and well-established associations between the two genes, i.e., NUDT15 and TPMT, and 6-MP dose intensity percent (DIP, actual/planned dose) as a clinical endpoint, we performed a comparison study of the conventional star allele-based haplotyping and GVB scoring methods for predicting the last-cycle 6-MP DIP as an indicator for 6-MP intolerance of ALL patients with NUDT15 and/or TPMT deficiency. Overall, both star alleles and GVB showed significant correlations with 6-MP DIP values. Star allele-based haplotype groups showed significant correlation with GVB score groups. For predicting reduced last-cycle DIP values, GVB analysis outperformed the conventional star allele method for NUDT15 and showed comparable result for TPMT. In NUDT15-and-TPMT combined interaction analyses, GVB^NUDT15^,^TPMT outperformed star allele-based predictions [area under the receiver operating curve (AUROC) = 0.677 vs. 0.645] in specificity (0.813 vs. 0.796), sensitivity (0.526 vs. 0.474), and positive (PPV; 0.192 vs. 0.164) and negative (NPV; 0.953 vs. 0.947) predictive values. It is demonstrated that gene-wise evaluation of in silico deleterious variant score burden can be a useful method for predicting 6-MP intolerance in pediatric ALL patients, considering NGS-based common, rare, and novel variants concurrently while not hampering the predictive power of the conventional haplotype analysis.

Materials and Methods

Patients and Clinical Data Collection

A total of 298 Korean pediatric ALL patients with 6-MP treatment during maintenance therapy were recruited in the present study from two major teaching hospitals, i.e., Asan Medical Center (AMC) and Seoul National University Hospital (SNUH). Of the 298 subjects, 244 individuals who did not meet the exclusion criteria (i.e., relapse of the disease, stem cell transplantation, Burkitt’s lymphoma, mixed phenotype acute leukemia, infant ALL, or very high risk) were selected. All participants provided written informed consent. The study was approved by the AMC Review Boards and the SNUH Review Boards. The 6-MP dose per meter body surface area over a 12-week cycle was recorded. The maximum tolerated dose of 6-MP was defined as the dose at the last maintenance cycle for each patient. Patients from two hospitals had received treatment under the same treatment protocol and dose adjustment guidelines to maintain the ANC levels within target levels (500–1,500/µL). Genotype-guided dose modification was not conducted. Additional demographic data are shown in Table 1.

Table 1

Characteristics	Study cohorts
	AMC	SNUH	Total
No. of subjects	95	149	244
Age at diagnosis (year), mean ± SD^†	5.23 ± 1.8	8.57 ± 4.6	7.26 ± 4.1
Sex
Male	52	93	145
Female	43	56	99
Last-cycle 6-MP dose (mg/m²/day), mean ± SD (N)
6-MP < 12.5	8.14 ± 1.7 (2)	6.25 ± 2.9 (4)	6.88 ± 2.6 (6)
12.5 ≤ 6-MP < 25	17.39 ± 3.4 (4)	19.40 ± 3.6 (9)	18.78 ± 3.7 (13)
25 ≤ 6-MP < 37.5	32.19 ± 3.4 (10)	30.72 ± 4.0 (16)	31.28 ± 3.8 (26)
37.5 ≤ 6-MP < 50	44.52 ± 3.7 (13)	45.80 ± 3.5 (14)	45.18 ± 3.6 (27)
6-MP ≥ 50	79.15 ± 18.1 (66)	78.84 ± 23.1 (106)	78.96 ± 21.3 (172)
Total	65.37 ± 26.6 (95)	65.03 ± 30.0 (95)	65.16 ± 28.7 (244)

Clinical characteristics of study subjects.

^†Data for age at diagnosis were not available for one subject. 6-MP, 6-mercaptopurine; AMC, Asan Medical Center; SNUH, Seoul National University Hospital.

Data Generation and Sequencing

Exome sequencing was performed using Ion AmpliSeq^™ Exome panel to screen coding sequence region of entire genome. This panel included the exome of 19,072 genes and the size of the total targeted region was 57.7 Mb. The panel contained 293,903 primer pairs that were multiplexed into 12 pools to avoid primer-dimer formation and interference during PCR. The range of amplicons amplified by these oligo primer pairs ranged from 125 to 275 bp, and the rate of “on target” coverage for this panel was 95.69%. PCR assays were performed directly to amplify 100 ng of genomic DNA samples extracted from normal blood cells in bone marrow aspirates or peripheral blood so as to collect the target regions using the oligo primer pairs of the panel. Reaction parameters were as follows: 99°C for 2 min, followed by 10 cycles of 99°C for 15 s, 60°C for 16 min, and 10°C for 1 min. After amplification, library construction was performed by using the Ion AmpliSeq library kit plus as described in the manufacturer’s instructions (Thermo Scientific, Waltham, MA). Libraries were quantified using an Agilent 2100 Bioanalyzer (Agilent, Santa Clara, CA) and then diluted to ∼10 pM. Subsequently, 33.3 μL of the barcoded libraries was combined in sets of three barcodes. The combined libraries were sequenced using the Ion Proton platform with PI chip V3, following the manufacturer’s instructions (Thermo Scientific, Waltham, MA). Reads were mapped to the human reference genome build (hg19) with a mapping alignment program from Thermo Fisher (version 4.4, Torrent Suite Software) on germ-line and low stringency settings (minimum observed allele frequency required for a non-reference variant call is 0.18 for single-nucleotide variant (SNV) and 0.23 for InDel, minimum phred scales call quality is 14 for SNV and 19 for InDel, minimum coverage for called variants is 35 for SNV and 40 for InDel, and maximum strand bias is 0.95 for SNV and 0.75 for InDel). Single-nucleotide variants (SNVs) and short insertions/deletions (InDels) were identified via Genome Analysis Toolkit (GATK) 2.8-1 Unified Genotyper (DePristo et al., 2011). To estimate the pathogenicity of variants, two in silico variant deleteriousness prediction scores were annotated: sorting intolerant from tolerant (SIFT) (Ng, 2003) and combined annotation dependent depletion (CADD) (Kircher et al., 2014). The protein-coding gene region was defined using ANNOVAR (http://annovar.openbioinformatics.org/) (Wang et al., 2010). All the variants identified in 244 ALL samples are described in Supplementary Table S1 and S2.

Calculation of Gene-Wise Variant Burden Score

Gene-wise deleterious variant burden was computed for NUDT15 and TPMT as described by Lee et al. (2016) and Seo et al. (2018). Under the hypothesis that variants that have potential effects to change protein function not necessarily guarantee but have power to cause harmful phenotypes, only variants with SIFT scores less than 0.7 were further considered.

As SIFT does not provide functional scores for InDels, _adjv for all InDel variants were assigned as 1e-8 under the hypothesis that InDels are more deleterious than single-nucleotide substitutions. Considering the dosage effects, adjusted SIFT score_adjv was calculated for each variant according to their genotype.

For each gene G_i with n deleterious variants, we calculated GVB(G_i), the cumulative genic effect for all coding variants of the gene, by calculating the geometric mean of _adjv (Equation). GVB_g is considered as 1 if the count n of variant j with scores less than 0.7 in a gene is 0, indicating that the gene is not displaying any deleterious variant.

We obtained GVB_g values for each individual ranging from 0 to 1. To predict 6-MP sensitivity, GVB^NUDT15^,^TPMT was generated by calculating the geometric mean of GVB^NUDT15 and GVB^TPMT.

Prediction of Star Allele Diplotypes for 244 Acute Lymphoblastic Leukemia Samples

To classify 244 ALL samples into three metabolizer groups, we inferred haplotypes using the PHASE 2.1.1 software (Stephens et al., 2001; Stephens and Scheet, 2005) (Supplementary Figure S3). On the basis of the inferred haplotype information, we extracted star alleles that matched the haplotype set translational table from PharmGKB (https://www.pharmgkb.org/) (Whirl-Carrillo et al., 2012). Predicted genotypes were translated into molecular phenotypes on the basis of the coded genotype–phenotype translation tables from Moriyama et al. (2016) for NUDT15 and from PharmGKB tables for TPMT.

Estimation of Diagnostic Accuracies by Receiver Operating Curve Analyses

To assess prediction accuracies, we calculated DIP, the percentage of the actual administered dose to the planned dose, as an index for 6-MP drug toxicity. Dose in the last maintenance cycle was used, since the doses of 6-MP in the final maintenance cycle were supposed to be the maximum tolerated doses for patients (Kim et al., 2012). DIP prediction accuracies of GVB (GVB^NUDT15, GVB^TPMT, and GVB^NUDT15^,^TPMT) and star allele-based predictions were compared using AUROC analysis with the R language pROC package (Robin et al., 2011). We computed specificity, sensitivity, PPV, and NPV under the binary classification model with nine different cutoff levels (i.e., 5%, 10%, 15%, 25%, 35%, 45%, 60%, 80%, and 100%) for defining high-risk DIP groups. All statistical analyses were performed using R version 3.5.1.

Results

Relation of Gene-Wise Variant Burden and Star Allele-Based Molecular Phenotypes

NUDT15 and TPMT haplotypes of each subject were first inferred from whole exome sequencing (WXS) data by using the PHASE tool, and matched star allele genotypes were assigned for each subject. The star allele genotypes were then translated into three molecular phenotype groups according to their allele combinations; poor (PM, No function|No function), intermediate (IM, Normal|No function or Normal|Decreased), and normal (NM, Normal|Normal) metabolizers. Six and four star alleles were identified for NUDT15 and TPMT genes, respectively, from the 244 ALL patients with their frequencies (Table 2). Table 3 shows the distribution of subsequently predicted enzymatic metabolizer phenotypes for NUDT15 and TPMT among the 244 ALL patients.

Table 2

Gene	Number of identified alleles	Alleles identified in 244 ALL samples	Frequencies (%)
NUDT15	6	*1	438 (89.75)
		*2	6 (1.23)
		*3	35 (7.17)
		*4	4 (0.82)
		*5	4 (0.82)
		*6	1 (0.20)
TPMT	4	*1	127 (26.02)
		*1S	354 (72.54)
		*3C	6 (1.23)
		*6	1 (0.20)

Alleles identified in 244 ALL samples with known allele functions.

Haplotypes were inferred via PHASE 2. Star alleles were assigned by the PharmGKB haplotype set translational table. ALL, acute lymphoblastic leukemia.

Table 3

Molecular phenotype	Function	NUDT15	TPMT
Poor (%)	No function \| No function	1 (0.41)	NA
Intermediate (%)	Normal \| No function	48 (19.67)	6 (2.46)
	Normal \| Decreased	NA	1 (0.41)
Normal (%)	Normal \| Normal	195 (79.92)	237 (97.13)
Total (%)		244 (100)	244 (100)

Distribution of predicted enzymatic metabolizer phenotypes.

Molecular phenotypes were assigned using the PharmGKB haplotype set translational table. Star (*) allele genotype-to-phenotype correlation was adapted from information available at the Moriyama et al. (NUDT15) and the Clinical Pharmacogenetics Implementation Consortium (CPIC) guideline ( TPMT ); NA, not available.

While 49 (20.1%) of 244 ALL patients were classified into non-NM (one PM and 48 IMs) phenotype for NUDT15, only seven (2.9%) IMs were identified for TPMT, reflecting ethnic variation of NUDT15 and TPMT variants, in a consistent manner (Table 3). Since individuals with TPMT homozygous mutant alleles are rarely observed in East Asian population, none of the patients were classified into the poor metabolizer group. IMs were stratified into two groups: 1) individuals carrying one copy of a normal function allele and one copy of a decreased function allele and 2) individuals carrying one copy of a normal function allele and one copy of no function allele. Carriers of non-functional allele, compared with carriers of decreased function allele, are considered to be at an increased risk for functional decline.

Patients with NUDT15 normal metabolizing alleles (DIP = 67.608 ± 28.2, n = 195) tolerated significantly higher DIPs of 6-MP than did slow metabolizers [5.712 (PM, n = 1), 56.452 ± 28.2 (IM, n = 48)] (Figure 1A). Clinical usefulness of the conventional star allele-based classification was successfully demonstrated for NUDT15 variants in the present study. Due to the small number of non-NM subjects for TPMT in Korean ALL patients, the difference of DIPs between NM (65.702 ± 28.4, n = 237) and IM (46.805 ± 35.7, n = 7) did not reach statistical significance (p = 0.10, Figure 1B).

Figure 1

GVB scores among different molecular phenotype groups for NUDT15 (PM = 0.09, IM = 0.248 ± 0.1, and NM = 0.995 ± 0.1, Figure 2A) and for TPMT (IM = 0.229 ± 0.3, NM = 1 ± 0.0, Figure 2B) showed statistically significant differences. The observed positive correlation between our GVB score and the conventional enzymatic metabolizer phenotypes for both NUDT15 and TPMT variants strongly supported our further analysis. Note that those pharmacogenetic star alleles have long been empirically developed by clinical case–control studies and/or animal and molecular studies. In contrast, the GVB analysis is based on purely theoretical ab initio and in silico methods without requiring empirical studies that are prohibitively costly considering the numerous drugs and genetic variants discovered by NGS technologies and the interactions. In the following sections, we explore the potential of the GVB scoring method for predicting DIPs as an indicator of 6-MP intolerance in pediatric ALL patients.

Figure 2

Gene-Wise Variant Burden Scores for Predicting Last-Cycle 6-Mercaptopurine Dose Intensity Percent

Since both NUDT15 and TPMT genes are not highly variable, only seven and two GVB value groups for NUDT15 and TPMT, respectively, were identified in the 244 ALL patients. GVB^NUDT15 demonstrated statistically significant positive correlation with DIP (p = 0.016 by Kruskal–Wallis test, p = 0.001 (p = 0.21) by Spearman’s rank correlation, p = 0.001 (τ = 0.17) by Kendall’s rank correlation) (Figure 3A). Exclusion of the two patients having both NUDT15 and TPMT variants slightly improved statistical significance (Supplementary Figure S4). Due to the low frequency of TPMT alleles in East Asian population, 97.5% (n = 238) of all ALL patients were classified into wild type (GVB^TPMT = 1.00 ± 0.00) and only six (2.50%) were classified into variant type (GVB^TPMT = 0.10 ± 0.00) groups, resulting in poor statistical significance (p = 0.408 by T-test, p = 0.272 (ρ = 0.07) by Spearman’s rank correlation, p = 0.271 (τ = 0.06) by Kendall’s rank correlation) (Figure 3B).

Figure 3

Performance Comparisons Between Gene-Wise Variant Burden and Star Allele-Based Molecular Phenotypes Across Different Risk Group Decision Thresholds

Using ROC analysis, we evaluated the performances of GVB at nine cutoff levels (i.e., DIP < 5%, 10%, 15%, 25%, 35%, 45%, 60%, 80%, and 100%) for defining the 6-MP high-risk groups. Star allele-based classification was also applied for systematic comparison across different DIP threshold levels. DIP below 25% of planned dose of 6-MP is a generally accepted threshold for predicting 6-MP intolerance. Figure 4A demonstrates that GVB^NUDT15 showed better AUCs at all threshold DIP levels below 25% (0.998 (DIP < 5%), 0.676 (DIP < 10%), 0.669 (DIP < 15%), and 0.653 (DIP < 25%)) than did the conventional star allele-based molecular phenotypes (AUC = 0.618). Moreover, exclusion of the two confounding patients with both NUDT15 and TPMT variant alleles slightly improved performances than did both before-exclusion GVB^NUDT15 at all threshold DIP levels below 25% [AUC = 0.998 (DIP < 5%), 0.676 (DIP < 10%), 0.639 (DIP < 15%), and 0.627 (DIP < 25%)] and the star allele-based (AUC = 0.596) analyses (Figure 4B). Mainly due to the low frequency of TPMT variant alleles in East Asian population, both GVB^TPMT and star allele-based predictions using TPMT seem to show poor AUCs for predicting DIP at all threshold levels (Figure 4C and D).

Figure 4

More importantly, we performed ROC analysis by aggregating the genetic effects of these two genes, NUDT15 for East Asian and TPMT for European heritages. We computed and evaluated GVB^NUDT15^,^TPMT, which outperformed GVB^NUDT15 or GVB^TPMT alone as well as the combined molecular phenotypes of both NUDT15 and TPMT at all DIP threshold levels (Figure 5). In summary, at the clinically important DIP level of below or above 25%, the best AUC values for GVB^NUDT15^,^TPMT, GVB^NUDT15, GVB^TPMT, and combined star alleles were 0.677, 0.653, 0.574, and 0.645, respectively. GVB^NUDT15^,^TPMT not only showed the best performance but also successfully included the two confounding patients with both NUDT15 and TPMT variant alleles. While combining GVB scores of multiple genes is simple and straightforward, it is not the case for star alleles, which do not provide a uniform way of combining method for multiple genes.

Figure 5

Comparison of Prediction Accuracies Between Gene-Wise Variant Burden and Star Allele-Based Methods

To test the clinical utility of GVB method for guiding 6-MP dosing and/or for providing systematic framework for clinical studies of 6-MP intolerance and its genetic determinants of NUDT15 and TPMT for predicting DIP groups, we evaluated the diagnostic characteristics of the conventional star allele-based and GVB scoring methods in a simulated clinical setting. Table 4A and 4B exhibits diagnostic accuracies for star allele-based molecular phenotype groups and gene-wise variant burden score groups, respectively, for 6-MP intolerance among 244 pediatric ALL patients by the last-cycle DIP of 6-MP. Of the 244 ALL patients, 189 (84.4%) exhibited no NUDT15 or TPMT variant and hence was classified into NMs for both genes (Table 4A). Of the rest 55 non-NM patients, nine (16.4%) showed DIP below 25%, while 10 of 189 (5.3%) NM patients showed low DIP values.

Table 4

(A)
NUDT15 and TPMT metabolizer	Dose intensity percent groups		Total
	≤25	>25
PM + IM	9	46	55	PPV 16.36% (9/55)
NM	10	179	189	NPV 94.70% (179/189)
Total	19	225	244
	Sensitivity 47.36% (9/19)	Specificity 79.56% (179/225)		Accuracy 77.05% (188/244)
(B)
Gene-wise variant burden score	Dose intensity percent groups		Total
	≤25	>25
GVB^NUDT15,TPMT≤ 0.3	10	42	52	PPV 19.23% (10/52)
GVB^NUDT15,TPMT> 0.3	9	183	192	NPV 95.31% (183/192)
Total	19	225	224
	Sensitivity 52.63% (10/19)	Specificity 81.33% (183/225)		Accuracy 79.10% (193/244)

Comparison of star allele-based haplotyping versus gene-wise variant burden (GVB^NUDT15^,^TPMT) analyses for 6-mercaptopurine intolerance measured by last-cycle dose intensity percent in ALL. Diagnostic accuracy table of (A) star allele-based haplotypes and dose intensity percent groups and (B) gene-wise variant burden score and dose intensity percent groups.

PM, poor metabolizer; IM, intermediate metabolizer; NM, normal metabolizer; PPV, positive predictive value; NPV, negative predictive value.

Although one can choose many threshold levels of GVB, because star alleles can just provide a small number of categories, we chose the most reliable binning threshold of GVB^NUDT15^,^TPMT ≤ 0.3, the cut-point that maximizes the Youden index (Supplementary Figure S5), for classifying the patients into the below and above 25% DIP groups as shown in Table 4B. It is a coincidence that Lee et al. (2016) also suggested GVB^{Pharmacogenes} ≤ 0.3 as the threshold for predicting pharmaceutical market withdrawals in general. GVB^NUDT15^,^TPMT correctly classified one more high-risk (DIP ≤ 25%) and four more low-risk (DIP > 25%) patients into the correct-risk groups (Table 4B) than did the traditional haplotype-based method (Table 4A), with an improved sensitivity from 47.36% to 52.63% and an improved specificity from 79.56% to 81.33%, though the difference did not reach statistical significance (p-value for sensitivity = 1 and p-value for specificity = 0.134, as determined using a McNemar test). Both PPV and NPV increase from 16.36% to 19.23% and from 94.70% to 95.31%, respectively. Overall, it is suggested that the “computational” GVB^NUDT15^,^TPMT is an improved or at least comparable predictor than the “empirical” star allele-based haplotypes for determining subjects with increased risk of 6-MP intolerance in pediatric ALL patients measured by the last-cycle 6-MP DIP.

Discussion

An enduring challenge in precision medicine is to predict adequate drug responses for individual patients (Shah and Shah, 2012). Recent discoveries have revealed a few highly functional and clinically relevant novel variants associated with 6-MP intolerance. However, since implicating drug toxicity based on a single variant is notoriously unreliable as shown in Supplementary Figure S1 for SIFT and Supplementary Figure S2 for CADD, developing strategies to aggregate the key effects over a range of genomic region is highly required. In the present study, we evaluated the utility of gene-wise deleterious variant burden scoring method, as a sequencing-based, simple, reliable, quantitative, and easy-to-compare score for predicting 6-MP intolerance of 244 pediatric ALL patients. In addition to DIP, GVB showed a statistically significant negative correlation with the incidence of grade 4 neutropenia (p = 1.43E−04 by Kruskal–Wallis test, p = 3.89E−07 (ρ = −0.32) by Spearman’s rank correlation, and p = 8.06E−07 (τ = −0.27) by Kendall’s rank correlation (Supplementary Figure S6). This implies that GVB is a reliable score that can predict hematological toxicity in pediatric ALL patients. When beginning treatment, NGS-based drug intolerance prediction is useful because it is practical to detect patients at high risk of toxicity. For example, patients with low GVB have a high probability of 6-MP toxicity at the initial recommended dose range; thus, clinicians may attempt to reduce the initial target dose of 6-MP. After an initial target dose is determined, a close therapeutic drug monitoring could help to avoid potential causes for toxicity, such as clinically relevant drug–drug interactions, reduced drug clearance due to liver and/or renal impairment, and altered drug utilization due to physiological conditions, as a complementary type of practice during the treatment (Ju-Seop Kang, 2009).

GVB analysis has several benefits over conventional star allele-based approaches. GVB 1) quantitates gene-wise variant burden with a single score; 2) provides a measure of inter-individual genetic variability for each gene; 3) considers common, rare, and novel genetic variants together; 4) provides an ethnic variability-neutral method for studying pharmacogenomics; 5) provides a systematic and reliable framework for designing further pharmacogenomics studies considering many gene interactions for clinical endpoints; and 6) adopts the contributing effect of novel low-frequency variants with potentially reduced function in predicting individual drug toxicity.

Based upon the very recent CPIC updates on NUDT15, three newly enrolled alleles were characterized (Moriyama et al., 2017). Since new haplotype designation is highly dependent on the characteristics of the study population, there will be restrictions in incorporating new or even as-yet-unidentified evidences in predicting future drug intensity. GVB can be used to develop a model to determine optimal doses without requiring a multi-ethnic population study, especially for under-studied subpopulations.

The following limitations are inherent in the present study. To evaluate the validity of GVB, independent replication studies for an expanded gene–drug set with sufficient sample sizes in diverse ethnic groups are required as no novel variant was identified in the current study. A conventional single variant-based association test of rare variants requires infeasible magnitude of sample sizes (Bansal et al., 2010), but approaches that aggregate common, rare, and novel variants jointly will substantially reduce a required effective sample sizes (Witte, 2012). The robustness of the analysis framework shall further be improved as novel prognostic markers on 6-MP DIP are acquired. The limitations in interpreting the score includes that all InDels are treated as highly damaging as SIFT provides scores for only single-nucleotide variants. As there are many in silico variant deleteriousness scoring method based on different principles, comprehensive evaluation of different method is required (Supplementary Figure S7). We also performed CADD-based computation of GVB values, resulting in similar results (Supplementary Figures S8 and S9). It has been reported that CADD tends to evaluate in-frame InDels as relatively benign (Kircher et al., 2014). However, recent in vitro activity assay of NUDT15 (Moriyama et al., 2017) proved that in-frame InDel carriers are more likely to be in states with severely diminished response to 6-MP. It is strongly recommended that for clinical applications, potential clinical impacts of genetic variants on drug sensitivity should be further examined to improve estimation accuracy, as in silico prediction scores can exhibit incorrect predictions. Producing a custom capture panel for clinically actionable genes could be more cost-effective than an exome-based approach.

One subject who was correctly classified by GVB carried a low-frequency novel deletion and predicted to belong to the high-risk group by GVB, whereas star allele-based prediction classified this patient into the NM group for both NUDT15 and TPMT. The patient required reduced dose than recommended (DIP = 23.7%), supporting that GVB analysis resulted in 6-MP dose-related adverse drug reactions. The patient’s variant was heterozygous p.Gly17_Val18del, which was very recently assigned as NUDT15*9 with uncertain functionality. The other four who were correctly classified by GVB had p.Arg139His on one allele, which has assigned them to the IM (NUDT15 *1/*4) group. GVB classified them as relatively safe for drug toxicity, and none of them required a 25% reduction from the starting dose. Additionally, one patient who was classified as high risk by GVB was assigned to IM for both NUDT15 and TPMT and required a severely reduced dose (14%), suggesting that GVB^NUDT15^,^TPMT exhibits benefits in aggregating effects of many moderate genetic determinants into a single quantitative value.

Funding

This research was supported by a grant (16183MFDS541) from the Ministry of Food and Drug Safety in 2019.

Statements

Ethics statement

The study was approved by the Asan Medical Center (AMC) Review Boards and the Institutional Review Board of Seoul National University Hospital (SNUH). Informed written consents for blood sampling and analyses were obtained from all participants.

Author contributions

YP and JK designed the model and the framework. HK, JC, HI, and HK collected samples and clinical data. BM and MS carried out the experiment. YP and SY analyzed the data and carried out the implementation. YP performed the calculations. YP and JK wrote the manuscript. JK conceived the study and was in charge of overall direction and planning. All authors read and approved the final manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2019.00654/full#supplementary-material

References

1
AnsariA.HassanC.DuleyJ.MarinakiA.Shobowale-BakreE. M.SeedP.et al. (2002). Thiopurine methyltransferase activity and the use of azathioprine in inflammatory bowel disease. Aliment. Pharmacol. Ther.16, 1743–1750. doi: 10.1046/j.1365-2036.2002.01353.x
- CrossRef
- Google Scholar
2
BansalV.LibigerO.TorkamaniA.SchorkN. J. (2010). Statistical analysis strategies for association studies involving rare variants. Nat. Rev. Genet.11, 773–785. doi: 10.1038/nrg2867
- CrossRef
- Google Scholar
3
BradfordK. (2011). Optimizing 6-mercaptopurine and azathioprine therapy in the management of inflammatory bowel disease. WJG.17, 4166–4168. doi: 10.3748/wjg.v17.i37.4166
- CrossRef
- Google Scholar
4
CuffariC. (2005). A physician’s guide to azathioprine metabolite testing. Gastroenterol. Hepatol.2, 58–63. Retrieved from http://www.gastroenterologyandhepatology.net/.
- Google Scholar
5
DePristoM. A.BanksE.PoplinR.GarimellaK. V.MaguireJ. R.HartlC.et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet.43, 491–498. doi: 10.1038/ng.806
- CrossRef
- Google Scholar
6
DubinskyM. C.LamotheS.YangH. Y.TarganS. R.SinnettD.ThéorêtY.et al. (2000). Pharmacogenomics and metabolite measurement for 6-mercaptopurine therapy in inflammatory bowel disease. Gastroenterology118, 705–713. doi: 10.1016/S0016-5085(00)70140-5
- CrossRef
- Google Scholar
7
González-LamaY.GisbertJ. P. (2016). Monitoring thiopurine metabolites in inflammatory bowel disease. Frontline Gastroenterol.7, 301–307. doi: 10.1136/flgastro-2015-100681
- CrossRef
- Google Scholar
8
Ju-Seop KangM.-H. L. (2009). Overview of therapeutic drug monitoring. Korean J. Inter. Med.24, 1. doi: 10.3904/kjim.2009.24.1.1
- CrossRef
- Google Scholar
9
KakutaY.KinouchiY.ShimosegawaT. (2017). Pharmacogenetics of thiopurines for inflammatory bowel disease in East Asia: prospects for clinical application of NUDT15 genotyping. J. Gastroenterol.53, 172–180. doi: 10.1007/s00535-017-1416-0
- CrossRef
- Google Scholar
10
KimH.KangH. J.KimH. J.JangM. K.KimN. H.OhY.et al. (2012). Pharmacogenetic analysis of pediatric patients with acute lymphoblastic leukemia: a possible association between survival rate and ITPA polymorphism. PLoS ONE7, e45558–45510. doi: 10.1371/journal.pone.0045558
- CrossRef
- Google Scholar
11
KircherM.WittenD. M.JainP.O’RoakB. J.CooperG. M.ShendureJ. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet.46, 310–315. doi: 10.1038/ng.2892
- CrossRef
- Google Scholar
12
LeeK. H.BaikS. Y.LeeS. Y.ParkC. H.ParkP. J.KimJ. H. (2016). Genome sequence variability predicts drug precautions and withdrawals from the market. PLoS ONE11, e0162135–0162115. doi: 10.1371/journal.pone.0162135
- CrossRef
- Google Scholar
13
LennardL. (2014). Implementation of TPMT testing. Br. J. Clin. Pharmacol.77, 704–714. doi: 10.1111/bcp.12226
- CrossRef
- Google Scholar
14
MoriyamaT.NishiiR.Perez-AndreuV.YangW.KlussmannF. A.ZhaoX.et al. (2016). NUDT15 polymorphisms alter thiopurine metabolism and hematopoietic toxicity. Nat. Genet.48, 367–373. doi: 10.1038/ng.3508
- CrossRef
- Google Scholar
15
MoriyamaT.YangY.-L.NishiiR.AriffinH.LiuC.LinT.-N.et al. (2017). Novel variants in NUDT15 and thiopurine intolerance in children with acute lymphoblastic leukemia from diverse ancestry. Blood130, 1209–1212. doi: 10.1182/blood-2017-05-782383
- CrossRef
- Google Scholar
16
NgP. C. (2003). SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res.31, 3812–3814. doi: 10.1093/nar/gkg509
- CrossRef
- Google Scholar
17
RellingM. V.KleinT. E. (2009). CPIC: Clinical Pharmacogenetics Implementation Consortium of the pharmacogenomics research network. Clin. Pharmacol. Ther.89, 464–467. doi: 10.1038/clpt.2010.279
- CrossRef
- Google Scholar
18
RellingM. V.GardnerE. E.SandbornW. J.SchmiegelowK.PuiC.-H.YeeS. W.et al. (2013). Clinical Pharmacogenetics Implementation Consortium guidelines for thiopurine methyltransferase genotype and thiopurine dosing: 2013 update. Clin. Pharmacol. Ther.93, 324–325. doi: 10.1038/clpt.2013.4
- CrossRef
- Google Scholar
19
RellingM. V.SchwabM.Whirl-CarrilloM.Suarez-KurtzG.PuiC.-H.SteinC. M.et al. (2018). Clinical Pharmacogenetics Implementation Consortium (CPIC) guideline for thiopurine dosing based on TPMT and NUDT15 genotypes: 2018 update. Clin. Pharmacol. Ther.105, 1095–1105. doi: 10.1002/cpt.1304
- CrossRef
- Google Scholar
20
RobargeJ. D.LiL.DestaZ.NguyenA.FlockhartD. A. (2007). The star-allele nomenclature: retooling for translational genomics. Clin. Pharmacol. Ther.82, 244–248. doi: 10.1038/sj.clpt.6100284
- CrossRef
- Google Scholar
21
RobinX.TurckN.HainardA.TibertiN.LisacekF.SanchezJ.-C.et al. (2011). pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinforma.12, 77. doi: 10.1186/1471-2105-12-77
- CrossRef
- Google Scholar
22
SeoH.KwonE. J.YouY.-A.ParkY.MinB. J.YooK.et al. (2018). Deleterious genetic variants in ciliopathy genes increase risk of ritodrine-induced cardiac and pulmonary side effects. BMC Med. Genomics11, 31. doi: 10.1186/s12920-018-0323-4
- CrossRef
- Google Scholar
23
ShahR. R.ShahD. R. (2012). Personalized medicine: is it a pharmacogenetic mirage? Br J Clin. Pharmacol.74, 698–721. doi: 10.1111/j.1365-2125.2012.04328.x
- CrossRef
- Google Scholar
24
StephensM.ScheetP. (2005). Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation. Am. J. Hum. Genet.76, 449–462. doi: 10.1086/428594
- CrossRef
- Google Scholar
25
StephensM.SmithN. J.DonnellyP. (2001). A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet.68, 978–989. doi: 10.1086/319501
- CrossRef
- Google Scholar
26
SupandiS.HarahapY.HarmitaH.AndalusiaR. (2018). Quantification of 6-mercaptopurine and its metabolites in patients with acute lympoblastic leukemia using dried blood spots and UPLC-MS/MS. Sci. Pharm.86, 18–18. doi: 10.3390/scipharm86020018
- CrossRef
- Google Scholar
27
VogenbergF. R.BarashC. I.PurselM. (2010). Personalized medicine, part 1: evolution and development into theranostics. Pharm. Ther.35, 560–576. Retrieved from https://www.ptcommunity.com/.
- Google Scholar
28
WangK.LiM.HakonarsonH. (2010). ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res.38, e164–e164. doi: 10.1093/nar/gkq603
- CrossRef
- Google Scholar
29
Whirl-CarrilloM.McDonaghE. M.HebertJ. M.GongL.SangkuhlK.ThornC. F.et al. (2012). Pharmacogenomics knowledge for personalized medicine. Clin. Pharmacol. Ther.92, 414–417. doi: 10.1038/clpt.2012.96
- CrossRef
- Google Scholar
30
WitteJ. S. (2012). Rare genetic variants and treatment response: sample size and analysis issues. Stat. Med.31, 3041–3050. doi: 10.1002/sim.5428
- CrossRef
- Google Scholar
31
YangJ. J.LandierW.YangW.LiuC.HagemanL.ChengC.et al. (2015). Inherited NUDT15 variant is a genetic determinant of mercaptopurine intolerance in children with acute lymphoblastic leukemia. JCO33, 1235–1242. doi: 10.1200/JCO.2014.59.4671
- CrossRef
- Google Scholar
32
YangS.-K.HongM.BaekJ.ChoiH.ZhaoW.JungY.et al. (2014). A common missense variant in NUDT15 confers susceptibility to thiopurine-induced leukopenia. Nat. Genet.46, 1017–1020. doi: 10.1038/ng.3060
- CrossRef
- Google Scholar
33
ZgheibN. K.AkikaR.MahfouzR.AridiC. A.GhanemK. M.SaabR.et al. (2016). NUDT15 and TPMT genetic polymorphisms are related to 6-mercaptopurine intolerance in children treated for acute lymphoblastic leukemia at the Children’s Cancer Center of Lebanon. Pediatr. Blood Cancer64, 146–150. doi: 10.1002/pbc.26189
- CrossRef
- Google Scholar

Summary

Keywords

6-mercaptopurine, drug toxicity, variant burden, pharmacogenetics, pharmacogenomics, next-generation sequencing, Nudix Hydrolase 15 (NUDT15), Thiopurine S-Methyltransferase (TPMT)

Citation

Park Y, Kim H, Choi JY, Yun S, Min B-J, Seo M-E, Im HJ, Kang HJ and Kim JH (2019) Star Allele-Based Haplotyping versus Gene-Wise Variant Burden Scoring for Predicting 6-Mercaptopurine Intolerance in Pediatric Acute Lymphoblastic Leukemia Patients. Front. Pharmacol. 10:654. doi: 10.3389/fphar.2019.00654

Received

14 January 2019

Accepted

20 May 2019

Published

11 June 2019

Volume

10 - 2019

Edited by

Martin A. Kennedy, University of Otago, New Zealand

Reviewed by

Chakradhara Rao, Satyanarayana Uppugunduri, Université de Genève, Switzerland; Maria J. Prata, University of Porto, Portugal; William Newman, University of Manchester, United Kingdom

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ju Han Kim, juhan@snu.ac.kr

This article was submitted to Pharmacogenetics and Pharmacogenomics, a section of the journal Frontiers in Pharmacology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Pharmacogenetics and Pharmacogenomics

ORIGINAL RESEARCH article

Star Allele-Based Haplotyping versus Gene-Wise Variant Burden Scoring for Predicting 6-Mercaptopurine Intolerance in Pediatric Acute Lymphoblastic Leukemia Patients

Abstract

Introduction