ORIGINAL RESEARCH article
Evaluation of Host Serum Protein Biomarkers of Tuberculosis in sub-Saharan Africa
- 1Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, United Kingdom
- 2Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, United States
- 3DST-NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
- 4Centre for Statistical Consultation, Stellenbosch University, Cape Town, South Africa
- 5Department of Medicine, Wellcome Centre for Infectious Diseases Research in Africa, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
- 6MRC Epidemiology Unit, University of Cambridge, Cambridge, United Kingdom
- 7The Francis Crick Institute, London, United Kingdom
- 8Department of Infection Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, London, United Kingdom
- 9Malawi Epidemiology and Intervention Research Unit, Karonga Prevention Study, Lilongwe, Malawi
- 10Department of Infectious Disease Epidemiology, Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, London, United Kingdom
- 11Institute of Health and Wellbeing, University of Glasgow, Glasgow, United Kingdom
- 12Department of Clinical Infection, Microbiology and Immunology, Institute of Infection and Global Health, University of Liverpool, Liverpool, United Kingdom
Accurate and affordable point-of-care diagnostics for tuberculosis (TB) are needed. Host serum protein signatures have been derived for use in primary care settings, however validation of these in secondary care settings is lacking. We evaluated serum protein biomarkers discovered in primary care cohorts from Africa reapplied to patients from secondary care. In this nested case-control study, concentrations of 22 proteins were quantified in sera from 292 patients from Malawi and South Africa who presented predominantly to secondary care. Recruitment was based upon intention of local clinicians to test for TB. The case definition for TB was culture positivity for Mycobacterium tuberculosis; and for other diseases (OD) a confirmed alternative diagnosis. Equal numbers of TB and OD patients were selected. Within each group, there were equal numbers with and without HIV and from each site. Patients were split into training and test sets for biosignature discovery. A nine-protein signature to distinguish TB from OD was discovered comprising fibrinogen, alpha-2-macroglobulin, CRP, MMP-9, transthyretin, complement factor H, IFN-gamma, IP-10, and TNF-alpha. This signature had an area under the receiver operating characteristic curve in the training set of 90% (95% CI 86–95%), and, after adjusting the cut-off for increased sensitivity, a sensitivity and specificity in the test set of 92% (95% CI 80–98%) and 71% (95% CI 56–84%), respectively. The best single biomarker was complement factor H [area under the receiver operating characteristic curve 70% (95% CI 64–76%)]. Biosignatures consisting of host serum proteins may function as point-of-care screening tests for TB in African hospitals. Complement factor H is identified as a new biomarker for such signatures.
Tuberculosis (TB) remains a leading cause of death from any infection worldwide. The number of people accessing treatment is increasing each year, but in 2019 there were still an estimated 10 million cases and 1.4 million deaths (1). The region with the highest incidence and fatality rate is Africa, where the prevalence of HIV co-infection in some areas exceeds 50% (1).
The potential for rapid diagnosis of TB in African hospitals has been enhanced by the roll-out of the GeneXpert MTB/RIF test (Xpert, Cepheid, Sunnyvale, California, USA). Xpert is a sputum-based PCR assay with high sensitivity and specificity (2), but has several practical limitations. These include high cost, need for annual overseas calibration, laboratory containment facilities, and continuous electricity. In addition, as a laboratory-based assay, Xpert is not a true point-of-care (POC) test that can deliver a result within a single consultation.
An alternative to pathogen detection is quantification of host-derived biomarkers, such as serum proteins. Serum proteins are generally of higher abundance than pathogen products, are amenable to existing POC technologies such as lateral flow immunoassay (LFA), and have been shown to discriminate between different infections when combined as biosignatures (3–6). In 2016, a cohort study was published by the African European TB Consortium (AE-TBC) in which a seven-protein signature was reported that distinguished pulmonary TB from other respiratory diseases with an area under the receiver operating characteristic (ROC) curve of 91% (7). The study was conducted in primary care clinics across five countries in Africa. Participants presenting with symptoms requiring investigation for TB were recruited. The seven proteins were selected from a shortlist of 22 that had been discovered in pilot studies.
An accurate, cheap, user-friendly POC test for TB for use in secondary care hospital settings in sub-Saharan Africa would also be highly desirable. We therefore retested the signature and all 22 biomarkers from the AE-TBC study in cohorts from a case-control study that recruited adults presenting with features of TB to hospitals in Cape Town, South Africa, and Karonga, Malawi, and a TB clinic in Cape Town (the “ILULU-TB study”) (8). Equal numbers of patients were recruited with and without HIV to both TB and other diseases (OD) groups (8). Recruitment of TB patients at all sites was on the basis of culture positivity. All OD patients were recruited from hospitals. We therefore considered this cohort to be reflective of patients presenting to secondary care. We hypothesised that the seven-protein signature from the AE-TBC study, or a new signature derived from the same 22 proteins, would distinguish TB from OD in patients from the ILULU-TB study, regardless of HIV status, with a similar degree of accuracy as in the AE-TBC study.
ILULU-TB Patient Recruitment and Biobank Sampling
Between 2007 and 2010, 674 adults were recruited to the ILULU-TB study from Cape Town, South Africa, and Karonga District, Malawi. These sites have differing prevalences of ODs such as parasitic infection and differing environmental exposures (urban vs. rural). Details of recruitment have been described previously (8). Briefly, patients in the TB and OD groups were recruited consecutively and based on intention of the local clinician to test for TB. The criterion for inclusion in the TB group was at least one positive culture (sputum or tissue) for Mycobacterium tuberculosis (Mtb), which is the WHO gold standard (1). Laboratory identification of Mtb was confirmed by polymerase chain reaction (PCR). All of the TB patients that were enrolled had pulmonary TB. OD patients had an established alternative diagnosis, negative cultures for Mtb and an observed improvement of symptoms after follow-up without TB treatment. In Cape Town, TB patients were recruited from either an outpatient clinic (Khayelitsha site B) or hospital sites (Groote Schuur and GF Jooste), whereas OD patients were all recruited from the hospital sites. In Karonga, both TB and OD patients were recruited from Karonga District Hospital. As healthy controls, adults with latent TB infection (LTBI) were also recruited. LTBI status was defined by positive tuberculin skin tests and in-house interferon-gamma release assays in the absence of TB symptoms (9). Sera were collected from all participants at recruitment and stored at −80°C. All groups had HIV-1 status ascertained.
For the present study, sera from 438 individuals were selected from the ILULU-TB biobank using random number generation (Microsoft Excel 2013). Equal numbers were selected for each of the TB, OD, and LTBI groups. Within each group, equal numbers were selected with and without HIV, and from each of the two sites (Table 1). The primary aim was to distinguish TB from OD, regardless of HIV status or site. The selection process with regard to the TB and OD patients is illustrated in Figure 1. No sera from the AE-TBC study were re-analysed as part of this study.
Table 1. Demographic and clinical features for the 438 participants randomly selected from the ILULU-TB cohort for this study.
Figure 1. Selection process for inclusion of patients in the biosignature analyses. The flow diagram shows the process from original recruitment to the ILULU-TB study onwards. TB, tuberculosis; OD, other diseases; HIV−, HIV uninfected; HIV+, HIV infected.
Luminex assays were used as per the AE-TBC study for quantification of interleukin-1 receptor antagonist (IL-1RA), transforming growth factor alpha (TGF-alpha), interferon gamma (IFN-gamma), IFN-gamma-inducible protein 10 (IP-10), tumour necrosis factor alpha (TNF-alpha), IFN-alpha-2, vascular endothelial growth factor (VEGF), matrix metalloproteinase-2 (MMP-2), MMP-9, apolipoprotein A-I (apo-AI), apo-CIII, transthyretin, complement factor H (complement FH) (Merck Millipore, Billerica, Massachusetts, USA); and C-reactive protein (CRP), serum amyloid A (SAA), serum amyloid P (SAP), fibrinogen, ferritin, tissue plasminogen activator (tPA), procalcitonin (PCT), haptoglobin, and alpha-2-macroglobulin (alpha-2-M) (Bio-Rad Laboratories, Hercules, California, USA) (7). Patients were randomised across the series of assays. Sera were diluted as per manufacturers' instructions, except for MMP-2 and−9 which were diluted 1 in 100, and apo-AI, apo-CIII, transthyretin and complement FH which were diluted 1 in 30,000 following optimisation. Assays were performed in single wells with three patients run in duplicate on each plate to estimate intra-assay variability. Quality controls were run on each plate. Plates were read on Bio-Plex 200 instruments at Imperial College London with Bio-Plex Manager v6.1 software (Bio-Rad). Intra-assay variability, calculated as the mean of the coefficients of variance for each analyte individually across all plates, was <12% for all proteins. Results for quality controls fell within expected ranges. If results were below the lower limit of detection, they were assigned a value of zero. If above the upper limit, they were retested at a higher dilution.
For analyses of individual proteins, all patients with results for that protein were included. Protein concentrations were compared between the TB group and each of the OD and LTBI groups in turn using one-sided Mann-Whitney U-tests. The performance of each of the 22 proteins to distinguish TB from each of OD and healthy LTBI in turn by their serum concentration, regardless of HIV status or site, was assessed by the area under the respective ROC curve (ROC AUC). Analyses were performed using GraphPad Prism v7 (GraphPad Software, La Jolla, California, USA).
For the biosignature analyses, as shown in Figure 1, only those patients (i.e., TB and OD) for whom data was gathered for all 22 proteins were included (n = 249). This was because a finite number of kits were purchased at the outset, hence if serum from any patient had to be re-tested because a protein concentration was too high, the total number of patients with results for that protein was reduced. Healthy LTBI controls were omitted from these analyses. Patients were classified as TB if the model predicted the probability of TB was >0.5 (p > 0.5).
To retest the seven-protein signature from the AE-TBC study, data on the entire AE-TBC cohort were used for discovery (n = 701) and on this sample of the ILULU-TB cohort for validation (n = 249). The same method was used as for the AE-TBC signature [Generalised Discriminant Analyses (GDA)] using Statistica (Statsoft, Ohio, USA) (7).
For discovery of the optimal new signature, data on the ILULU-TB cohort alone was used. For consistency with the AE-TBC study, patients were randomly allocated to training and test sets at a ratio of 70:30, regardless of HIV status or study site. The same signature discovery methods were also used, namely GDA and Random Forest analyses of log-transformed values. In addition, we also performed variable selection using the Parallel Regularised Regression Model Search method (PReMS) on decile-normalised values using “R” v3.2.2 (R Foundation for Statistical Computing, Vienna, Austria). This is a logistic regression-based method designed to minimise the number of biomarkers selected (10). For each method, the same allocation of patients to training and test sets was used. Assuming the AE-TBC signature had the same accuracy in our data, we had 95% power to show a sensitivity of >90% and specificity of >66.5% with these new signatures.
For a screening test, albeit for community settings, the WHO recommend a minimum sensitivity of 90% (11). No criteria for a rule-in test are specified. After obtaining the best new signature from each method, we therefore re-tested them after adjusting the cut-off for diagnosis to increase each of the sensitivity and specificity in turn to 90%. This was to assess the performance of each signature as either a rule-out or rule-in test for TB. There were no indeterminate test results.
Ethical approval for this study was covered by the approvals for the ILULU-TB study: the Human Research Ethics Committee of the University of Cape Town, South Africa (HREC012/2007), the National Health Sciences Research Committee, Malawi (NHSRC/447), and the Ethics Committee of the London School of Hygiene and Tropical Medicine (5212).
Demographic and clinical features of individuals selected for this study are shown in Table 1. The range of diagnoses that comprised the OD group is shown in Table 2. Medians and interquartile ranges of proteins in each group are shown in Supplementary Table 1.
Table 2. Major clinical diagnoses in the Other Diseases groups of the sample of the ILULU-TB cohort that was selected for this study.
Performance of Biomarkers Individually
The best performing protein was complement factor H (FH). As shown in Table 3, this had a ROC AUC of 70% (95% confidence interval (CI): 64–76%). This performance was preserved across the sites (70% in Cape Town, 71% in Karonga) and HIV status (71% in HIV uninfected, 69% in HIV infected). ROC curves for these subdivisions are shown in Supplementary Figure 1. In addition, as shown in Figure 2, in comparison with the healthy LTBI control group, concentrations were higher in the TB group but trended toward being lower in the OD group (p = 0.072). This contrasted with the other 21 proteins, in which concentrations in the TB and OD groups differed from those in the LTBI group in the same direction.
Figure 2. Serum concentrations of the top four protein biomarkers (panels A–D) by clinical group. Scatter-dot plots show results for each patient in the ILULU-TB cohort, regardless of HIV status or site. P-values are 1-sided and derived from Mann-Whitney tests. Error bars represent medians and interquartile ranges. IP-10, IFN-gamma-inducible protein 10; IFN-gamma, interferon-gamma.
The concentrations of the top four individual biomarkers in each group are shown in Figure 2, and a display of all individual ROC AUCs is shown in Table 3. In comparison with the AE-TBC study, four proteins performed better in the ILULU-TB cohort (complement FH, SAP, haptoglobin, and alpha-2-M). The remaining 18 showed inferior performance, and the protein with the largest drop in performance was CRP, which was the best performing biomarker in the AE-TBC study and part of the seven-protein signature. Individual ROC AUCs in order of their difference compared to the AE-TBC study are shown in Supplementary Figure 2.
The performance of each protein was then stratified by HIV status. 16 proteins performed better in HIV uninfected patients: complement FH, IP-10, SAA, VEGF, haptoglobin, SAP, transthyretin, apo-CIII, ferritin, alpha-2-M, TGF-alpha, TNF-alpha, MMP-9, apo-AI, PCT, and CRP. Five proteins performed better in HIV co-infected patients: IFN-gamma, fibrinogen, IFN-alpha-2, MMP-2, and IL-1RA. Confidence intervals overlapped for every protein, however (Figure 3).
Figure 3. Individual ROC AUCs for each of HIV uninfected and infected halves of this sample of the ILULU-TB cohort. Green bars: distinction of TB, HIV– from OD, HIV–; yellow bars: TB, HIV+ from OD, HIV+.
Finally, while the main aim was to assess performance of proteins to distinguish TB from OD, we also examined their performance to distinguish TB from LTBI. The protein with the highest ROC AUC for this purpose was CRP (92%, Supplementary Table 2).
Performance of the AE-TBC Signature in the ILULU-TB Cohort
For the biosignature analyses, 122 TB patients and 127 OD patients for whom results were available for all 22 proteins were included. There was an equal distribution of patients across the clinical groups, sites, and HIV status (Figure 1).
The performance of the seven-protein signature from the AE-TBC study in the ILULU-TB cohort is shown in Table 4. With the cut-off for defining a positive test at the default setting (p > 0.5), the sensitivity was greater than in the AE-TBC study [98% (95% CI: 94–100%)], but specificity was markedly reduced [12% (7–19%)]. On comparison of biomarker concentrations between studies, there were significant differences in some proteins, especially apo-AI (Supplementary Figure 3). To understand this, we compared concentrations of apo-AI in our healthy LTBI controls with published normal concentrations. Concentrations in our LTBI group were 4-fold lower than those published (medians 324 vs. 1,180 ug/ml) (12). Concentrations of apo-CIII, however, which was part of the same multiplexed panel, matched those published (medians 114 vs. 114 ug/ml) (13). In addition, concentrations of apo-AI in the AE-TBC TB cohort were higher than published normal concentrations (2,000 vs. 1,180 ug/ml), even though apo-AI concentrations decrease in TB (7).
Performance of New Signatures Derived in the ILULU-TB Cohort
The numbers of patients in each subgroup of the ILULU-TB cohort that were randomised to each of the train and test sets are shown in Table 5. The same patients were used for this set of analyses as for the re-test of the AE-TBC signature (n = 249: 122 TB and 127 OD). The results of the best new signatures from each of the GDA, Random Forests and PReMS methods are shown in Tables 6–8. Results are shown with the cut-off at the default setting, and after increasing each of sensitivity and specificity in turn to 90%. Positive and negative predictive values (PPV and NPV) are also shown in each case, based on the equal sizes of the TB and OD groups in this study.
Table 5. Numbers of patients in the ILULU-TB cohort allocated to train and test sets, for novel biosignature discovery.
Table 6. Performance of a new five-protein signature derived from Generalised Discriminant Analyses (GDA).
Table 8. Performance of a nine-protein signature derived from Parallel Regularised Regression Model Search.
The GDA method yielded a five-protein signature comprising complement factor H, IP-10, CRP, SAA, and transthyretin. The ROC AUC in the training set was 84% (Table 6). Sensitivities and specificities in the test set were 81% and 63% initially, 79% and 41% after increasing sensitivity, and 58% and 89% after increasing specificity.
The results of the Random Forests analyses, using all 22 proteins, are shown in Table 7. Sensitivities and specificities in the test set were 73% and 71% initially, 92% and 58% after increasing sensitivity, and 95% and 43% after increasing specificity.
The PReMS method yielded a nine-protein signature comprising fibrinogen, alpha-2-M, CRP, MMP-9, transthyretin, complement FH, IFN-gamma, IP-10, and TNF-alpha. As shown in Table 8, this had a ROC AUC of 90% in the training set and 84% in the test set. Sensitivities and specificities in the test set were 86% and 74% initially, 92% and 71% after increasing sensitivity, and 75% and 81% after increasing specificity. At the cut-off for increased sensitivity, PPV and NPV in the test set were 75% and 90%, respectively. The performance in each of the HIV uninfected and co-infected halves of the test set in terms of ROC AUC was 84% for HIV uninfected patients (95% CI: 72–97%) and 86% for co-infected patients (95% CI: 71–100%), regardless of site. The ROC AUC at each of the two sites was 94% at Cape Town (95% CI: 86–100%) and 78% at Karonga (95% CI: 63–93%), regardless of HIV status. The difference between the ROC AUCs at the two sites was not significant by DeLong's test, however (p = 0.069) (14).
In the field of host serum proteomics-based TB diagnostics, this study stands out for several reasons. Firstly, it was conducted in Africa, where the burden of TB is highest, and included equal numbers of patients with and without HIV. This is important because the host response to TB may vary by ethnicity (15, 16), and is also distinct in the setting of HIV co-infection. Differences in concentrations of serum proteins between TB patients with and without HIV co-infection have not been extensively studied, although concentrations of neopterin and beta-2-microglobulin have both been found to be significantly higher in TB patients with HIV than without (17). This may reflect a state of “immune activation” in HIV-associated TB, which is well-recognised (18–20). Fundamentally, however, the pathogenesis of TB in HIV co-infection differs significantly, with impaired granuloma formation, less pulmonary cavitation and more dissemination (21–23). With the prevalence of HIV amongst patients presenting with active TB as high as 50% in some areas of Africa, and the TB case fatality rate in HIV co-infection being approximately twice that of HIV uninfected individuals (1), it is essential that any biosignature for use in such settings be derived from a representative population. The range of other diseases to be distinguished from TB is also strongly associated with both geographical location and HIV prevalence. Previous studies have derived promising serum protein signatures using techniques including mass spectrometry, but these were either not set in Africa or did not include or amalgamate sufficient numbers of HIV co-infected patients in both TB and OD (or control) groups (24–29). The two sites in Cape Town and Karonga were also selected in this study to represent the spread of epidemiological settings in Africa. Cape Town was selected to represent urban sites, and also had a low prevalence of malaria. Karonga was selected as a rural site, and had a high prevalence of malaria and other parasitic infections. The second major strength of this study was that patients were prospectively recruited from a point of differential diagnosis. An early study by Agranoff et al. included African sera, but the OD group comprised a selection of diseases whose clinical features “can overlap with” those of TB (26). This is less rigorous, since a population which is more homogenous clinically (such as ours) is likely to be more homogenous in their serum proteomes, and therefore a more challenging one from which to derive markers of host response that are TB-specific. Thirdly, our signatures were tested using immunocapture. Whilst not arising from an untargeted proteomic approach, this ensured that any such signatures were more easily translatable to lateral flow immunoassay. To our knowledge, none of the relevant mass spectrometry-based studies published in the literature performed full technical validation by immunocapture (24–28). Fourthly, the patients recruited to this study were largely hospital attendees, which is also a population currently under-represented in the literature. Two studies recruited hospital patients from sites including in Africa, but either did not include HIV co-infected patients in the discovery cohort, or had a low number of HIV infected patients in the OD group (25, 26). Several recent studies have employed immunocapture to discover new signatures including in patients from Africa, but recruitment was limited to primary care settings (29–33). Patients presenting to hospitals are likely to be more unwell than those presenting to primary care settings, and therefore to have a greater degree of disturbance to their serum proteome. The TB patients in Cape Town were recruited from a clinic, however all were culture positive as per the study design, and therefore likely to have had more advanced disease than cohorts that included clinical diagnoses. Severity of TB is known to affect the concentrations of serum proteins, including CRP, procalcitonin, and serum amyloid A, hence the importance of evaluating biomarker performance at this different level of the healthcare system (34, 35). Other strengths of the study design were that diagnoses were confirmed in all patients, and that healthy controls with LTBI infection were included for reference.
The design of this study was also well-suited to re-testing the biomarkers from the AE-TBC study. The countries in which recruitment took place were a subset of the countries in the AE-TBC study (Malawi and South Africa); the assays were performed using the same Luminex kits and analyser; and the same statistical methods were applied to the data, by the same statistician (7). To complement the signature discovery process, an additional method (PReMS) was also used.
Limitations included the fact that even though the recruitment process was open to extrapulmonary TB (EPTB) cases, no cases of culture positive TB without pulmonary involvement were included. In addition, none of the OD cases were documented as having non-tuberculous mycobacterial disease (NTM), which may more closely resemble TB in terms of host response (36). Secondly, this study was limited to the 22 proteins that had been selected by the AE-TBC based on previous performance in primary care settings. This was a strength in that the biomarkers had been through prior selection to diagnose the disease of interest (TB), but a weakness since they had not all previously been selected from presentations to secondary care. A third weakness was that, in terms of the comparison of biomarker performance between the ILULU-TB and AE-TBC studies, the study designs were different: ILULU-TB was case-control, with group sizes held equal, whereas AE-TBC was a cohort study. The group sizes in the latter therefore reflected local epidemiology, including with regards to HIV prevalence. Another difference was that our OD group comprised both pulmonary and non-pulmonary diseases, whereas AE-TBC focussed on lung diseases only. A final limitation was that our study did not include an external cohort in which to validate any new signatures.
Overall, the performance of the proteins individually was less good in the ILULU-TB cohort, except for complement FH, SAP, haptoglobin, and alpha-2-M. The results for complement FH were particularly promising in that diagnostic performance was sustained across site and HIV status. In addition, the fact that concentrations of complement FH in the TB and OD groups moved in a different direction from each other relative to the healthy controls implies that rising concentrations of complement FH may be TB-specific. Complement FH did distinguish TB from OD (or “no-PTB”) in the AE-TBC study, with higher concentrations in the TB group, but this difference was more pronounced in the ILULU-TB cohort. A possible reason for that might be that transcription of complement FH in vitro is driven by IFN-gamma (37, 38), which in turn has a central role in the host response in TB (39, 40). In keeping with this, IFN-gamma serum concentrations were higher in our TB group than OD group (Figure 2C). As discussed above, the TB cases in the ILULU-TB study were likely more advanced than those in the AE-TBC study, which may have driven serum FH concentrations up higher. An additional possibility is that FH concentrations were lower in our OD group, again due to more severe illnesses. Complement FH concentrations in serum/plasma have not been extensively studied in other infections, but are known to decrease in inflammatory conditions such as lupus nephritis and myaesthenia gravis as a result of excessive complement consumption (41, 42). Enhanced complement activation and consumption has also been shown to occur in HIV-infected patients with sepsis (43), and this may have been relevant for a proportion of our OD cohort. By contrast, the particularly poor performance of CRP in this study is interesting, since this was the best-performing biomarker individually in the AE-TBC study, with concentrations being higher in the TB group. Whilst concentrations trended toward being higher in the TB group in the ILULU-TB study, this difference was not statistically significant, and CRP did not function as a standalone biomarker. This was likely reflective, again, of the more severely ill state of the OD patients in the ILULU-TB cohort, rather than any reduction of levels in our TB cohort. This is supported by CRP being the top individual biomarker to distinguish TB from LTBI in our cohort, and also by previous observations that CRP performs significantly less well in hospital than in community settings (44, 45).
The application of the 7-protein signature from the AE-TBC study directly to the data from the ILULU-TB study was hampered by the fact that concentrations of some of the proteins differed significantly between the two studies. Data accuracy and precision within each of the two studies was good, which suggests that the commonly observed phenomenon of lot-lot variation between multiplexed kits was the main contributor (46). It is possible that the marked decrease in concentrations of apo-AI in our study represent over-correction of calibration by the manufacturer of previously high concentrations, such as were reported in the AE-TBC study.
The newly derived 5-protein GDA signature had a moderately high ROC AUC in the training set of 84% (78–90%). In the test set, however, sensitivity and specificity were less promising. The five proteins were a subset of the seven that comprised the AE-TBC signature, however, which validates them as being among the best biomarkers for TB diagnosis. The Random Forests method produced performances in the test set that were slightly greater, but this was with all 22 proteins included in the model, which is less feasible for translation to a POC test.
The best performing test emerged from the PReMS method in the form of a nine-protein signature comprising fibrinogen, alpha-2-M, CRP, MMP-9, transthyretin, complement factor H, IFN-gamma, IP-10, and TNF-alpha. The highest combined results came from optimising the sensitivity, which yielded 92% sensitivity and 71% specificity in the test set. This was comparable with the performance of the seven-protein signature in the AE-TBC study. It also exceeded the WHO minimum requirements for a “triage test” for TB, which is notable, even though that particular target was designed with community settings in mind (11). The potential benefit of a screening test in hospital settings is clear, since it would decrease the number of sputum-based investigations that would be needed, including by GeneXpert, as well as unnecessary courses of TB treatment. The performance of the signature was unaffected by HIV status, which is promising for use in African settings, and also contrasts with the performance of sputum smear microscopy, which is significantly less sensitive in HIV co-infected patients (47).
This study focussed on culture positive TB, in order to derive a signature based on confirmed cases. Future validation studies, however, should include culture negative pulmonary TB cases, as well as EPTB, and OD groups including NTM disease. In addition, translation to POC will depend on the availability of LFA platforms for measuring multiple proteins. LFAs have been shown to be feasible for use in sub-Saharan African settings and accurate across four orders of magnitude, without the need for a cold chain for distribution or storage (6). Multiplexing technology is also emerging for LFAs, with multiple proteins either being detected in series, along one strip (48, 49), or in parallel, with multiple strips contained within one handheld device (50).
In summary, we retested the performance of 22 host serum protein biomarkers of TB that had originally been selected from primary care studies in Africa in a large sample from a well-characterised cohort recruited largely from hospitals. The top-performing single biomarker was complement factor H, which is a novel marker of TB in this setting. A nine-protein biosignature was discovered which showed promise for use as a POC screening test in hospital settings, and performed equally well in individuals co-infected with HIV. Translation to this will depend on validation in independent cohorts and on development of accurate POC platforms.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
The studies involving human participants were reviewed and ethical approval for this study was covered by the approvals for the ILULU-TB study: the Human Research Ethics Committee of the University of Cape Town, South Africa (HREC012/2007), the National Health Sciences Research Committee, Malawi (NHSRC/447), and the Ethics Committee of the London School of Hygiene and Tropical Medicine (5212). The patients/participants provided their written informed consent to participate in this study.
HD, ML, RW, and MH: conceived and designed the experiments. TM: performed the experiments. TM, CH, and MK: analysed the data. TM, CH, NC, MK, NF, GW, ML, RW, and MH: provided input into data interpretation. TM, CH, NC, RW, and MH: contributed to writing the first version of the manuscript. NC, TO, KW, HD, AC, NF, RW, and MH: contributed to revisions of the manuscript. TO, RG, LS, LB, ML, and RW: enrolled patients used in this study. TO, RG, KW, LS, and AC: data collection on patients used in this study. All authors contributed to the article and approved the submitted version.
This study was funded by the James Maxwell Grant Prophit Fellowship 2016-17 (Royal College of Physicians, London) to TM. RW was funded by Wellcome (104803 and 203135); The Francis Crick Institute which is funded by UKRI-MRC (FC0010218), CRUK (FC0010218) and Wellcome (FC0010218); National Institutes of Health (AI115940); Foundation for National Institutes of Health (WILK116PTB); and EDCTP2 (SRIA2015-1065). The authors also wish to acknowledge the funding source of the ILULU-TB project which was EU Action for Diseases of Poverty program grant (Sante/2006/105-061).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors wish to thank all members of the ILULU Consortium: Institute of Infectious Diseases and Molecular Medicine, University of Cape Town Nonzwakazi Bangani, Lizl Bashe, Melina Carr, Hannah P. Gideon, RG, Yekiwe Hlombe, Vanessa January, Bekekile Kwaza, Suzaan Marais, Marc Mendelson, TO, Fadheela Patel, Ronnett Seldon, Relebohile Tsekela, KW, RW, Kathryn Wood; London School of Hygiene & Tropical Medicine/Karonga Prevention Study Lyn Ambrose, AC, HD, NF, Lumbani Munthali, Bagrey Ngwira, Amos Phiri, Femia Zgambo; Red Cross War Memorial Children's Hospital, University of Cape Town Margaret Cooper, Brian Eley, Mabel Gcuwa, Spasina King, Glynis Kossew, Karen McCabe, Wonita Petersen, Sandra Pienaar, Vashini Pillay; Liverpool School of Tropical Medicine/Malawi-Liverpool-Wellcome Trust Clinical Research Programme, University of Malawi College of Medicine Benjamin Allubha, George Chagaluka, Angeziwa Chunga, Janet Dube, Robert S. Heyderman, Annie Joabe, Martha Kalemba, Anne Kerr, Monica Matola, Rachel Mlotha, Agnes Mwale, David Mzinza; Brighton and Sussex Medical School, University of Sussex Suzanne T. Anderson, Gillian Baker, Claire M. Banwell, Terry Bishop, Natalie Chaplin, Julian Golland, Florian Kern, Susan Poore, Jayne Wellington; Imperial College London Andrew J. Brent, Lachlan J. Coin, Hariklia Eleftherohorinou, Shea Hamilton, Myrsini Kaforou, Paul R. Langford, ML, Stephanie Menikou, Victoria J. Wright.
The authors also wish to acknowledge all members of the AE-TBC: Stellenbosch University, South Africa: GW, NC, Magdalena Kriel, Gian van der Spuy, Andre G Loxton, Kim Stanley, Stephanus Malherbe, Belinda Kriel, Leigh A Kotzé, Dolapo O Awoniyi, Elizna Maasdorp. MRC Unit, The Gambia: Jayne S Sutherland, Olumuyiwa Owolabi, Abdou Sillah, Joseph Mendy, Awa Gindeh, Simon Donkor, Toyin Togun, Martin Ota. Karonga Prevention Study, Malawi: AC, Felanji Simukonda, Alemayehu Amberbir, Femia Chilongo, Rein Houben. Ethiopian Health and Nutrition Research Institute, Ethiopia: Desta Kassa, Atsbeha Gebrezgeabher, Getnet Mesfin, Yohannes Belay, Gebremedhin Gebremichael, Yodit Alemayehu. University of Namibia, Namibia: Marieta van der Vyver, Faustina N Amutenya, Josefina N Nelongo, Lidia Monye, Jacob A Sheehama, Scholastica Iipinge. Makerere University, Uganda: Harriet Mayanja-Kizza, Ann Ritah Namuganga, Grace Muzanye, Mary Nsereko, Pierre Peters. Armauer Hansen Research Institute, Ethiopia: Rawleigh Howe, Adane Mihret, Yonas Bekele, Bamlak Tessema, Lawrence Yamuah. Leiden University Medical Centre, The Netherlands: Tom HM Ottenhoff, Annemieke Geluk, Kees LMC Franken, Paul LAM Corstjens, Elisa M Tjon Kon Fat, Claudia J de Dood, Jolien J van der Ploeg-van Schip. Statens Serum Institut, Copenhagen, Denmark: Ida Rosenkrands, Claus Aagaard. Max Planck Institute for Infection Biology, Berlin, Germany: Stefan HE Kaufmann, Maria M. Esterhuyse. London School of Hygiene and Tropical Medicine, London, UK: Jacqueline M Cliff, Hazel M Dockrell.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2021.639174/full#supplementary-material
2. Steingart KR, Schiller I, Horne DJ, Pai M, Boehme CC, Dendukuri N, et al. Xpert® MTB/RIF assay for pulmonary tuberculosis and rifampicin resistance in adults. Cochrane Database Syst Rev. (2014) 1:CD009593. doi: 10.1002/14651858.CD009593.pub3
3. Papadopoulos MC, Abel PM, Agranoff D, Stich A, Tarelli E, Bell AB, et al. A novel and accurate diagnostic test for human African trypanosomiasis. Lancet. (2004) 363:1358–63. doi: 10.1016/S0140-6736(04)16046-7
4. van Houten CB, de Groot JAH, Klein A, Srugo I, Chistyakov I, de Waal W, et al. A host-protein based assay to differentiate between bacterial and viral infections in preschool children (OPPORTUNITY): a double-blind, multicentre, validation study. Lancet Infect Dis. (2017) 17:431–40. doi: 10.1016/S1473-3099(16)30519-9
5. Srugo I, Klein A, Stein M, Golan-Shany O, Kerem N, Chistyakov I, et al. Validation of a novel assay to distinguish bacterial and viral infections. Pediatrics. (2017) 140:e20163453. doi: 10.1542/peds.2016-3453
6. Corstjens PL, Tjon Kon Fat EM, de Dood CJ, van der Ploeg-van Schip JJ, Franken KLMC, Chegou NN, et al. Multi-center evaluation of a user-friendly lateral flow assay to determine IP-10 and CCL4 levels in blood of TB and non-TB cases in Africa. Clin Biochem. (2016) 49:22–31. doi: 10.1016/j.clinbiochem.2015.08.013
7. Chegou NN, Sutherland JS, Malherbe S, Crampin AC, Corstjens PLAM, Geluk A, et al. Diagnostic performance of a seven-marker serum protein biosignature for the diagnosis of active TB disease in African primary healthcare clinic attendees with signs and symptoms suggestive of TB. Thorax. (2016) 71:785–94. doi: 10.1136/thoraxjnl-2015-207999
8. Kaforou M, Wright VJ, Oni T, French N, Anderson ST, Bangani N, et al. Detection of tuberculosis in HIV-infected and -uninfected African adults using whole blood RNA expression signatures: a case-control study. PLoS Med. (2013) 10:e1001538. doi: 10.1371/journal.pmed.1001538
9. Schölvinck E, Wilkinson KA, Whelan AO, Martineau AR, Levin M, Wilkinson RJ. Gamma interferon-based immunodiagnosis of tuberculosis: comparison between whole-blood and enzyme-linked immunospot methods. J Clin Microbiol. (2004) 42:829–31. doi: 10.1128/JCM.42.2.829-831.2004
12. McQueen MJ, Hawken S, Wang X, Ounpuu S, Sniderman A, Probstfield J, et al. Lipids, lipoproteins, and apolipoproteins as risk markers of myocardial infarction in 52 countries (the INTERHEART study): a case-control study. Lancet. (2008) 372:224–33. doi: 10.1016/S0140-6736(08)61076-4
13. Aroner SA, Yang M, Li J, Furtado JD, Sacks FM, Tjønneland A, et al. Apolipoprotein C-III and high-density lipoprotein subspecies defined by apolipoprotein C-III in relation to diabetes risk. Am J Epidemiol. (2017) 186:736–44. doi: 10.1093/aje/kwx143
14. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. (1988) 44:837–45. doi: 10.2307/2531595
15. Coussens AK, Wilkinson RJ, Nikolayevskyy V, Elkington PT, Hanifa Y, Islam K, et al. Ethnic variation in inflammatory profile in tuberculosis. PLoS Pathog. (2013) 9:e1003468. doi: 10.1371/journal.ppat.1003468
16. Oliveira-de-Souza D, Vinhaes CL, Arriaga MB, Kumar NP, Cubillos-Angulo JM, Shi R, et al. Molecular degree of perturbation of plasma inflammatory markers associated with tuberculosis reveals distinct disease profiles between Indian and Chinese populations. Sci Rep. (2019) 9:8002. doi: 10.1038/s41598-019-44513-8
17. Skogmar S, Schön T, Balcha TT, Sturegård E, Jansson M, Björkman P. Plasma levels of neopterin and C-Reactive Protein (CRP) in Tuberculosis (TB) with and without HIV coinfection in relation to CD4 cell count. PLoS ONE. (2015) 10:e0144292. doi: 10.1371/journal.pone.0144292
18. Vanham G, Edmonds K, Qing L, Hom D, Toossi Z, Jones B, et al. Generalized immune activation in pulmonary tuberculosis: co-activation with HIV infection. Clin Exp Immunol. (1996) 103:30–4. doi: 10.1046/j.1365-2249.1996.907600.x
19. Toossi Z, Funderburg NT, Sirdeshmuk S, Whalen CC, Nanteza MW, Johnson DF, et al. Systemic immune activation and microbial translocation in dual HIV/tuberculosis-infected subjects. J Infect Dis. (2013) 207:1841–9. doi: 10.1093/infdis/jit092
20. Meng Q, Sayin I, Canaday DH, Mayanja-Kizza H, Baseke J, Toossi Z. Immune activation at sites of HIV/TB Co-infection contributes to the pathogenesis of HIV-1 disease. PLoS ONE. (2016) 11:e0166954. doi: 10.1371/journal.pone.0166954
21. Gilks CF, Brindle RJ, Otieno LS, Bhatt SM, Newnham RS, Simani PM, et al. Extrapulmonary and disseminated tuberculosis in HIV-1-seropositive patients presenting to the acute medical services in Nairobi. AIDS. (1990) 4:981–5. doi: 10.1097/00002030-199010000-00006
23. Walker NF, Clark SO, Oni T, Andreu N, Tezera L, Singh S, et al. Doxycycline and HIV infection suppress tuberculosis-induced matrix metalloproteinases. Am J Respir Crit Care Med. (2012) 185:989–97. doi: 10.1164/rccm.201110-1769OC
24. Sandhu G, Battaglia F, Ely BK, Athanasakis D, Montoya R, Valencia T, et al. Discriminating active from latent tuberculosis in patients presenting to community clinics. PLoS ONE. (2012) 7:e38080. doi: 10.1371/journal.pone.0038080
25. Achkar JM, Cortes L, Croteau P, Yanofsky C, Mentinova M, Rajotte I, et al. Host protein biomarkers identify active tuberculosis in HIV uninfected and co-infected individuals. EBioMedicine. (2015) 2:1160–8. doi: 10.1016/j.ebiom.2015.07.039
26. Agranoff D, Fernandez-Reyes D, Papadopoulos MC, Rojas SA, Herbster M, Loosemore A, et al. Identification of diagnostic markers for tuberculosis by proteomic fingerprinting of serum. Lancet. (2006) 368:1012–21. doi: 10.1016/S0140-6736(06)69342-2
27. Xu D, Li Y, Li X, Wei LL, Pan Z, Jiang TT, et al. Serum protein S100A9, SOD3, and MMP9 as new diagnostic biomarkers for pulmonary tuberculosis by iTRAQ-coupled two-dimensional LC-MS/MS. Proteomics. (2015) 15:58–67. doi: 10.1002/pmic.201400366
28. Liu J, Jiang T, Wei L, Yang X, Wang C, Zhang X, et al. The discovery and identification of a candidate proteomic biomarker of active tuberculosis. BMC Infect Dis. (2013) 13:506. doi: 10.1186/1471-2334-13-506
29. Ahmad R, Xie L, Pyle M, Suarez MF, Broger T, Steinberg D, et al. A rapid triage test for active pulmonary tuberculosis in adult patients with persistent cough. Sci Transl Med. (2019) 11:eaaw8287. doi: 10.1126/scitranslmed.aaw8287
30. De Groote MA, Sterling DG, Hraha T, Russell TM, Green LS, Wall K, et al. Discovery and validation of a six-marker serum protein signature for the diagnosis of active pulmonary tuberculosis. J Clin Microbiol. (2017) 55:3057–71. doi: 10.1128/JCM.00467-17
31. Chegou NN, Sutherland JS, Namuganga AR, Corstjens PL, Geluk A, Gebremichael G, et al. Africa-wide evaluation of host biomarkers in QuantiFERON supernatants for the diagnosis of pulmonary tuberculosis. Sci Rep. (2018) 8:2675. doi: 10.1038/s41598-018-20855-7
32. Manngo PM, Gutschmidt A, Snyders CI, Mutavhatsindi H, Manyelo CM, Makhoba NS, et al. Prospective evaluation of host biomarkers other than interferon gamma in QuantiFERON Plus supernatants as candidates for the diagnosis of tuberculosis in symptomatic individuals. J Infect. (2019) 79:228–35. doi: 10.1016/j.jinf.2019.07.007
33. Jacobs R, Malherbe S, Loxton AG, Stanley K, van der Spuy G, Walzl G, et al. Identification of novel host biomarkers in plasma as candidates for the immunodiagnosis of tuberculosis disease and monitoring of tuberculosis treatment response. Oncotarget. (2016) 7:57581–92. doi: 10.18632/oncotarget.11420
34. Sigal GB, Segal MR, Mathew A, Jarlsberg L, Wang M, Barbero S, et al. Biomarkers of tuberculosis severity and treatment effect: a directed screen of 70 host markers in a randomized clinical trial. EBioMedicine. (2017) 25:112–21. doi: 10.1016/j.ebiom.2017.10.018
35. Liu Q, Chen X, Hu C, Zhang R, Yue J, Wu G, et al. Serum protein profiling of smear-positive and smear-negative pulmonary tuberculosis using SELDI-TOF mass spectrometry. Lung. (2010) 188:15–23. doi: 10.1007/s00408-009-9199-6
36. Teklu T, Wondale B, Taye B, Hailemariam M, Bekele S, Tamirat M, et al. Differences in plasma proteomes for active tuberculosis, latent tuberculosis and non-tuberculosis mycobacterial lung disease patients with and without ESAT-6/CFP10 stimulation. Proteome Sci. (2020) 18:10. doi: 10.1186/s12953-020-00165-5
37. Brooimans RA, van der Ark AA, Buurman WA, van Es LA, Daha MR. Differential regulation of complement factor H and C3 production in human umbilical vein endothelial cells by IFN-gamma and IL-1. J Immunol. (1990) 144:3835–40.
38. van den Dobbelsteen ME, Verhasselt V, Kaashoek JG, Timmerman JJ, Schroeijers WE, Verweij CL, et al. Regulation of C3 and factor H synthesis of human glomerular mesangial cells by IL-1 and interferon-gamma. Clin Exp Immunol. (1994) 95:173–80. doi: 10.1111/j.1365-2249.1994.tb06033.x
40. Chackerian A, Perera T, Behar S. Gamma interferon-producing CD4+ T lymphocytes in the lung correlate with resistance to infection with Mycobacterium tuberculosis. Infect Immun. (2001) 69:2666–74. doi: 10.1128/IAI.69.4.2666-2674.2001
41. Wang FM, Yu F, Tan Y, Song D, Zhao MH. Serum complement factor H is associated with clinical and pathological activities of patients with lupus nephritis. Rheumatology. (2012) 51:2269–77. doi: 10.1093/rheumatology/kes218
42. Romi F, Kristoffersen EK, Aarli JA, Gilhus NE. The role of complement in myasthenia gravis: serological evidence of complement consumption in vivo. J Neuroimmunol. (2005) 158:191–4. doi: 10.1016/j.jneuroim.2004.08.002
43. Huson MA, Wouters D, van Mierlo G, Grobusch MP, Zeerleder SS, van der Poll T. HIV coinfection enhances complement activation during sepsis. J Infect Dis. (2015) 212:474–83. doi: 10.1093/infdis/jiv074
44. Santos VS, Goletti D, Kontogianni K, Adams ER, Molina-Moya B, Dominguez J, et al. Acute phase proteins and IP-10 as triage tests for the diagnosis of tuberculosis: systematic review and meta-analysis. Clin Microbiol Infect. (2019) 25:169–77. doi: 10.1016/j.cmi.2018.07.017
45. Yoon C, Semitala FC, Atuhumuza E, Katende J, Mwebe S, Asege L, et al. Point-of-care C-reactive protein-based tuberculosis screening for people living with HIV: a diagnostic accuracy study. Lancet Infect Dis. (2017) 17:1285–92. doi: 10.1016/S1473-3099(17)30488-7
47. Boehme CC, Nicol MP, Nabeta P, Michael JS, Gotuzzo E, Tahirli R, et al. Feasibility, diagnostic accuracy, and effectiveness of decentralised use of the Xpert MTB/RIF test for diagnosis of tuberculosis and multidrug resistance: a multicentre implementation study. Lancet. (2011) 377:1495–505. doi: 10.1016/S0140-6736(11)60438-8
48. Bartosh AV, Sotnikov DV, Hendrickson OD, Zherdev AV, Dzantiev BB. Design of multiplex lateral flow tests: a case study for simultaneous detection of three antibiotics. Biosensors. (2020) 10:17. doi: 10.3390/bios10030017
49. van Hooij A, van den Eeden S, Richardus R, Tjon Kon Fat E, Wilson L, Franken KLMC, et al. Application of new host biomarker profiles in quantitative point-of-care tests facilitates leprosy diagnosis in the field. EBioMedicine. (2019) 47:301–8. doi: 10.1016/j.ebiom.2019.08.009
Keywords: serum, protein, biomarker, tuberculosis, diagnosis, HIV, Africa
Citation: Morris TC, Hoggart CJ, Chegou NN, Kidd M, Oni T, Goliath R, Wilkinson KA, Dockrell HM, Sichali L, Banda L, Crampin AC, French N, Walzl G, Levin M, Wilkinson RJ and Hamilton MS (2021) Evaluation of Host Serum Protein Biomarkers of Tuberculosis in sub-Saharan Africa. Front. Immunol. 12:639174. doi: 10.3389/fimmu.2021.639174
Received: 08 December 2020; Accepted: 27 January 2021;
Published: 25 February 2021.
Edited by:Buka Samten, University of Texas at Tyler, United States
Reviewed by:Zissis Chroneos, Pennsylvania State University, United States
Roberta Olmo Pinheiro, Oswaldo Cruz Foundation, Brazil
Edward Chan, Rocky Mountain Regional VA Medical Center, United States
Copyright © 2021 Morris, Hoggart, Chegou, Kidd, Oni, Goliath, Wilkinson, Dockrell, Sichali, Banda, Crampin, French, Walzl, Levin, Wilkinson and Hamilton. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.