Previous Article | Next Article ![]()
Clinical and Vaccine Immunology, March 2008, p. 425-432, Vol. 15, No. 3
1071-412X/08/$08.00+0 doi:10.1128/CVI.00398-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Division of Infectious Diseases and Geographic Medicine, Stanford University School of Medicine, Stanford, California,1 Santa Clara Valley Health and Hospitals System, San Jose, California2
Received 3 September 2007/ Returned for modification 7 November 2007/ Accepted 6 January 2008
|
|
|---|
= 0.48 [± 0.11]). At 3 months, 3/48 (6.3%; 95% confidence interval [95CI], 2 to 17) of those initially QFT-G-IT negative converted, and 5/15 (33%; 95CI, 15 to 58) of those initially QFT-G-IT positive reverted. Among the 8 individuals with inconsistent QFT-G-IT results, the maximum gamma interferon response at either visit was 0.68 IU/ml versus means of 4.99 (± 3.74) and 6.95 (± 5.6) for 10 persistent positives at the first and second visits, respectively. Expected false-reversion and -conversion rates were 32% (90CI, 25 to 39%) and 6.95% (90CI, 4.6 to 9.8%) when the sensitivity and specificity were assumed to average 70% and 98%, respectively. Transient responses to QFT-G-IT are common, and low positive results need to be interpreted with caution. Further studies are needed to characterize the predictive value of the test for U.S. foreign-born and other targeted screening populations. |
|
|---|
After more than a century, the development of T-cell-based gamma interferon (IFN-
) release assays (IGRAs) for the detection of latent TB infection represents a significant advancement in TB diagnostics (36). Advantages of these in vitro blood tests include direct and objective measurement of cell-mediated immune response and the need for only one patient visit. Newer generations of the assays incorporate highly specific M. tuberculosis targets, such as the ESAT-6 (early secretory antigen 6) and CFP-10 (culture filtrate protein 10) complex from region of difference 1 (RD1) of the M. tuberculosis genome (3). Because the RD1 region is absent from most NTM as well as from strains used in BCG vaccines, the tests are considered highly specific for infection with M. tuberculosis (1, 42). One IGRA is now commercially available in the United States: QuantiFERON-TB Gold (QFT-G; Cellestis Ltd., Carnegie, Victoria, Australia), approved by the U.S. Food and Drug Administration in 2005 (26). A recent meta-analysis estimated the pooled average specificity and sensitivity of this assay to be 97.7% and 67%, respectively (29). A simplified QFT-G blood collection system (the QFT-G in-tube assay [QFT-G-IT]) was approved in October 2007 by the U.S. FDA; another IGRA format, the enzyme-linked immunospot (ELISPOT) assay (T-SPOT.TB; Oxford Immunotec, Oxon, United Kingdom) available in Europe, is currently undergoing U.S. FDA review. The Centers for Disease Control and Prevention has recommended use of the QFT-G in "all circumstances" where the TST is used (26).
Despite broad enthusiasm for the tests, their role in screening programs remains to be defined (29). The biological basis for discordance with TST in the assessment of high-risk populations is not understood (21, 34). Studies with children (9) and immune-compromised hosts (5) are also still needed. Although adoption of the QFT-G is being actively investigated by U.S. public health departments (12), only one cross-sectional study has been reported from the United States (39). There are no U.S. studies, and only a handful internationally (16, 33), examining reproducibility for the intended screening populations. In the United States, serial testing studies are needed to characterize utility in screening foreign-born and other targeted high-risk groups, for whom repeat testing is often conducted either intentionally (to assess boosting) or incidentally (immigrants may see a variety of health care providers who each perform a TB test). In the absence of a true "gold standard" for latent TB infection, serial testing studies can also be useful for evaluating discordance with TST (31).
As part of a prospective study of infectious disease transmission in immigrant households in northern California, we tested household members on two occasions 3 months apart in order to evaluate the reproducibility of the QFT-G-IT.
|
|
|---|
Subjects who consented completed a structured interview, which was followed by whole-blood collection and TST placement. Individuals of <2 years of age, those reporting histories of active TB, immune-modulating conditions (human immunodeficiency virus infection, malignancy, chemotherapy, or pregnancy) or adverse reaction to TST, or those who had received a TST within the previous 3 months were excluded. Within 48 to 72 h, a return home visit was made to read the TST. Approximately 3 months later, a second home visit was scheduled at which blood collection followed by TST testing and a TST reading visit within 48 to 72 h were repeated. Subjects reporting a prior positive TST at the first visit, or who tested positive at the first visit, were offered a choice of repeating the TST; however, all subjects were asked to complete follow-up blood collection for the QFT-G-IT.
The study was approved by the Institutional Review Boards of Stanford University and the Santa Clara Valley Medical System.
Risk factor assessment. A structured interview was administered to collect demographic information, including country of birth, travel to countries of endemicity, and time in the United States. In addition, information was collected on history of BCG vaccination (including visual inspection for scar), TB exposure, social and medical risk factors for TB, prior diagnosis or treatment of TB, and current gastrointestinal or pulmonary symptoms.
TST.
Tuberculin PPD (0.1 ml; Tubersol; Aventis) was applied intradermally at the inner volar surface of the forearm under supervision of a licensed health care professional as required by California law. The size of the induration was recorded within 48 to 72 h, measured across the wide diameter. Per local public health authority guidelines (41), an induration of
10 mm was classified as positive. A TST conversion was defined as an increase in induration of at least 6 mm, accompanied by a qualitative change from negative (<10 mm) to positive (
10 mm) (27). Participants received a written notice of their TST results, and those with a conversion were offered assistance with referral to a primary health care provider.
QFT-G-IT collection.
Prior to placement of the TST, 1 ml whole blood was drawn into one (each) of three manufacturer-precoated heparinized tubes containing mitogen (phytohemagglutinin A), TB peptide antigens ESAT-6/CFP-10/TB7.7, and a negative control (phosphate-buffered saline with 0.1% thimerosal), respectively. Tubes were transported to the research laboratory within 6 h and incubated at 37°C for 16 to 24 h. Harvested plasma was stored at 2 to 8°C for up to 3 days or frozen at –20°C for up to 8 weeks pending enzyme-linked immunosorbent assay (ELISA) and at –80°C for periods longer than 2 months. ELISA was used to detect levels of IFN-
in response to TB antigens according to manufacturer's instructions and quality control criteria. Absorbance values were converted to international units of concentration using manufacturer's standard curve methodology. Results were interpreted according to manufacturer's guidelines: an IFN concentration of
0.35 IU/ml over the nil concentration was classified as positive. Nil concentrations of
8.0 IU/ml and mitogen differences of <0.5 IU/ml were considered indeterminate. Values greater than 10 IU/ml were assigned a value of 10 IU, the upper limit of the standard curve. All tests were conducted by the same operator, who was blinded to subject characteristics. Conversions and reversions were defined qualitatively as a change from negative (<0.35 IU/ml above background) to positive (
0.35 IU/ml above background) and as a change from positive to negative between visits, respectively.
Analysis.
The McNemar test and the
coefficient were used to characterize baseline discordance with TST and the reproducibility of QFT-G-IT, respectively. QFT-G-IT conversion and reversion rates and 95% confidence intervals (95CI) were computed. We also evaluated variation in absolute IFN response levels, using Student's t test for group comparisons. Fisher's exact test was used to compare categorical outcomes between groups. Statistical analysis was conducted using SAS 9.3.
Monte Carlo simulation (CrystalBall; Decisioneering, Denver, CO) was used to predict apparent conversion and reversion rates based on assumptions regarding accuracy of testing (38). A base model was constructed using observed test results for QFT at the baseline visit. After one additional testing cycle, the expected rate of conversion in initial negatives can be predicted as follows: Expected conversion % = (1 – PV0–)(Se) + PV0–(1 – Sp), where PV0– and PV0+ the predictive value negative and positive of the baseline test and Se and Sp are the constant sensitivity and specificity of the test, respectively.
Similarly, the expected reversion rate in initial positives can be predicted as follows: Expected reversion % = PV0+ (1 – Se) + (1 – PV0+) (Sp).
Prevalence, sensitivity, and specificity were modeled with beta distributions (0,1) using credible estimates from a recent meta-analysis (29). The analysis assumes that repeated test results are independent (conditional only on the true infection state) and that the "true" rates of conversion and reversion are zero over 3 months (18). A three-way sensitivity analysis, varying sensitivity and specificity over the apparent prevalence of infection, was used to evaluate uncertainty in the model's assumptions. Each Monte Carlo simulation was seeded with the baseline data and sampled for 1,000 trials, using a fixed random number seed.
|
|
|---|
Within the 39 participating households, 88 SIFT study participants over the age of 2 were enrolled. Of these, 87 had baseline QFT-G-IT results (1 individual had an indeterminate QFT-G-IT result). Of 87 subjects with baseline results, 65 (75%) completed blood collection at the 3-month follow-up visit, and 22 (25%) were not available. Two baseline-TST-negative subjects did not complete the follow-up TST. Thus, a total of 63 subjects (49 adults and 14 children ranging in age from 7 to 17 years) were included in the serial testing analysis, of whom 49 were concurrently skin tested at both visits.
Nearly 80% (50/63) of study subjects were born in a high-prevalence country (98% from Latin America), 37% of whom had resided in the United States for less than 5 years; 15 (24%) reported history of exposure to an active TB case (median, 3 years before baseline visit). Over two-thirds (68%) reported BCG vaccination, of whom 31 (72%) were confirmed by observation of scar and 47 (75%) reported having been skin tested before (median, 2.8 years before baseline visit), including 14 (21%) who reported a prior positive TST result. The 63 subjects completing both visits did not differ in characteristics from those who were not retested (Table 1), and there was no difference in the proportions of baseline QFT-positive and QFT-negative subjects who did not complete the second blood collection (25% versus 25%, respectively).
|
View this table: [in a new window] |
TABLE 1. Characteristics of subjects participating at baseline and 3-month follow-up visits
|
Agreement between QFT-G-IT and TST results at the baseline visit was moderate (
= 0.48 [±0.11]). Overall, 14 (22%) subjects had discordant results (McNemar P = 0.03), of whom 11 (79%) were TST+/QFT– and 3 (21%) were QFT+/TST– (Table 2). All 14 discordances occurred for subjects reporting histories of BCG vaccination; results from 20 subjects reporting no history of vaccination were 100% concordant. Among those reporting BCG vaccination, agreement was 0.35 (± 0.13) by BCG recall and 0.39 (± 0.14) by observation of scar. Of the 23 subjects considered TST positive at baseline, 4/8 (50%) of those not retested were discordant versus 7/15 (47%) of those testing positive at the baseline visit.
|
View this table: [in a new window] |
TABLE 2. Baseline agreement of TST and QFT-G-IT
|
= 0.50 ± 0.13). Seven of 15 (47%) with TST indurations of
10 mm had QFT values of <0.35 IU/ml (range, –0.07 to 0.30 IU/ml), including 3/9 (33%) of those with indurations of
15 mm.
Three-month visit.
At the second visit 3 months later, 8 (13%) of 63 subjects had different QFT-G-IT results, including 5/15 (33%; 95CI, 15 to 58%) initially positive subjects who reverted and 3/48 (6.3%; 95CI, 2.2 to 16.8%) initially negative subjects who converted (
= 0.63 ± 0.12). There was a nonsignificant trend for subjects with inconsistent QFT-G-IT results to have been discordant by TST at baseline (21% versus 10%; overall Fisher's P = 0.36). Of 12 subjects whose QFT-G-IT and TST results were concordantly positive, 3 (25%) had reversions of QFT-G-IT, while there were two reversions (66%) among the 3 QFT positives who initially had a negative TST. Among 37 subjects whose QFT-G-IT and TST results were concordantly negative, 2 (5%) had QFT-G-IT conversions, compared to 1 (9%) among the 11 QFT-G negatives who initially had a positive TST.
For 49 individuals concurrently skin tested at both visits (Table 3), including 7 of the 8 with different QFT-G-IT results, QFT-G-IT reproducibility was lower (
= 0.38 ± 0.19), and 11/49 (22%) QFT-G-IT2 results were discordant with TST2 results (McNemar P = 0.03). Overall, the two tests agreed on none of seven possible conversions (two QFT-G-IT and five TST) and on one of seven possible reversions (five QFT-G-IT and two TST).
|
View this table: [in a new window] |
TABLE 3. Prospective agreement of QFT-G-IT and TST for 49 concurrently skin-tested subjects
|
![]() View larger version (11K): [in a new window] |
FIG. 1. IFN- responses to QFT-G-IT by study visit. Mean IFN- responses at two study visits 3 months apart for subjects who were persistently negative (Pers. Neg.) or persistently positive (Pers. Pos.) and who converted (Convert) or reverted (Revert) according to QFT-G-IT. 0.35 UI/ml, the manufacturer's recommended positive cutoff, is marked with a dotted line.
|
|
View this table: [in a new window] |
TABLE 4. Characteristics of eight subjects with inconsistent QFT-GOLD-IT results
|
Role of testing error. A test with an average sensitivity and specificity of 70% (beta, 70,29) and 98% (beta, 98,1), respectively, would be expected to have a median false-reversion rate of 31.72% (90CI, 25.4 to 38.5%) among initially positive subjects and a median false-conversion rate of 6.95% (90CI, 4.6 to 9.9%) when the apparent prevalence of infection is approximately 24% (beta, 15,47). Figure 2 shows the effect of varying these assumptions. When the specificity of a test is high, the reversion rate depends predominantly on the sensitivity of testing and is relatively stable for the given test characteristics and an apparent prevalence of infection of >10%. Conversely, the expected conversion rate is more dependent on the specificity of testing as well as the prevalence of infection. Conversion rates may also be difficult to interpret due to uncertainty about the difference between the apparent and "true" risk of infection.
![]() View larger version (18K): [in a new window] |
FIG. 2. Three-way sensitivity analysis. Three-way sensitivity analysis illustrating the effect of varying assumptions regarding diagnostic accuracies (Se, sensitivity; Sp, specificity) and prevalence (apparent risk) of infection on expected rates of false reversion (A) and false conversion (B).
|
|
|
|---|
To our knowledge, this is the first study examining the reproducibility of the QFT-G assay with an intended U.S. screening population. Similar results were reported for a cohort of Indian health care workers who were retested with the QFT-G-IT after 18 months. In that study, 24% of subjects reverted by QFT-G-IT, including 55% of those with IFN response levels of 0.35 to 0.69 IU/ml, and conversions ranged from 7 to 12%, depending on the stringency of conversion criteria (33). In 341 Gambian household contacts of TB cases followed by ELISPOT assay, reversion rates were 41% and 36% at 3 and 18 months, respectively, and corresponding conversion rates were 25% and 27% (versus 50% at 18 months by TST) (16). Although reproducibility appears to be somewhat better when the TST is concordant at baseline, this has been difficult to calibrate (35). Despite different IGRA formats, tuberculin formulations, and exposure gradients, serial testing studies to date have consistently reported that variable IFN-
responses in assays based on the RD1 antigens appear to be common.
Possible explanations for variable responses, as well as discordance with TST, include biological factors, laboratory artifact, and test-related error. While the higher specificity of the QFT is an important explanation of cross-sectional discordance with TST, responses to early secreted antigens over the evolution of latent infection are not well understood and may have cycling patterns (10, 16). Most of our subjects were adult emigrants from Latin America who reside in multigenerational Hispanic communities, where travel to and from Latin America is common. Nearly one-fourth, including five-eighths of inconsistent responders, reported prior TB exposure (16% within 3 years), and over two-thirds reported BCG vaccination. It is possible that fluctuations in response to unknown environmental stimuli contributed to nonspecific variation in test results. Although the assay is theoretically invariant to BCG vaccination, this does not rule out confounding due to concomitant infections, such as NTM or helminths and other host factors that may also be present in BCG-vaccinated populations (4, 13). We were unable to evaluate reproducibility in children due to low numbers; however, poor cross-sectional agreement with TST (9) and, in the Gambian study (16), higher reversion rates among children may also reflect the variability of IFN-
responses during the adaptation of the cellular immune system (23). Individuals with more-remote exposure may also respond to the whole-blood incubation protocol differentially. Because the QFT-G assay captures recently activated lymphocytes, it has also been suggested that the 24-hour incubation period misses some dormant infections (6).
Other laboratory parameters could account for minor variations in results around the positive threshold. For replicate serum samples, the manufacturer estimated the overall test-related coefficient of variation for QFT-G to range from 7.6 to 12.3 (mean, 8.7 ± 0.7) for concentrations of 0.33 to 7.7 IU/ml, with higher coefficients of variation at lower concentrations (7). In our study, two of three conversions had follow-up values of <10% from the positive threshold of 0.35 IU/ml, and the third was within 20% of the threshold. Given current plate configurations, running paired serial samples may not always be feasible in consecutive enrollment series, although inclusion of reference samples on every plate would help to quantify this problem. For the in-tube collection system, the degeneracy of the four-point dilution curve after logarithmic curve transformation may also play a role in the nonspecific variation of positive results of <1 IU/ml. In our series, approximately one-half of positive results fell within the range of 0.35 to 1 IU/ml, including 40% between 0.35 and 0.7 IU/ml. Although the test is currently packaged for qualitative interpretation, the use of absolute values and the establishment of an equivocal or "borderline" zone—as is common with many ELISAs—would be useful to physicians.
Another important source of variability in repeated testing studies is test-related error. In the absence of a "gold standard" for latent TB infection, the contribution of test operating characteristics to inconsistent test results is difficult to estimate; however, because serial testing studies will propagate test-related error according to predictable Bayesian laws (38), they can be useful staging grounds for exploratory probabilistic models (2). Although the specificity of the QFT is considered excellent, the sensitivity of the test, particularly for screening populations, is less well understood. While our model was greatly simplified, our estimate of QFT-G-IT sensitivity at around 70% is consistent with estimates from clinically based studies (11, 29, 32). As illustrated by our uncertainty analysis, reversion rates in a high-specificity scenario depend predominantly on the sensitivity of testing. Thus, one explanation for the consistently high reversion rates observed in studies to date is the relative sensitivity of the test in a high-risk context. Conversely, the model also predicted that false-conversion rates, which depend on the negative predictive value of an antecedent test, might vary more widely across studies due to differences in underlying risk, such as exposure gradient and age, as well as with use of different cut points that alter relative sensitivity and specificity locally (15, 20, 22). Probabilistic models can be useful adjuncts in diagnostic technology assessment for approaching such questions as verification bias, precision, and sample size requirements for "two-stage" designs (19).
Because the QFT-G does not cross-react with BCG or most NTM, it has been considered a candidate for use in screening protocols such as two-step testing or repeated screening, where TST boosting is also a concern (26). Although the foreign-born are not candidates per se for regular repeated screening programs, our population was fairly typical of groups with ongoing indications for targeted screening in our area, and also of the testing limitations posed. In this series, if the QFT-G replaced TST outright, nearly one-half (48%) of TST positives, including one-third of those with indurations of
15 mm, would not be referred for prophylaxis evaluation, a discrepancy that is also noted in cross-sectional studies (21, 25). Of those referred, one-third would not be confirmed if the test were repeated as part of a clinical workup. Alternatively, if the QFT-G were used to confirm TST converters, two of five would have been detected at baseline, although one of these two would have been shown to revert if the QFT-G-IT were repeated. In a larger series from The Gambia, 25% of TST converters with positive baseline ELISPOT results reverted (17). In our study, among the subjects who were both QFT-G-IT and TST negative at baseline, not one of the seven converters identified by either test was identified by the other test, despite the fact that four of five TST converters had increments of greater than 10 mm. Most of our vaccinees were adults from Mexico, where BCG vaccination is usually given once at birth. Significantly boosted reactions in adults are unlikely to be explained by BCG vaccination at birth or NTM (14). The reasons for TST boosting in high-risk populations, including why IGRAs appear to anticipate this phenomenon (17, 31, 40), are not straightforward. Although the tests have excellent agreement for unvaccinated populations, the adoption of highly specific antigens in the screening of more immunologically complex populations is likely to be complex as well. Although our 3-month testing interval does not distinguish boosting from true conversion, the context nonetheless captures the uncertainties faced by screening programs working in dynamic immigrant communities. These issues emphasize the need to evaluate different screening algorithms (30) to support the adoption of QFT-G in serial testing protocols (29, 35).
This study has several limitations. First, individuals reporting a positive TST history and those testing positive at the first visit repeated the skin test at their discretion. Because TST history is often inaccurate, particularly in multiply tested or BCG-vaccinated populations (27), misclassification could have biased the discordance analysis. However, five of six prior positives who chose to repeat the test were also TST positive at baseline, and six of eight TST positives who declined a baseline or follow-up skin test were persistently positive for the QFT-TB with PPD antigen (with two others, including one individual who reported taking isoniazid 10 years previously, being negative due only to avian sensitin response) (31). The use of prior positive TST results in analysis was likely to bias analysis conservatively. Although recall of BCG vaccination may also be inaccurate, we found little difference in agreement, whether based on self-report or observation of scar. Second, although false-conversion rates at the second visit could have been caused by the antecedent TST at the first visit 3 months earlier, a recent study suggests this is unlikely (24). A third limitation may have been loss to follow-up, inasmuch as 22 (25%) did not complete the second QFT-G-IT. However, there was no difference in the proportions of baseline positives and negatives who were not available for the second visit. Finally, because of small sample size we could not adequately evaluate risk factors associated with conversions and reversions, indicating the need for larger studies examining the reproducibility of the QFT-G in foreign-born and other high-risk screening populations.
Acknowledging these limitations, our study offers additional information about the reproducibility of the QFT-G-IT and discordance with TST. For a high-risk, BCG-vaccinated population, the agreement of QFT-G-IT and TST was moderate to poor, and nonspecific threshold variation in QFT-G results was common. Although the specificity of the test is an important explanation of cross-sectional TST discordance, much remains to be learned about the kinetics of diagnostic responses to early and late antigens in populations with different patterns of remote and recent exposure to mycobacterial antigens—a research agenda the IGRAs are revealing at the same time they are being adopted. Although the CDC has recommended use of the QFT-G "in all circumstances" where the TST is indicated, additional studies are needed to understand the reproducibility and relative accuracy of the test before its utility in prospective screening programs can be defined.
We gratefully acknowledge the assistance of the following community clinics with recruitment for this study: Santa Clara Valley Health Center Emergency Department (San Jose, CA), Santa Clara Valley Health Center Pediatric Clinic (San Jose, CA), East Valley Pediatric and Urgent Care Clinics (San Jose, CA), and Santa Clara County Office of Environmental Health (San Jose, CA).
Published ahead of print on 16 January 2008. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»