U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Chou R, Cottrell EB, Wasson N, et al. Screening for Hepatitis C Virus Infection in Adults [Internet]. Rockville (MD): Agency for Healthcare Research and Quality (US); 2012 Nov. (Comparative Effectiveness Reviews, No. 69.)

  • This publication is provided for historical reference only and the information may be out of date.

This publication is provided for historical reference only and the information may be out of date.

Cover of Screening for Hepatitis C Virus Infection in Adults

Screening for Hepatitis C Virus Infection in Adults [Internet].

Show details

Methods

Topic Development

The topic of HCV screening was nominated for a comparative effectiveness review (CER) in a public process. The Key Questions were proposed in the public nomination process and developed by investigators from the Evidence-based Practice Center (EPC) with input from expert Key Informants, who helped to refine Key Questions, identify important methodological and clinical issues, and define parameters for the review of evidence. The revised Key Questions were then posted to a public Web site for comment. The Agency for Healthcare Research and Quality (AHRQ) and the EPC agreed upon the final Key Questions after reviewing the public comments and receiving additional input from a Technical Expert Panel (TEP) convened for this report.

Search Strategy

To identify articles relevant to each Key Question, a research librarian searched Ovid® MEDLINE (see Appendix A. Exact Search Strategy), EMBASE, Scopus, and PsycINFO from 1947 to May 2012. Gray literature was identified by searching clinical trial registries (Ovid® EBM Reviews: Cochrane Central Register of Controlled Trials, ClinicalTrials.gov, Current Controlled Trials, Clinical Trial Results, and WHO Trial Registries) and grants databases (NIHRePORTER, HSRProj, and AHRQ GOLD). We supplemented the electronic searches by reviewing the reference lists of retrieved articles.

We updated searches prior to finalization of the report to identify new publications.

Study Selection

We developed criteria for inclusion and exclusion of studies based on the Key Questions and the populations, interventions, comparators, outcomes, timing, and setting (PICOTS) approach. Inclusion and exclusion criteria, summarized below, are described in more detail by Key Question in Appendix B. Papers were selected for full review if they were about chronic HCV infection, were relevant to Key Questions in the analytic framework, and met the predefined inclusion criteria. We restricted inclusion to English language articles since translation of foreign language articles was not feasible due to resource limitations and excluded studies only published as abstracts. Studies of nonhuman subjects were also excluded, and studies had to include original data.

Abstracts and full-text articles were dual reviewed for inclusion or exclusion for each Key Question. Full-text articles were obtained for all studies that either investigator identified as potentially meeting inclusion criteria. Two investigators independently reviewed all full-text articles for final inclusion or exclusion (Appendix C. Included studies list). A list of excluded studies can be found in Appendix D. Discrepancies were resolved through discussion and consensus, and a third investigator was included in the discussion if necessary.

Population and Conditions of Interest

The target population was adults without signs or symptoms of liver disease or known liver function test abnormalities. Specific Key Questions (1b and 7) addressed screening in pregnant women. We excluded children because of the low prevalence of anti-HCV antibodies (0.2–0.4 percent in 6–19 years old)15 and because of limited data on benefits and harms of antiviral treatments in children. We excluded specific populations such as post-transplant patients, HIV patients, and hemodialysis patients, because screening test characteristics, natural history of HCV infection, and treatment considerations may differ from what is observed in the general population.5963 In addition, evaluation of such patients for chronic HCV infection may be indicated for other reasons such as for informing use of antiretroviral therapies in individuals with HIV infection or assessing prognosis. Patients with occupational exposures were excluded because of consensus regarding screening after percutaneous exposures.64 See Appendix B for detailed inclusion and exclusion criteria.

Interventions and Comparators

Our review assumed screening with a later-generation HCV enzyme-linked immunoassay (ELISA) as the initial test, with confirmatory recombinant immunoblot assay (RIBA) or nucleic acid testing for HCV infection for positive ELISA.44 We considered patients to have chronic HCV infection if they had hepatitis C viremia based on reverse transcriptase polymerase chain reaction (PCR) or nucleic acid testing. Diagnostic accuracy of HCV antibody testing was reviewed for an earlier report and was not re-reviewed, given the high accuracy of later-generation ELISA testing for HCV antibody with confirmatory RIBA (sensitivity of third-generation ELISA 94 percent or higher and specificity 97 percent or higher; positive predictive value 73 to 86 percent), followed by PCR testing to detect viremia in those with positive tests.44 Rather, this report focused on the effects of different screening strategies on clinical outcomes (Key Question 2a) and their yield (sensitivity) and efficiency (number needed to screen to identify one HCV infection) (Key Question 2b). A rapid HCV test was approved by the US Food and Drug Administration (FDA) in 2011 for point-of-care testing, with diagnostic accuracy comparable to standard HCV testing, but is not yet in widespread use.6567

In most patients with chronic HCV infection, liver biopsy is still recommended as a standard part of the workup for guiding decisions regarding eligibility for antiviral treatments.45 The absence of bridging fibrosis (METAVIR F0-F2, Ishak stage 0–3, or equivalent) on liver biopsy is associated with a low likelihood for liver-related complications over the next 10 to 20 years and is an important consideration when making individualized treatment decisions.68 However, liver biopsy is invasive and associated with potential complications, is subject to sampling errors, and requires expertise and judgment to interpret. Therefore, a number of tests (including blood tests and imaging studies) have been proposed as potential noninvasive alternatives to biopsy. We evaluated the diagnostic accuracy of noninvasive tests for identifying fibrosis or cirrhosis in patients with HCV infection compared with liver biopsy as the reference standard. We excluded the 13c methacetin breath test69 and ultrasonographic transient elastography,70 as these are not approved by the FDA and are not in widespread use in the United States.

For treatment of chronic HCV infection, we focused on evidence regarding effects of interventions for reducing risky behaviors associated with transmission of HCV infection, counseling regarding alcohol use, and immunizations for hepatitis A and hepatitis B virus infections. Alcohol use is associated with accelerated liver disease in people with HCV infection and becoming infected with hepatitis A or hepatitis B virus infection may result in fulminant hepatitis or more rapid progression. We also evaluated how knowledge of HCV-positive status affects risky behaviors and alcohol use. Antiviral treatments for HCV infection will be reviewed in a separate report.58

For interventions in pregnant women, we focused on evidence regarding effects of labor and delivery and postnatal interventions and practices on risk of vertical transmission. These include mode of delivery (cesarean vs. vaginal delivery), breastfeeding, use of internal fetal monitoring, and management of premature rupture of membranes. Antiviral therapy is contraindicated in pregnant women due to potential teratogenic effects. Management of HCV infection in children was outside the scope of this review.

Outcomes

Clinical outcomes assessed were mortality, end-stage liver disease, cirrhosis, hepatocellular cancer, need for transplantation, quality of life, and HCV transmission. Intermediate outcomes were sustained virological response, histological changes, and reductions in high-risk behaviors (such as alcohol use or intravenous drug use behaviors). Harms of screening included labeling and anxiety. We also reviewed adverse outcomes from screening and treatment including effects of diagnosing chronic HCV infection on quality of life, psychological outcomes, and social and family relationships. We also reviewed adverse outcomes associated with percutaneous liver biopsy such as bleeding, gut perforation, pain, and other complications.

For diagnostic accuracy of noninvasive blood tests for evaluating patients with chronic HCV infection, we evaluated sensitivity and specificity against liver biopsy (considered the reference standard). Because sensitivity and specificity varies depending on the cutoff evaluated, we also evaluated the area under the receiver operating characteristic curve (AUROC), a measure of discrimination that incorporates diagnostic information at multiple cutoffs. An AUROC of >0.90 is often interpreted as indicating excellent discrimination, >0.80 to 0.90 good discrimination, >0.70 to 0.80 fair discrimination, and ≤0.70 poor, though cutoffs are somewhat arbitrary. We did not focus on predictive values because they vary depending on the prevalence of the population being evaluated. All of the studies of diagnostic accuracy evaluated referral populations with substantially higher prevalence of fibrosis and cirrhosis than would be expected in screen-detected patients.

Timing

We did not apply a minimum threshold for duration of studies.

Setting

Studies conducted in primary care and specialty settings were included.

Types of Studies

We included randomized trials, cohort studies, and case-control studies pertinent to all Key Questions. If such studies were not available, we included cross-sectional studies and intervention series. We also included studies that reported the diagnostic accuracy of noninvasive tests for evaluating fibrosis or cirrhosis in patients with chronic HCV infection compared with liver biopsy. See appendix B for detailed inclusion and exclusion criteria.

Data Extraction

We extracted the following data from included trials: study design, setting, population characteristics (including sex, age, ethnicity/race, and diagnosis), eligibility and exclusion criteria, HCV infection intervention and comparisons, the method of outcome ascertainment if available, and results for each outcome. Evidence tables with included studies are presented for all Key Questions unless there was only very weak evidence (e.g., because of major methodological shortcomings or study designed without (comparison groups).

For studies reporting the diagnostic yield of different screening strategies, we computed the number needed to screen to identify one case of HCV infection by dividing the number of screening tests performed by the number of HCV cases identified. The proportion screened was the number of patients screened upon application of a particular screening strategy, divided by the total number of patients assessed.

For studies of diagnostic accuracy, we attempted to create 2×2 tables from information provided (usually sample size, prevalence, sensitivity, and specificity) and compared calculated measures of diagnostic accuracy based on the 2×2 tables with reported results. Although we abstracted data for severe fibrosis (defined as biopsy showing METAVIR F3-F4, Ishak 4–6, or equivalent), we summarized results for fibrosis (defined as biopsy showing METAVIR F2-F4, Ishak 3–6, or equivalent) and cirrhosis (defined as biopsy showing METAVIR F4, Ishak 5–6, or equivalent), unless there was insufficient evidence for fibrosis. We also abstracted reported area under the receiver operating characteristic curve (AUROC).71, 72 The AUROC, which is based on sensitivities and specificities across a range of test results, is a measure of discrimination, or the ability of a test to distinguish people with a condition from people without. An AUROC of 1.0 indicates perfect discrimination, and an AUROC of 0.5 indicates complete lack of discrimination. Interpretation of AUROC values between 0.5 and 1.0 is somewhat arbitrary, but a value of 0.90 to <1.0 may be classified as excellent, 0.80 to <0.90 good, 0.70 to <0.80 fair, and <0.70 poor. Data abstraction for each study was completed by two investigators: the first abstracted the data, and the second reviewed the abstracted data for accuracy and completeness. See Appendix G for evidence tables of extracted data.

Quality Assessment of Individual Studies

We assessed the quality of each study based on predefined criteria. We adapted criteria from methods proposed by Downs and Black (observational studies),73 the USPSTF,74 and the Quality Assessment of Diagnostic Accuracy Studies-2 Group.75 The criteria used are consistent with the approach recommended by AHRQ in the Methods Guide for Comparative Effectiveness Reviews.76 We used the term “quality” rather than the alternate term “risk of bias”; both refer to internal validity.

We rated the quality of each randomized trial based on the methods used for randomization, allocation concealment, and blinding; the similarity of compared groups at baseline; maintenance of comparable groups; adequate reporting of dropouts, attrition, crossover, adherence, and contamination; loss to followup; the use of intent-to-treat analysis; and ascertainment of outcomes.74

We rated the quality of each cohort study based on whether it used nonbiased selection methods to create an inception cohort; whether it evaluated comparable groups; whether rates of loss to followup were reported and acceptable; whether it used accurate methods for ascertaining exposures, potential confounders, and outcomes; and whether it performed appropriate statistical analyses of potential confounders.74 For assessing quality of each case-control study, we evaluated whether similar inclusion and exclusion criteria were applied to select cases and controls, whether it used accurate methods to identify cases, whether it used accurate methods for ascertaining exposures and potential confounders, and whether it performed appropriate statistical analyses of potential confounders.74

We rated the quality of each diagnostic accuracy study based on whether it evaluated a representative spectrum of patients, whether it enrolled a random or consecutive sample of patients meeting predefined criteria, whether it used a credible reference standard, whether the same reference standard was applied to all patients, whether the reference standard was interpreted independently from the test under evaluation, and whether thresholds were predefined.74, 75

Following assessment of individual quality criteria, individual studies were rated as “good,” “fair,” or “poor” quality, as defined below.76

Good-quality studies are considered likely to be valid. Good-quality studies clearly describe the population, setting, interventions, and comparison groups; use a valid method for allocation of patients to interventions; clearly report dropouts and have low dropout rates; use appropriate methods for preventing bias; and appropriately measure outcomes and fully report results.

Fair-quality studies have some methodological deficiencies, but no flaw or combination of flaws judged likely to cause major bias. The study may be missing information, making it difficult to assess its methods or assess limitations and potential problems. The fair-quality category is broad, and studies with this rating vary in their strengths and weaknesses: the results of some fair-quality studies are likely to be valid, while others are only probably valid.

Poor-quality studies have significant flaws that may invalidate the results. They have a serious or “fatal” flaw in design, analysis, or reporting; large amounts of missing information; or discrepancies in reporting. The results of these studies are judged to be at least as likely to reflect flaws in the study design as true effects of the interventions under investigation. We did not exclude studies rated poor quality a priori, but they were considered to be the least reliable studies when synthesizing the evidence, particularly when discrepancies between studies were present. For detailed quality assessment methods see Appendix E.

Assessing Research Applicability

We recorded factors important for understanding the applicability of studies, such as whether the publication adequately described the study population, how similar patients were to populations likely to be targeted by screening, whether differences in outcomes were clinically (as well as statistically) significant, and whether the interventions and tests evaluated were reasonably representative of standard practice.77 We also recorded the funding source and role of the sponsor. We did not assign a rating of applicability (such as “high” or “low”) because applicability may differ based on the user of this report.

Evidence Synthesis and Rating the Body of Evidence

We did not attempt to pool studies of screening or treatments quantitatively due to small numbers of studies, lack of randomized trials, and substantial clinical diversity with respect to the populations, settings, and comparisons evaluated. We also did not quantitatively pool results on diagnostic accuracy (such as creating summary receiver operating characteristic curves) due to differences across those studies in populations evaluated, differences in how fibrosis or cirrhosis were defined, and methodological limitations in the studies. Instead, we created descriptive statistics with the median sensitivity and specificity at specific cutoffs and reported AUROCs, along with associated ranges. The total range, rather than the interquartile range, was chosen because certain outcomes were only reported by a few studies and the summary range highlighted the greater variability (and uncertainty) in the estimates.

We assessed the overall strength of evidence for each body of evidence in accordance with the AHRQ Methods Guide for Comparative Effectiveness Reviews.76 We synthesized the quality of the studies; the consistency of results within and between study designs; the directness of the evidence linking the intervention and health outcomes; the precision of the estimate of effect (based on the number and size of studies and confidence intervals for the estimates); and strength of association (magnitude of effect). We were not able to formally assess for publication bias in studies of interventions due to small number of studies, methodological shortcomings, or differences across studies in designs, measured outcomes, and other factors. We rated the strength of evidence for each Key Question using the four categories recommended in the AHRQ Methods Guide:76 A “high” grade indicates high confidence that the evidence reflects the true effect and that further research is very unlikely to change our confidence in the estimate of effect. A “moderate” grade indicates moderate confidence that the evidence reflects the true effect and further research may change our confidence in the estimate of effect and may change the estimate. A “low” grade indicates low confidence that the evidence reflects the true effect and further research is likely to change the confidence in the estimate of effect and is likely to change the estimate. An “insufficient” grade indicates evidence either is unavailable or does not permit a conclusion. See Appendix F for strength of evidence tables.

Peer Review

Experts in gastroenterology, hepatology, and infectious disease fields and individuals representing stakeholder and user communities were invited to provide external peer review of this CER; AHRQ and an associate editor also provided comments. The draft report was posted on the AHRQ Web site for 4 weeks to elicit public comment. We addressed all reviewer comments, revising the text as appropriate, and documented everything in a disposition of comments report that will be made available 3 months after the Agency posts the final CER on the AHRQ Web site.

Views

  • PubReader
  • Print View
  • Cite this Page
  • PDF version of this title (3.9M)

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...