General

Guideline Title

Clinical practice guideline for diagnostic testing for adult obstructive sleep apnea: an American Academy of Sleep Medicine clinical practice guideline.

Bibliographic Source(s)

  • Kapur VK, Auckley DH, Chowdhuri S, Kuhlmann DC, Mehra R, Ramar K, Harrod CG. Clinical practice guideline for diagnostic testing for adult obstructive sleep apnea: an American Academy of Sleep Medicine clinical practice guideline. J Clin Sleep Med. 2017 Mar 15;13(3):479-504. [131 references] PubMed

Guideline Status

This is the current release of the guideline.

This guideline updates previous versions: Kushida CA, Littner MR, Morgenthaler T, Alessi CA, Bailey D, Coleman J Jr, Friedman L, Hirshkowitz M, Kapen S, Kramer M, Lee-Chiong T, Loube DL, Owens J, Pancer JP, Wise M. Practice parameters for the indications for polysomnography and related procedures: an update for 2005. Sleep. 2005 Apr 1;28(4):499-521. [150 references]

Collop NA, Anderson WM, Boehlecke B, Claman D, Goldberg R, Gottlieb DJ, Hudgel D, Sateia M, Schwab R, Portable Monitoring Task Force of the American Academy of Sleep Medicine. Clinical guidelines for the use of unattended portable monitors in the diagnosis of obstructive sleep apnea in adult patients. Portable Monitoring Task Force of the American Academy of Sleep Medicine. J Clin Sleep Med. 2007 Dec 15;3(7):737-47.

This guideline meets NGC’s 2013 (revised) inclusion criteria.

NEATS Assessment

Disclosure of Guideline Funding Source

  • Yes

Disclosure and Management of Financial Conflict of Interests

  • 5

Guideline Development Group Composition: Multidisciplinary Group

  • Unknown

Guideline Development Group Composition: Methodologist Involvement

  • Yes

Guideline Development Group Composition: Patient and Public Perspectives

  • 3

Use of a Systematic Review of Evidence: Search Strategy

  • 5

Use of a Systematic Review of Evidence: Study Selection

  • 5

Use of a Systematic Review of Evidence: Synthesis of Evidence

  • 5

Evidence Foundations for and Rating Strength of Recommendations: Grading the Quality or Strength of Evidence

  • 5

Evidence Foundations for and Rating Strength of Recommendations: Benefits and Harms of Recommendations

  • 5

Evidence Foundations for and Rating Strength of Recommendations: Evidence Summary Supporting Recommendations

  • 5

Evidence Foundations for and Rating Strength of Recommendations: Rating the Strength of Recommendations

  • 5

Specific and Unambiguous Articulation of Recommendations

  • 5

External Review

  • 3

Updating

  • 3

Recommendations

Major Recommendations

The quality of evidence ( High-Very Low ) and strengths of recommendations ( Strong, Weak ) are defined at the end of the “Major Recommendation” field.

Clinical Practice Recommendations

The following clinical practice recommendations are based on a systematic review and evaluation of evidence following the Grading of Recommendations Assessment, Development and Evaluation (GRADE) methodology. Remarks are provided to guide clinicians in the implementation of these recommendations. All figures, including meta-analyses and Summary of Findings tables are presented in the supplemental material (see the original guideline document). Table 5 in the original guideline document shows a summary of the recommendation statements including the strength of recommendation and quality of evidence. A decision tree for the diagnosis of patients suspected of having obstructive sleep apnea (OSA) is presented in Figure 2 in the original guideline document.

The following are good practice statements, the implementation of which is deemed necessary for appropriate and effective diagnosis and management of OSA.

  • Diagnostic testing for OSA should be performed in conjunction with a comprehensive sleep evaluation and adequate follow-up.
  • Polysomnography is the standard diagnostic test for the diagnosis of OSA in adult patients in whom there is a concern for OSA based on a comprehensive sleep evaluation.

Diagnosis of Obstructive Sleep Apnea in Adults Using Clinical Tools, Questionnaires and Prediction Algorithms

Recommendation 1 : The Task Force (TF) recommends that clinical tools, questionnaires and prediction algorithms not be used to diagnose OSA in adults, in the absence of polysomnography or home sleep apnea testing. ( Strong )

Home Sleep Apnea Testing for the Diagnosis of Obstructive Sleep Apnea in Adults

Recommendation 2 : The TF recommends that polysomnography, or home sleep apnea testing (HSAT) with a technically adequate device, be used for the diagnosis of OSA in uncomplicated adult patients presenting with signs and symptoms that indicate an increased risk of moderate to severe OSA. ( Strong )

Recommendation 3 : The TF recommends that if a single home sleep apnea test is negative, inconclusive or technically inadequate, polysomnography be performed for the diagnosis of OSA. ( Strong )

Remarks: The following remarks are based on specifications used by studies that support these recommendation statements:

An uncomplicated patient is defined by the absence of:

  1. Conditions that place the patient at increased risk of non-obstructive sleep-disordered breathing (e.g., central sleep apnea, hypoventilation and sleep related hypoxemia). Examples of these conditions include significant cardiopulmonary disease, potential respiratory muscle weakness due to neuromuscular conditions, history of stroke and chronic opiate medication use.
  2. Concern for significant non-respiratory sleep disorder(s) that require evaluation (e.g., disorders of central hypersomnolence, parasomnias, sleep related movement disorders) or interfere with accuracy of HSAT (e.g., severe insomnia).
  3. Environmental or personal factors that preclude the adequate acquisition and interpretation of data from HSAT.

An increased risk of moderate to severe OSA is indicated by the presence of excessive daytime sleepiness and at least two of the following three criteria: habitual loud snoring, witnessed apnea or gasping or choking, or diagnosed hypertension.

HSAT is to be administered by an accredited sleep center under the supervision of a board-certified sleep medicine physician, or a board-eligible sleep medicine provider.

A single HSAT recording is conducted over at least one night.

A technically adequate HSAT device incorporates a minimum of the following sensors: nasal pressure, chest and abdominal respiratory inductance plethysmography, and oximetry; or else peripheral arterial tonometry (PAT) with oximetry and actigraphy. For additional information regarding HSAT sensor requirements, refer to The American Academy of Sleep Medicine (AASM) Manual for the Scoring of Sleep and Associated Events.

A technically adequate diagnostic test includes a minimum of 4 hours of technically adequate oximetry and flow data, obtained during a recording attempt that encompasses the habitual sleep period.

Diagnosis of Obstructive Sleep Apnea in Adults with Comorbid Conditions

Recommendation 4 : The TF recommends that polysomnography, rather than home sleep apnea testing, be used for the diagnosis of OSA in patients with significant cardiorespiratory disease, potential respiratory muscle weakness due to neuromuscular condition, awake hypoventilation or suspicion of sleep-related hypoventilation, chronic opioid medication use, history of stroke or severe insomnia. ( Strong )

Diagnosis of Obstructive Sleep Apnea in Adults Using a Split-Night versus a Full-Night Polysomnography Protocol

Recommendation 5 : The TF suggests that, if clinically appropriate, a split-night diagnostic protocol, rather than a full-night diagnostic protocol for polysomnography be used in the diagnosis of OSA. ( Weak )

Remarks: Clinically appropriate is defined as the absence of conditions identified by the clinician that are likely to interfere with successful diagnosis and treatment using a split-night protocol.

This recommendation is based on a split-night protocol that initiates continuous positive airway pressure (CPAP) titration only when the following criteria are met: (1) a moderate to severe degree of OSA is observed during a minimum of 2 hours of recording time on the diagnostic polysomnography, AND (2) at least 3 hours are available for CPAP titration.

Repeat Polysomnography for the Diagnosis of Obstructive Sleep Apnea in Adults

Recommendation 6 : The TF suggests that when the initial polysomnogram is negative and there is still clinical suspicion for OSA, a second polysomnogram be considered for the diagnosis of OSA. ( Weak )

Definitions

Quality of a Body of Evidence

High : Corresponds to a high level of certainty that the true effect lies close to that of the estimate of the effect.

Moderate : Corresponds to a moderate level of certainty in the effect estimate; the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.

Low : Corresponds to a low level of certainty in the effect estimate; the true effect may be substantially different from the estimate of the effect.

Very low : Corresponds to very little certainty in the effect estimate; the true effect is likely to be substantially different from the estimate of effect.

Strength of Recommendations

Strong : A STRONG recommendation is one that clinicians should, under most circumstances, always follow (i.e., something that might qualify as a Quality Measure).

Weak : A WEAK recommendation reflects a lower degree of certainty in the appropriateness of the patient-care strategy and requires that the clinician use his/her clinical knowledge and experience, and refer to the individual patient’s values and preferences to determine the best course of action.

Example Characteristics of American Academy of Sleep Medicine (AASM) Strengths of Recommendations

AASM Strength of Recommendation Example Characteristics Guiding Recommendation
FOR STRONG
  • There is a high degree of clinical certainty that the balance between benefits vs. harms (i.e., net benefits ) favors benefits for this patient-care strategy.
  • The vast majority of well-informed patients would most likely choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.
WEAK
  • There is a lower degree of clinical certainty in the balance between benefits vs. harms (i.e., net benefits ) favors benefit for this patient-care strategy.
  • The majority of well-informed patients would most likely choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.
AGAINST WEAK
  • There is a lower degree of clinical certainty in the balance between benefits vs. harms (i.e., net harms ) of this patient-care strategy.
  • The majority of well-informed patients would most likely not choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.
STRONG
  • There is a high degree of clinical certainty in the balance between benefits vs. harms (i.e., net harms ) of this patient-care strategy.
  • The vast majority of well-informed patients would most likely not choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.

Clinical Algorithm(s)

An algorithm titled “Clinical Algorithm for Implementation of Clinical Practice Guidelines” is provided in the original guideline document.

Scope

Disease/Condition(s)

Obstructive sleep apnea (OSA)

Guideline Category

  • Diagnosis
  • Evaluation
  • Risk Assessment

Clinical Specialty

  • Family Practice
  • Internal Medicine
  • Otolaryngology
  • Pulmonary Medicine
  • Sleep Medicine

Intended Users

  • Advanced Practice Nurses
  • Allied Health Personnel
  • Nurses
  • Physician Assistants
  • Physicians
  • Respiratory Care Practitioners

Guideline Objective(s)

To combine and update information from prior guideline documents regarding the diagnosis of obstructive sleep apnea (OSA), including the optimal circumstances under which attended in-laboratory polysomnography (PSG) or home sleep apnea testing (HSAT) should be performed

Target Population

Adult patients with suspected obstructive sleep apnea (OSA)

Interventions and Practices Considered

  1. Diagnostic testing for obstructive sleep apnea (OSA) in conjunction with a comprehensive sleep evaluation
  2. Use of polysomnography (PSG) for diagnosis of OSA
  3. Use of clinical tools, questionnaires or prediction algorithms for diagnosis of OSA (not recommended)
  4. Home sleep apnea testing
  5. Diagnosis of OSA in patients with comorbid conditions
  6. Split-night versus full-night diagnostic protocols for PSG
  7. Repeat PSG

Major Outcomes Considered

  • Diagnostic accuracy
  • Subjective sleepiness
  • Continuous positive airway pressure (CPAP) adherence
  • Quality of life
  • Apnea-hypopnea index (AHI)
  • Depression
  • Cardiovascular endpoints

Methodology

Methods Used to Collect/Select the Evidence

  • Hand-searches of Published Literature (Primary Sources)
  • Hand-searches of Published Literature (Secondary Sources)
  • Searches of Electronic Databases

Description of Methods Used to Collect/Select the Evidence

Literature Searches, Evidence Review and Data Extraction

The Task Force (TF) performed a systematic review of the scientific literature to identify articles that addressed at least one of the nine PICO (Patient, Intervention, Comparison, Outcome) questions. Multiple literature searches were performed by American Academy of Sleep Medicine (AASM) staff using the PubMed and EMBASE databases, throughout the guideline development process (see Figure 1 in the original guideline document). The search yielded articles with various study designs, however the analysis was limited to randomized controlled trials (RCTs) and observational studies. The articles that were cited in the 2007 AASM clinical practice guideline, 2005 practice parameter, 2003 review, and 1997 review were included for data analysis if they met the study inclusion criteria described below.

The literature searches in PubMed were conducted using a combination of Medical Subject Headings (MeSH) terms and keywords as presented in the supplemental material (see the original guideline document). The PubMed database was searched from January 1, 2005 through July 26, 2012 for any relevant literature published since the last guideline. The PubMed search was expanded on September 26, 2012 to identify relevant articles published prior to January 1, 2005. Literature searches were also performed in EMBASE using a combination of terms and keywords as presented in the supplemental material. The EMBASE database was searched from January 1, 2005 through September 13, 2012. These searches yielded a total of 3,937 articles. There were 205 duplicates identified resulting in a total of 3,732 articles from both databases.

A second round of literature searches was performed in PubMed and EMBASE to capture more recent literature. The PubMed database was searched from July 27, 2012 to December 23, 2013, and the EMBASE database was searched from September 13, 2012 to December 23, 2013. These searches yielded a total of 2,061 articles. There were 670 duplicates identified resulting in 1,391 additional papers from both databases.

A final literature search was performed in PubMed to capture the latest literature. The PubMed database was searched from December 24, 2013 to June 29, 2016 and identified 2,129 articles.

Based on their expertise and familiarity with the literature, TF members submitted additional relevant literature and screened reference lists to identify articles of potential interest. This served as a “spot check” for the literature searches to ensure that important articles were not overlooked and identified an additional 140 publications.

A total of 7,392 abstracts were assessed by two reviewers to determine whether they met inclusion criteria presented below. Articles were excluded per the criteria listed below.

Literature Search Limits

January 1, 2005 to June 29, 2016; human studies; RCTs or observational studies; adults; English language

Inclusion Criteria

  • Diagnosis of obstructive sleep apnea (OSA) with polysomnography (PSG), home sleep apnea testing (HSAT), oximetry, or clinical prediction algorithm
  • Address one of nine PICO questions
  • Adults
  • Outcomes related to accuracy, inconclusive results, complications, quality of life, medical outcomes, adherence, efficiency of diagnosis or access to care

Exclusion Criteria

Treatment paper; no OSA; pediatric subjects; initial sample size <25 per condition, 50 total for PICO 2; initial sample size <10 per condition, 20 total for all other PICOs; wrong publication type (review, editorial, methodological, non-RCT or non-observational study); other sleep comorbidities besides OSA; hospitalized or general surgery; diagnostic test not in PICO question; time between HSAT and PSG >4 weeks; HSAT used in-lab; HSAT used simultaneously with PSG in-lab; multiple sleep latency test (MSLT); Maintenance of Wakefulness Test (MWT); and other nap tests performed

Number of Source Documents

  • A total of 98 studies were included in evidence base for recommendations.
  • A total of 86 studies were included in meta-analysis.

See Figure 1 in the original guideline document for details.

Methods Used to Assess the Quality and Strength of the Evidence

  • Weighting According to a Rating Scheme (Scheme Given)

Rating Scheme for the Strength of the Evidence

Quality of a Body of Evidence

High: Corresponds to a high level of certainty that the true effect lies close to that of the estimate of the effect.

Moderate: Corresponds to a moderate level of certainty in the effect estimate; the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.

Low: Corresponds to a low level of certainty in the effect estimate; the true effect may be substantially different from the estimate of the effect.

Very low: Corresponds to very little certainty in the effect estimate; the true effect is likely to be substantially different from the estimate of effect.

Methods Used to Analyze the Evidence

  • Meta-Analysis
  • Review of Published Meta-Analyses
  • Systematic Review with Evidence Tables

Description of the Methods Used to Analyze the Evidence

Meta-Analysis

Meta-analysis was performed on both diagnostic and clinical outcomes of interest for each PICO (Population/Problem, Intervention, Comparison, Outcome) question, when possible. Outcomes data for diagnostic approaches were categorized as follows: clinical tools, questionnaires, and prediction algorithms; history and physical exam; home sleep apnea testing (HSAT); attended polysomnography (PSG); split-night attended PSG; two-night attended PSG; single-night HSAT; multiple-night HSAT; follow-up attended PSG; and follow-up HSAT. The type of HSAT devices identified in literature search included type 2; type 3; 2–3 channel; single channel; oximetry; and peripheral arterial tonometry (PAT). A definition of these devices has been previously described. Adult patients were categorized as follows: suspected obstructive sleep apnea (OSA); suspected OSA with comorbid conditions; diagnosed OSA; and scheduled for upper airway surgery.

For diagnostic outcomes, the pretest probability for OSA (i.e., the prevalence within the study population), sensitivity and specificity of the tested diagnostic approach, and number of patients for each study was used to derive two-by-two tables (i.e., the number of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) diagnoses per 1,000 patients) in both high risk and low risk patients, for each OSA severity threshold (i.e., apnea-hypopnea index [AHI] ≥5, AHI ≥15, AHI ≥30). For analyses that included five or more studies, pooled estimates of sensitivity, specificity, and accuracy were calculated using hierarchical random effects modeling performed in STATA software (accuracy was derived by hierarchical summary receiver-operating characteristic [HSROC] curves). When analyses included fewer than five studies, ranges of sensitivity, specificity and accuracy were used. Based on their clinical expertise and a review of available literature, the Task Force (TF) established estimates of OSA prevalence among “low risk” and “high risk” patients for each OSA severity threshold. The TF envisioned a sleep clinic cohort of middle-aged obese men with typical symptoms of OSA as an example of a high-risk patient population. In contrast, a sleep clinic cohort of younger non-obese women with possible OSA symptoms was used as prototype for a low risk patient population. Prevalence estimates for these populations are presented in Table 4 in the original guideline document.

The sensitivity and specificity of included studies were entered into Review Manager 5.3 software to generate forest plots for each analysis. The estimates of sensitivity and specificity (pooled or ranges), and OSA prevalence were entered into the GRADE (Grading of Recommendations Assessment, Development and Evaluation) Guideline Development Tool (GDT) to generate the two-by-two tables. The TF determined the downstream consequences of an accurate diagnosis versus an inaccurate diagnosis (see supplemental material, Table S1 in the original guideline document), and used the estimates to weigh the benefits of an accurate diagnosis versus the harms of an inaccurate diagnosis. This information was used, in part, to assess whether a given diagnostic approach could be recommended when compared against PSG.

For clinical outcomes of interest, data on change scores were entered into the Review Manager 5.3 software to derive the mean difference and standard deviation between the experimental diagnostic approach and the gold standard or comparator. For studies that did not report change scores, data from post-treatment values taken from the last treatment time-point were used for meta-analysis. All meta-analyses of clinical outcomes were performed using the random effects model with results displayed as a forest plot. There was insufficient evidence to perform meta-analyses for PICOs 3 and 9 (see the original guideline document), thus no recommendations are provided.

Interpretation of clinical significance for the clinical outcomes of interest was conducted by comparing the absolute effects to the clinical significance threshold previously determined by the TF for each clinical outcome of interest (see Table 3 in the original guideline document).

Methods Used to Formulate the Recommendations

  • Expert Consensus

Description of Methods Used to Formulate the Recommendations

Expert Task Force

The American Academy of Sleep Medicine (AASM) commissioned a Task Force (TF) of board-certified sleep medicine physicians, with expertise in the diagnosis and management of adults with obstructive sleep apnea (OSA), to develop this guideline.

PICO Questions

A PICO (Patient, Population or Problem, Intervention, Comparison, and Outcomes) question template was used to develop clinical questions to be addressed in this guideline. PICO questions were developed based on a review of the existing AASM practice parameters on indications for use of polysomnography (PSG) and home sleep apnea testing (HSAT) for the diagnosis of patients with obstructive sleep apnea (OSA), and a review of systematic reviews, meta-analyses, and guidelines published since 2004. The AASM Board of Directors (BOD) approved the final list of PICO questions presented in Table 1 of the original guideline document before the literature search was performed. The PICO questions identify the commonly used approaches and devices for the diagnosis of OSA. Based on their expertise, the TF developed a list of patient-oriented clinically relevant outcomes that are indicative of whether a treatment should be recommended for clinical practice. A summary of the critical outcomes for each PICO is presented in Table 2 in the original guideline document. Lastly, clinical significance thresholds, used to determine if a change in an outcome was clinically significant, were defined for each outcome by TF clinical judgment, prior to statistical analysis. The clinical significance thresholds are presented by outcome in Table 3 in the original guideline document. It should be noted that there was insufficient evidence to directly address PICO question 1, as no studies were identified that compared the efficacy of clinical prediction algorithms to history and physical exam. However, the TF decided to compare the efficacy of clinical prediction algorithms to PSG and HSAT.

Strength of Recommendations

The assessment of evidence quality was performed according to the Grading of Recommendations Assessment, Development and Evaluation (GRADE) process. The TF assessed the following four components to determine the direction and strength of a recommendation: quality of evidence, balance of beneficial and harmful effects, patient values and preferences and resource use as described below.

  1. Quality of evidence : based on an assessment of the overall risk of bias (randomization, blinding, allocation concealment, selective reporting, and author disclosures), imprecision (clinical significance thresholds), inconsistency (I2 cutoff of 75%), indirectness (study population), and risk of publication bias (funding sources), the TF determined their overall confidence that the estimated effect found in the body of evidence was representative of the true treatment effect that patients would see. For diagnostic accuracy studies, the quality assessment of diagnostic accuracy studies (QUADAS)-2 tool was used in addition to the quality domains for the assessment of risk of bias in intervention studies. The quality of evidence was based on the outcomes that the TF deemed critical for decision-making.
  2. Benefits versus harms : based on the meta-analysis (if applicable), analysis of any harms or side effects reported within the accepted literature, and the clinical expertise of the TF, the TF determined if the beneficial outcomes of the intervention outweighed any harmful side effects.
  3. Patient values and preferences : based on the clinical expertise of the TF members and any data published on the topic relevant to patient preferences, the TF determined if patients would use the intervention based on the body of evidence, and if patient values and preferences would be generally consistent.
  4. Resource use : based on the clinical expertise of the TF members and a “spot check” for relevant literature the TF determined resource use to be important for determining whether to recommend the use of HSAT versus PSG, split-night versus full-night PSG and single-night versus multiple-night HSAT diagnostic protocols, and repeat testing. Resource use was not considered in-depth for clinical tools, questionnaires and prediction algorithms, diagnosis in adults with comorbid conditions, and repeat PSG.

Taking these major factors into consideration, each recommendation statement was assigned strength (“STRONG” or “WEAK”). Additional information is provided in the form of “Remarks” immediately following the recommendation statements, when deemed necessary by the TF. Remarks are based on the evidence evaluated during the systematic review, are intended to provide context for the recommendations, and to guide clinicians in implementing the recommendations in daily practice.

Discussions accompany each recommendation to summarize the relevant evidence and explain the rationale leading to each recommendation. These sections are an integral part of the GRADE system and offer transparency to the process (see the original guideline document).

Rating Scheme for the Strength of the Recommendations

Strength of Recommendations

Strong : A STRONG recommendation is one that clinicians should, under most circumstances, always follow (i.e., something that might qualify as a Quality Measure).

Weak : A WEAK recommendation reflects a lower degree of certainty in the appropriateness of the patient-care strategy and requires that the clinician use his/her clinical knowledge and experience, and refer to the individual patient’s values and preferences to determine the best course of action.

Example Characteristics of American Academy of Sleep Medicine (AASM) Strengths of Recommendations

AASM Strength of Recommendation Example Characteristics Guiding Recommendation
FOR STRONG * There is a high degree of clinical certainty that the balance between benefits vs. harms (i.e., net benefits) favors benefits for this patient-care strategy. * The vast majority of well-informed patients would most likely choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.
WEAK * There is a lower degree of clinical certainty in the balance between benefits vs. harms (i.e., net benefits) favors benefit for this patient-care strategy. * The majority of well-informed patients would most likely choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.
AGAINST WEAK * There is a lower degree of clinical certainty in the balance between benefits vs. harms (i.e., net harms) of this patient-care strategy. * The majority of well-informed patients would most likely not choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.
STRONG * There is a high degree of clinical certainty in the balance between benefits vs. harms (i.e., net harms) of this patient-care strategy. * The vast majority of well-informed patients would most likely not choose this patient-care strategy, compared to alternative patient-care strategies or no treatment.

Cost Analysis

  • Though a single night of home sleep apnea testing (HSAT) is less resource- intensive than a single night of polysomnography (PSG), the relative cost-effectiveness of management pathways that incorporate each of these diagnostic strategies is unclear. Economic analyses have compared the cost-effectiveness of management pathways that incorporate diagnostic strategies using HSAT or PSG. All have concluded that PSG is the preferred diagnostic strategy from an economic perspective for adults suspected to have moderate to severe obstructive sleep apnea (OSA). An important factor in these analyses is the favorable cost-effectiveness of OSA treatment in patients with moderate to severe OSA, particularly when longer time horizons are considered. As a result, diagnostic strategies that lead to increased false negatives, and leave patients untreated, or increase false positives, and unnecessarily treat patients, have less favorable cost-effectiveness. It is important to note that these economic analyses are susceptible to error because of imprecision in modelling of management pathways and limitations in the quality of data available to estimate parameters. The impact of errors can be magnified when extrapolated over long time horizons.
  • A randomized controlled trial (RCT) that assessed relative cost-effectiveness of management pathways that use HSAT or PSG for diagnosis of OSA reported that in-trial costs were 25% less in the home arm than the in-laboratory arm. These estimates were based on the Medicare Fee Schedule for the various study procedures, including office visits and diagnostic testing, and take into account the need to repeat studies. A subsequent cost minimization analysis of this RCT also considered costs from a provider perspective. While provider costs (capital, labor, overhead) were generally less for the home program, this was not true for all modelled scenarios. The provider perspective highlighted the large number of cost components necessary to ensure high quality home-based OSA management, which narrowed the cost difference relative to lab management.
  • The available studies indicate that the potential cost advantages of HSAT over PSG are not as high as reflected by the cost difference of a single night of testing. Even when HSAT is used in appropriate populations and conditions, additional HSAT and PSG are needed for patients with technically inadequate or inconclusive studies, in order to achieve an accurate diagnosis. In addition, if a home management pathway is used in a manner that results in reduced effectiveness relative to PSG, use of HSAT could in fact be less cost effective than using PSG. Examples of this include use in patient populations with predominantly mild OSA in which there are a higher proportion of negative or indeterminate HSAT results that require follow-up PSG, or use in patients at risk for non-obstructive sleep-related breathing disorders that may not be accurately diagnosed with HSAT. The TF determined that if HSAT is used in the recommended context and management pathway, it would be more cost-effective than if it is used outside this framework.
  • A single cost-effectiveness analysis demonstrated that split-night studies were less costly than full-night studies based on cost per quality of life year (QALY) gained ($1,979 versus $2,092) and would be considered more cost-effective than full-night studies when third-party willingness to pay fell below $11,500 per QALY gained (a level of cost per QALY that would still be considered a good value for payers). However, the TF had low confidence in the certainty of resource use, given the lack of high quality evidence to inform cost effectiveness.

Method of Guideline Validation

  • External Peer Review
  • Internal Peer Review

Description of Method of Guideline Validation

A draft of the guideline was available for public comment for a two-week period on the American Academy of Sleep Medicine (AASM) Web site. The Task Force (TF) took into consideration all the comments received and made revisions when appropriate. The revised guideline was submitted to the AASM Board of Directors (BOD) who approved these recommendations.

Evidence Supporting the Recommendations

Type of Evidence Supporting the Recommendations

The “Overall Quality of Evidence” sections following each recommendation in the original guideline document summarize the quality of evidence supporting individual recommendations. Also see Table 5 in the original guideline document, which provides a summary of the recommendation statements including the quality of evidence for each recommendation.

Benefits/Harms of Implementing the Guideline Recommendations

Potential Benefits

  • Use of home sleep apnea testing (HSAT) may provide potential benefits to patients with suspected obstructive sleep apnea (OSA). Such benefits could include convenience, comfort, increased access to testing, and decreased cost. HSAT can be performed in the home environment with fewer attached sensors during sleep. The availability of HSAT for diagnosis may improve access to diagnostic testing in resource-limited settings, or when the patient is unable to leave the home or healthcare setting for testing. In addition, HSAT may be less costly when used appropriately.
  • The split-night protocol, in comparison to a full-night baseline assessment followed by a separate positive airway pressure (PAP) titration, has the potential to provide the needed diagnostic information and effective continuous positive airway pressure (CPAP) settings within the same recording.
  • A second night of polysomnography (PSG) in symptomatic patients allows for the diagnosis of OSA in 8% to 25% of patients with initial false negative studies. Establishing a diagnosis of OSA in these patients allows for treatment that leads to improved symptom control (e.g., less daytime sleepiness), better QOL, and potentially decreased cardiovascular morbidity over time.

Refer to the “Benefits versus Harms” sections in the original guideline document for benefits/harms assessment of specific recommendations.

Potential Harms

  • Harms of home sleep apnea testing (HSAT) could result from the need for additional diagnostic testing among patients with technically inadequate or inconclusive HSAT findings, or from false-positive and false-negative results leading either to unnecessary testing and treatment or to misdiagnosis and subsequent inappropriate therapy or lack of therapy.
  • Potential disadvantages of the split-night study include insufficient diagnostic sampling (e.g., limited rapid eye movement [REM] sleep time and limited supine time in those with difficulty initiating sleep), and insufficient time to ascertain appropriate continuous positive airway pressure (CPAP) treatment settings.
  • Routinely repeating a polysomnography (PSG) in patients with an initial negative PSG has potential downsides. There is a risk that repeat testing could lead to false positive cases being identified, and unnecessarily treated. In addition, the routine use of a 2-night study protocol would cause inconvenience to the patient, increased utilization of resources and healthcare costs, and perhaps even delays in the care of other patients awaiting PSG.

Refer to the “Benefits versus Harms” sections in the original guideline document for benefits/harms assessment of specific recommendations.

Qualifying Statements

Qualifying Statements

  • The recommendations in this guideline define principles of practice that should meet the needs of most patients in most situations. This guideline should not, however, be considered inclusive of all proper methods of care or exclusive of other methods of care reasonably used to obtain the same results. A STRONG recommendation is one that clinicians should, under most circumstances, always follow (i.e., something that might qualify as a Quality Measure). A WEAK recommendation reflects a lower degree of certainty in the appropriateness of the patient-care strategy and requires that the clinician use his/her clinical knowledge and experience, and refer to the individual patient’s values and preferences to determine the best course of action. The ultimate judgment regarding the suitability of any specific recommendation must be made by the clinician, in light of the individual circumstances presented by the patient, the available diagnostic tools, the accessible treatment options, and available resources.
  • The American Academy of Sleep Medicine (AASM) expects this guideline to have an impact on professional behavior, patient outcomes, and possibly, health care costs. This clinical practice guideline reflects the state of knowledge at the time of the literature review and will be reexamined and updated as new information becomes available.
  • Refer to the “Discussions and Future Directions” section in the original guideline document for information on limitations of the literature.

Implementation of the Guideline

Description of Implementation Strategy

An implementation strategy was not provided.

Implementation Tools

  • Clinical Algorithm

Institute of Medicine (IOM) National Healthcare Quality Report Categories

IOM Care Need

  • Getting Better
  • Living with Illness

IOM Domain

  • Effectiveness
  • Patient-centeredness

Identifying Information and Availability

Bibliographic Source(s)

  • Kapur VK, Auckley DH, Chowdhuri S, Kuhlmann DC, Mehra R, Ramar K, Harrod CG. Clinical practice guideline for diagnostic testing for adult obstructive sleep apnea: an American Academy of Sleep Medicine clinical practice guideline. J Clin Sleep Med. 2017 Mar 15;13(3):479-504. [131 references] PubMed

Adaptation

Not applicable: The guideline was not adapted from another source.

Date Released

2017 Mar 15

Guideline Developer(s)

  • American Academy of Sleep Medicine - Professional Association

Source(s) of Funding

The development of this clinical practice guideline was funded by the American Academy of Sleep Medicine.

Guideline Committee

Expert Task Force

Composition of Group That Authored the Guideline

Task Force Members : Vishesh K. Kapur, MD, MPH, University of Washington, Seattle, WA; Dennis H. Auckley, MD, MetroHealth Medical Center and Case Western Reserve University, Cleveland, OH; Susmita Chowdhuri, MD, John D. Dingell VA Medical Center and Wayne State University, Detroit, MI; David C. Kuhlmann, MD, Bothwell Regional Health Center, Sedalia, MO; Reena Mehra, MD, MS, Cleveland Clinic, Cleveland, OH; Kannan Ramar, MBBS, MD, Mayo Clinic, Rochester, MN; Christopher G. Harrod, MS, American Academy of Sleep Medicine, Darien, IL

Financial Disclosures/Conflicts of Interest

The Task Force (TF) was required to disclose all potential conflicts of interest (COI) according to the American Academy of Sleep Medicine’s (AASM’s) COI policy, both prior to being appointed to the TF, and throughout the research and writing of this paper. In accordance with the AASM’s conflicts of interest policy, TF members with a Level 1 conflict were not allowed to participate. TF members with a Level 2 conflict were required to recuse themselves from any related discussion or writing responsibilities.

Disclosure Statement

Dr. Auckley receives royalties from Up-to-Date and is a consultant for the American Board of Internal Medicine. Dr. Chowdhuri has received research support from the Veterans’ Health Administration. Dr. Mehra has received research support from Philips Respironics and Resmed in the form of equipment used in clinical research. Dr. Mehra also received grant support from the NIH/NHLBI and royalties from Up-to-Date. Mr. Harrod is an employee of the American Academy of Sleep Medicine. The other authors have indicated no financial conflicts of interest.

Guideline Status

This is the current release of the guideline.

This guideline updates previous versions: Kushida CA, Littner MR, Morgenthaler T, Alessi CA, Bailey D, Coleman J Jr, Friedman L, Hirshkowitz M, Kapen S, Kramer M, Lee-Chiong T, Loube DL, Owens J, Pancer JP, Wise M. Practice parameters for the indications for polysomnography and related procedures: an update for 2005. Sleep. 2005 Apr 1;28(4):499-521. [150 references]

Collop NA, Anderson WM, Boehlecke B, Claman D, Goldberg R, Gottlieb DJ, Hudgel D, Sateia M, Schwab R, Portable Monitoring Task Force of the American Academy of Sleep Medicine. Clinical guidelines for the use of unattended portable monitors in the diagnosis of obstructive sleep apnea in adult patients. Portable Monitoring Task Force of the American Academy of Sleep Medicine. J Clin Sleep Med. 2007 Dec 15;3(7):737-47.

This guideline meets NGC’s 2013 (revised) inclusion criteria.

Guideline Availability

Available from the American Academy of Sleep Medicine Web site.

Availability of Companion Documents

The following is available:

  • Morgenthaler TI, Deriy L, Heald JL, Thomas SM. The evolution of the AASM clinical practice guidelines: another step forward. J Clin Sleep Med. 2016;12(1):129-35. Available from the American Academy of Sleep Medicine Web site.

Patient Resources

None available

NGC Status

This NGC summary was completed by ECRI Institute on July 18, 2010. The information was verified by the guideline developer on August 5, 2010. This summary was updated by ECRI Institute on May 15, 2017. The updated information was verified by the guideline developer on May 26, 2017.

This NEATS assessment was completed by ECRI Institute on June 22, 2017. The information was verified by the guideline developer on June 26, 2017.

This NGC summary is based on the original guideline, which is subject to the guideline developer’s copyright restrictions. Please contact the American Academy of Sleep Medicine for information regarding reproduction of American Academy of Sleep Medicine guidelines.

Disclaimer

NGC Disclaimer

The National Guideline Clearinghouse™ (NGC) does not develop, produce, approve, or endorse the guidelines represented on this site.

All guidelines summarized by NGC and hosted on our site are produced under the auspices of medical specialty societies, relevant professional associations, public or private organizations, other government agencies, health care organizations or plans, and similar entities.

Guidelines represented on the NGC Web site are submitted by guideline developers, and are screened solely to determine that they meet the NGC Inclusion Criteria.

NGC, AHRQ, and its contractor ECRI Institute make no warranties concerning the content or clinical efficacy or effectiveness of the clinical practice guidelines and related materials represented on this site. Moreover, the views and opinions of developers or authors of guidelines represented on this site do not necessarily state or reflect those of NGC, AHRQ, or its contractor ECRI Institute, and inclusion or hosting of guidelines in NGC may not be used for advertising or commercial endorsement purposes.

Readers with questions regarding guideline content are directed to contact the guideline developer.