GRADE and quality of evidence - Students 4 Best Evidence (2024)

Posted on 5th October 2018 by Touko Kuusi

Tutorials and Fundamentals

GRADE and quality of evidence - Students 4 Best Evidence (1)

Our every visit to the doctor is built upon evidence

Let’s imagine that you visit a doctor due to chronic sleep problems (chronic insomnia). In this case, the doctor is a sleep specialist and follows the latest research, basing his conclusions on a comparative effectiveness review of treatments for insomnia. The review shows that there are two effective treatments: Cognitive behavioural therapy for insomnia (CBT-I) (moderate evidence) and various sleeping pills (low evidence).

Comparative effectiveness reviews and systematic reviews (in future sentences, both are referred to just as systematic reviews) are widely held as the gold standard of evidence in many sciences. In traditional reviews, authors typically collect together the studies they already know or can easily identify. This often leads to overemphasising studies that are in line with his/her own hypothesis. With systematic reviews, authors first specify a research question and then they have to search, identify, take into account and potentially statistically sum up all studies according to a carefully pre-planned strategy.

It is often illustrated that in systematic reviews, individual studies are the ‘participants’ of the meta-analysis. Often, and in the case of this blog, systematic reviews are used to determine how effective on average the intervention is (e.g. how many minutes the sleep time of insomniacs is increased on average).

What is GRADE?

The most prominent framework for evaluating the effectiveness of systematic reviews is GRADE (Grading quality of evidence and strength of recommendations). GRADE is used to rate the certainty of evidence for a treatment efficacy from high to very low. The GRADE system takes in two types of studies: randomized controlled trials (RCTs) and observational studies (also including non-randomized trials). In RCTs, one group of participants is randomized to a treatment and another group to placebo or an alternative condition. In observational studies, participants that took part in the treatment are simply compared to placebo, no treatment, or an alternative condition without randomization. Observational studies can be conducted as trials where participants are recruited via email or other means but are not randomized, or they can be conducted using data that is collected during routine work in hospitals or other medical centres. Studies without a control group can also be counted as observational studies, although they are less often used in systematic reviews that use GRADE assessment.

Observational studies suffer from a problem of confounding. If sleeping pills, on average, worked better compared to placebo pills, it can always be possible that it only worked because the comparison group were younger and the drug only works for young people, or because the treatment group had a placebo effect. Although there are many statistical methods to control for confounding, no method can fully account for unknown confounding factors – that may be the real reason for the difference between treatment and control group. Randomized studies are considered the gold standard as successful randomization takes out both known and unknown confounding factors. Interestingly, in economics, there exists methods that use natural random effects to control for confounding.

The GRADE criteria

In the GRADE system, the evidence is therefore initially set to either high (if included studies are randomized studies) or low (if they are observational studies). There are then 5 criteria that can be used to downgrade one, two, or in the case of indirectness, sometimes three steps. These are:

  1. Risk of bias in individual studies – e.g. methodological issues in included studies such as inadequate blinding (participants knew they were in control/treatment group)
  2. Inconsistency of results between studies
  3. Indirectness of evidence – e.g. participants were children although the systematic review was about adults
  4. Imprecision – results were not statistically significant, or the effect was clinically important once the studies were meta-analysed
  5. Publication bias – result was biased due to a file-drawering effect, as studies not showing a statistically significant effect are less likely to be published.

Additionally, observational studies starting at low can be upgraded based on 3 criteria: large effect, dose-response effect and “Effect of all plausible confounding factors would be to reduce the effect (where an effect is observed) or suggest a spurious effect (when no effect is observed)”. An example of the ‘dose-response effect’ refers to a finding that a larger dose of medicine leads to better treatment outcomes. The last criteria is complex but refers to situations where there is a bias (e.g. all doctors are told about a potential side effect) among clinicians to over-diagnose certain side effects but nevertheless no increased number of side effects is found in the studies.

If you were to look at a systematic review for chronic insomnia with a GRADING of moderate for sleep quality outcome for CBT-I, we would find out that there are many gold standard studies, randomized controlled trials conducted, that have compared CBT-I against control (placebo) condition. Therefore, a GRADING of moderate for sleeping pills would be achieved because the evidence was downgraded from high to moderate using one of the following 5 criteria for downgrading evidence: Risk of bias (in individual studies), Inconsistency, Imprecision, Indirectness or Publication bias.

On the other hand, when it comes to sleeping pills it might be possible in a hypothetical scenario (even though in this case that is not true) that instead of the gold standard RCT we would only have observational studies with no individual randomization. Therefore, the evidence would have started for sleeping pills from low, and as none of the 3 criteria for upgrading the evidence for observational studies were fulfilled (large effect, dose-response effect, or plausible confounding/bias would have led to over/underestimation of the effect) the evidence would stay low.

Summary of GRADE for systematic reviews

The table below from the GRADE handbookprovides a very useful summary of the 5 downgrading and 3 upgrading criteria:

GRADE and quality of evidence - Students 4 Best Evidence (2)

Schünemann, H., Brozek, J., & Oxman, A. (2013). GRADE handbook for grading quality of evidence and strength of recommendations. Updated October.

This blog aims to clarify the structure of GRADE the most advanced evidence hierarchy (or evidence base hierarchy) in evidence-based medicine (Blunt, 2015).

During the last 15 years, the outlined GRADE system has become a widely used global standard. It is used by guideline makers, such as the World Health Organization. As Blunt points out, the term global standard may be slightly overconfident, as many guideline makers who have started to use GRADE still use other systems at the same time. However, it is the single most prominent system available and is constantly developing and providing a platform for co-operation for different guideline makers (Centers for Disease Control and Prevention (CDC), 2011).

For example, in recent years, GRADE has become a mandatory element for newly published systematic reviews conducted by Cochrane, widely held as the most prominent evidence-based medicine organization. You can read more about Cochrane’s Strategy 2020 here.

Useful resources:

www.gradeworkinggroup.org: For visual clarification, the GRADE website provides videos and articles to get started.

Understanding GRADE: an introduction. Goldet, G & Howick J (2013)

A visual explanation of GRADE from the gdt GRADE pro website

References

Tags:

biasGRADERCTSystematic review

GRADE and quality of evidence - Students 4 Best Evidence (2024)

FAQs

How do you grade quality of evidence? ›

In the GRADE approach to quality of evidence, randomised trials without important limitations constitute high quality evidence. Observational studies without special strengths or important limitations constitute low quality evidence. Limitations or special strengths can, however, modify the quality of the evidence.

What does Level 4 evidence mean? ›

Level IV - Evidence from well-designed case-control and cohort studies. Level V - Evidence from systematic reviews of descriptive and qualitative studies. Level VI - Evidence from single descriptive or qualitative studies. Level VII - Evidence from the opinion of authorities and/or reports of expert committees.

How do you determine the best evidence? ›

Systematic Reviews and Meta Analyses

Well done systematic reviews, with or without an included meta-analysis, are generally considered to provide the best evidence for all question types as they are based on the findings of multiple studies that were identified in comprehensive, systematic literature searches.

What is the best level of evidence? ›

For example, systematic reviews are at the top of the pyramid, meaning they are both the highest level of evidence and the least common. As you go down the pyramid, the amount of evidence will increase as the quality of the evidence decreases.

What is evidence-based quality? ›

Evidence-based quality improvement (EBQI) is a multi-faceted implementation strategy that is based on a clinical-researcher partnership. It promotes organizational change by fostering innovation and the spread of those innovations that are successful.

What are the quality points for grading? ›

Quality points are determined by multiplying the number of course credits by the numerical value of the assigned grade. For example, a three-credit course completed with a grade of B- would carry 8.1 quality points (three credits multiplied by a 2.7 numerical grade equals 8.1 quality points for that course).

What are the 4 levels of evidence based practice? ›

Levels of Evidence (LoBiondo-Wood & Haber 2022)
LevelResearch Designs
1Systematic Review or meta-analysis of RCTs (randomized control trials)
2Randomized control trials
3Quasi-experimental Studies
4Non-experimental studies
3 more rows
Apr 2, 2024

What are the 4 pillars of evidence? ›

Rationale, aims and objectives: Four pillars of evidence underpin evidence-based behavioural practice: research evidence, practice evidence, patient evidence and contextual evidence.

What are the 5 levels of evidence? ›

Table 2
LevelType of evidence
IISmall RCTs with unclear results
IIICohort and case-control studies
IVHistorical cohort or case-control studies
VCase series, studies with no controls
1 more row

What is an example of best evidence? ›

[9] For example, if a plaintiff states in a deposition that he took a photograph prior to an accident showing damage to his vehicle, the opposing party can use the contents from that deposition against the plaintiff to prove that the photograph showed pre-existing damage.

What makes strong evidence? ›

Strong evidence is accurate, convincing, and relevant to the argument at hand. It comes from a credible source, and it truly supports the reason it is supposed to prove. Evaluating the strengths and weaknesses of arguments is an important skill to develop.

What is the most effective form of evidence? ›

Direct Evidence

The most powerful type of evidence, direct evidence requires no inference and directly proves the fact you are investigating. The evidence alone is the proof, if you believe the accounts.

What is the grade approach to evaluate level of evidence? ›

Thus, when using GRADE approach in a synthesis of the quality of evidence from experimental and observational studies, the levels of evidence are classified (for each outcome studied) on a four-level scale: very low, low, moderate or high.

What is evidence based grading? ›

Evidence-Based Grading (EBG) is used to provide students, parents and teachers with an understanding of how well a student understands the class content through a body of evidence. Each class has essential competencies (standards) and targets that are assessed using evidence that the student provides.

Top Articles
Latest Posts
Article information

Author: Stevie Stamm

Last Updated:

Views: 5714

Rating: 5 / 5 (80 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Stevie Stamm

Birthday: 1996-06-22

Address: Apt. 419 4200 Sipes Estate, East Delmerview, WY 05617

Phone: +342332224300

Job: Future Advertising Analyst

Hobby: Leather crafting, Puzzles, Leather crafting, scrapbook, Urban exploration, Cabaret, Skateboarding

Introduction: My name is Stevie Stamm, I am a colorful, sparkling, splendid, vast, open, hilarious, tender person who loves writing and wants to share my knowledge and understanding with you.