π Key Information Summary
- Evidence-based medicine (EBM) integrates the best available research evidence with clinical expertise and patient values to guide decision-making in Australian general practice.
- Sensitivity measures a test's ability to correctly identify patients WITH the disease (rule-out value); specificity measures a test's ability to correctly identify patients WITHOUT the disease (rule-in value).
- Positive predictive value (PPV) and negative predictive value (NPV) are heavily influenced by disease prevalence β a critical concept for primary care where most conditions have low prevalence.
- Quantitative research (RCTs, cohort, case-control, cross-sectional) generates numerical data and is ranked higher on evidence hierarchies; qualitative research (interviews, focus groups, thematic analysis) explores patient experience, context, and meaning.
- The NHMRC levels of evidence (IβIV) and the broader evidence pyramid guide Australian clinicians in grading the quality and applicability of research findings.
- Systematic reviews and meta-analyses sit at the top of the evidence hierarchy, synthesising all available primary studies on a clinical question.
- Randomised controlled trials (RCTs) are the gold standard for evaluating treatment efficacy, but observational studies remain essential for rare outcomes, long-term safety, and real-world effectiveness.
- Critical appraisal requires evaluating three domains: validity (are the results believable?), importance (are the results clinically meaningful?), and applicability (can I use this for my patient?).
- Use the PICO framework (Population, Intervention, Comparison, Outcome) to formulate answerable clinical questions and structure literature searches.
- Key statistical measures for GPs include relative risk (RR), odds ratio (OR), number needed to treat (NNT), number needed to harm (NNH), confidence intervals, and p-values.
- Selection bias, information bias, and confounding are the three major threats to study validity that every GP should recognise when reading research.
- The RACGP and ACRRM require GPs to demonstrate continuing professional development in evidence-based practice as part of their fellowship and CPD obligations.
- In the Australian setting, applying EBM must account for Aboriginal and Torres Strait Islander health priorities, rural/remote access constraints, and culturally safe care frameworks.
Introduction & Australian Context
Research methodology and evidence-based medicine (EBM) form the intellectual backbone of modern general practice. Australian GPs encounter an enormous breadth of clinical presentations β from acute minor illness to complex multimorbidity β and must navigate an ever-expanding body of literature to deliver safe, effective, and cost-conscious care. This article provides a foundational guide to the statistical concepts, study designs, evidence hierarchies, and critical appraisal skills that underpin evidence-based decision-making in Australian primary care.
The concept of EBM was formalised in the early 1990s by a group at McMaster University in Canada, led by David Sackett and Gordon Guyatt. Since then, it has become embedded in Australian medical education, specialist training, and continuing professional development (CPD). The Royal Australian College of General Practitioners (RACGP) mandates that GPs maintain skills in literature appraisal as part of the Fellowship (FRACGP) curriculum and the CPD programme. The Australian Commission on Safety and Quality in Health Care (ACSQHC) similarly emphasises evidence-based clinical governance through the National Safety and Quality Health Service (NSQHS) Standards.
In Australia, the National Health and Medical Research Council (NHMRC) has developed a nationally recognised hierarchy of evidence and grading system for clinical recommendations. This framework is used by guideline developers including the RACGP, Cancer Council Australia, the Australasian Society of Clinical Immunology and Allergy (ASCIA), and Kidney Health Australia to produce clinical practice guidelines relevant to primary care.
The Australian general practice research landscape is supported by organisations including the Primary Health Care Research and Information Service (PHCRIS), the Australian Primary Health Care Research Institute (APHCRI), and the NHMRC Centre for Research Excellence in Primary Health Care. Each year, Australian GPs contribute to and consume a substantial body of research, with the Australian Journal of General Practice (AJGP, formerly AFP) serving as the principal peer-reviewed journal for the profession.
Sensitivity, Specificity & Predictive Values
Understanding the performance characteristics of diagnostic tests is fundamental to clinical reasoning in primary care. GPs order hundreds of pathology and imaging tests each year, and the interpretation of results depends critically on four inter-related concepts: sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).
The 2 Γ 2 Contingency Table
All diagnostic test performance measures derive from the classic 2 Γ 2 table, which cross-tabulates test results against the true disease status (gold standard):
| Disease Present (D+) | Disease Absent (Dβ) | |
|---|---|---|
| Test Positive (T+) | True Positive (TP) | False Positive (FP) |
| Test Negative (Tβ) | False Negative (FN) | True Negative (TN) |
Sensitivity and Specificity
| Measure | Formula | Clinical Meaning | Mnemonic |
|---|---|---|---|
| Sensitivity (Sn) | TP Γ· (TP + FN) | Proportion of diseased patients who test positive. A highly sensitive test has few false negatives. | SnNOUT β Sensitive, Negative result rules OUT disease |
| Specificity (Sp) | TN Γ· (TN + FP) | Proportion of non-diseased patients who test negative. A highly specific test has few false positives. | SpPIN β Specific, Positive result rules IN disease |
Positive and Negative Predictive Values
| Measure | Formula | Clinical Meaning |
|---|---|---|
| Positive Predictive Value (PPV) | TP Γ· (TP + FP) | Probability that a patient with a POSITIVE test actually has the disease. |
| Negative Predictive Value (NPV) | TN Γ· (TN + FN) | Probability that a patient with a NEGATIVE test truly does not have the disease. |
Worked Example β Prostate-Specific Antigen (PSA) Screening
Consider a PSA test with 80% sensitivity and 60% specificity applied to two populations:
| Setting | Prevalence | PPV | Interpretation |
|---|---|---|---|
| Population screening (asymptomatic men) | ~3% | ~5.8% | For every 17 men with a positive PSA, only 1 has prostate cancer. |
| Urology clinic (high-risk referral) | ~30% | ~46% | Nearly half of positive results are true positives. |
Likelihood Ratios
Likelihood ratios (LRs) combine sensitivity and specificity into a single measure and are more clinically useful for individual patient assessment:
- Positive likelihood ratio (LR+): Sensitivity Γ· (1 β Specificity). An LR+ > 10 provides strong evidence to rule IN a diagnosis.
- Negative likelihood ratio (LRβ): (1 β Sensitivity) Γ· Specificity. An LRβ < 0.1 provides strong evidence to rule OUT a diagnosis.
- Using LRs in practice: Post-test odds = Pre-test odds Γ LR. This allows GPs to update the probability of disease after a test result, moving from pre-test probability to post-test probability using the Fagan nomogram or Bayesian calculations.
Receiver Operating Characteristic (ROC) Curves
An ROC curve plots sensitivity (y-axis) against (1 β specificity) (x-axis) across all possible cut-off values. The area under the ROC curve (AUC or C-statistic) summarises overall test performance: AUC = 0.5 is no better than chance; AUC = 1.0 is a perfect test. In clinical practice, an AUC > 0.8 is generally considered good and an AUC > 0.9 excellent.
Types of Research: Qualitative & Quantitative
Research methods in general practice encompass a broad spectrum of quantitative and qualitative approaches. Understanding the strengths, limitations, and appropriate applications of each design is essential for Australian GPs who both consume and generate primary care research.
Quantitative Research Designs
Quantitative research uses numerical data and statistical analysis to test hypotheses, estimate effect sizes, and establish associations or causation. The major designs are described below in descending order of evidentiary strength:
| Study Design | Description | Strengths | Limitations | Common Use in GP |
|---|---|---|---|---|
| Systematic Review & Meta-Analysis | Systematic identification, appraisal, and synthesis of all studies addressing a specific question. Meta-analysis pools results statistically. | Highest level of evidence; reduces random error; identifies consistency of findings | Dependent on quality of included studies ("garbage in, garbage out"); publication bias; clinical heterogeneity | Cochrane reviews of treatment effectiveness; screening programme evaluations |
| Randomised Controlled Trial (RCT) | Participants randomly allocated to intervention or control group. May be blinded (single or double). | Minimises confounding and selection bias; establishes causation; gold standard for efficacy | Expensive; ethical constraints; strict inclusion criteria may limit generalisability (external validity); often excludes multimorbid GP patients | Drug trials (e.g., SPRINT for BP targets); behavioural interventions; complex interventions |
| Cohort Study (Prospective or Retrospective) | Follows a group over time, comparing outcomes in exposed vs. unexposed groups. | Can study multiple outcomes; establishes temporality; suitable for rare exposures | Susceptible to confounding and attrition bias; expensive if prospective; long follow-up needed | Long-term medication safety (e.g., 45 and Up Study β large Australian cohort); risk factor identification |
| Case-Control Study | Identifies patients with a condition (cases) and compares past exposures to matched controls. | Efficient for rare diseases; relatively quick and inexpensive | Susceptible to recall bias and selection bias; cannot establish incidence; confounding difficult to fully control | Investigating risk factors for rare cancers; outbreak investigations |
| Cross-Sectional Study (Survey) | Measures exposure and outcome at a single point in time. | Quick; inexpensive; estimates prevalence; useful for health services planning | Cannot establish causation or temporality; susceptible to prevalence-incidence bias | National Health Survey (ABS); BEACH study (former GP activity data); disease prevalence estimates |
| Case Report / Case Series | Describes a single patient or small group with a particular condition or unusual presentation. | Generates hypotheses; useful for rare or novel findings | No control group; no statistical analysis; lowest level of analytic evidence | Reporting adverse drug reactions to TGA; novel clinical observations |
Qualitative Research
Qualitative research explores the why and how of health, illness, and healthcare β areas that quantitative methods cannot adequately address. It is increasingly valued in Australian primary care research for understanding patient experience, clinician decision-making, and the social determinants of health.
| Method | Description | Best Used For |
|---|---|---|
| In-depth Interviews | Semi-structured or unstructured one-on-one conversations exploring individual experiences and perspectives. | Understanding patient illness narratives; exploring sensitive topics (e.g., mental health, domestic violence) |
| Focus Groups | Group discussions (typically 6β10 participants) facilitated by a researcher to explore collective views. | Exploring community attitudes; piloting interventions; understanding group norms |
| Ethnography | Prolonged immersion in a clinical setting or community to observe behaviours and interactions in their natural context. | Understanding clinical practice cultures; Aboriginal and Torres Strait Islander health service delivery |
| Grounded Theory | Iterative data collection and analysis to develop an explanatory theory grounded in participant data. | Generating new theoretical frameworks; understanding processes (e.g., chronic disease self-management) |
| Thematic Analysis | Identifying, analysing, and reporting patterns (themes) within qualitative data. | Flexible approach applicable to most qualitative data; widely used in GP research |
| Participatory Action Research (PAR) | Collaborative research where community members are co-researchers who help design, implement, and evaluate changes. | Aboriginal and Torres Strait Islander health programmes; community health improvement |
Mixed-Methods Research
Mixed-methods research combines quantitative and qualitative approaches within a single study or programme of research. In Australian primary care, this is increasingly the preferred approach for complex interventions, health services evaluation, and implementation science. For example, a study evaluating a new chronic disease management programme might use a cluster RCT to measure clinical outcomes (quantitative) alongside semi-structured interviews with patients and GPs to understand barriers and enablers of implementation (qualitative).
Grey Literature and Real-World Evidence
- Grey literature: Government reports (AIHW, ABS), clinical registries, health department policy documents, conference abstracts, and theses. These sources are important in Australian primary care but are not peer-reviewed in the traditional sense.
- Real-world evidence (RWE): Data from electronic health records, PBS dispensing data, Medicare Benefits Schedule (MBS) claims, and disease registries (e.g., the Australian Orthopaedic Association National Joint Replacement Registry). RWE complements RCT data by capturing outcomes in diverse, real-world populations.
- Practice-based research networks (PBRNs): Australian examples include the NPS MedicineWise General Practice Analysis Team and the BEACH successor studies, which generate data directly from GP consultations.
Evidence-Based Medicine & Levels of Evidence
The Five Steps of EBM
The practice of EBM involves five interconnected steps, originally described by Sackett and colleagues:
The PICO Framework
| Element | Description | Example |
|---|---|---|
| P β Population / Patient | Who is the patient or population of interest? | Adults aged β₯ 65 years in residential aged care |
| I β Intervention / Exposure | What intervention, treatment, or exposure are you considering? | Annual influenza vaccination |
| C β Comparison / Control | What is the alternative (placebo, no treatment, standard care)? | No influenza vaccination |
| O β Outcome | What outcome are you measuring? | Hospitalisation for influenza-related pneumonia; all-cause mortality |
NHMRC Levels of Evidence
The Australian NHMRC has developed a nationally standardised hierarchy of evidence used by guideline developers across Australia. This framework classifies evidence by study design:
| Level | Study Design | Description |
|---|---|---|
| Level I | Systematic review of RCTs | Cochrane review or equivalent; highest level of evidence for treatment questions |
| Level II | Randomised controlled trial | Well-designed RCT with adequate power, allocation concealment, and follow-up |
| Level III-1 | Pseudo-randomised controlled trial | Quasi-experimental design with non-random allocation (e.g., alternation) |
| Level III-2 | Comparative study with concurrent controls | Cohort study, case-control study, or interrupted time series with a control group |
| Level III-3 | Comparative study without concurrent controls | Historical control study, interrupted time series without a parallel control group |
| Level IV | Case series (with or without intervention), cross-sectional studies | Descriptive studies, post-test only, pre-test/post-test without control group |
NHMRC Grades of Recommendation
| Grade | Meaning | Translation to Practice |
|---|---|---|
| A β Body of evidence can be trusted | Consistent findings from multiple well-designed studies; directly applicable to the Australian context | Recommend with confidence |
| B β Body of evidence can be trusted with caution | Good-quality evidence but limited in quantity, applicability, or consistency | Recommend with some caution |
| C β Body of evidence provides some support | Evidence is limited in quality, quantity, or consistency | Consider carefully; may need to individualise |
| D β Body of evidence is weak | Any evidence is of very low quality or absent | Apply with caution; expert opinion may be the primary guide |
The Evidence Pyramid
The "evidence pyramid" is a widely used visual representation of the hierarchy of evidence, with study types arranged from the broadest base (weakest) to the narrowest apex (strongest):
Key Statistical Concepts for GPs
| Concept | Definition | Clinical Interpretation |
|---|---|---|
| p-value | Probability of obtaining results at least as extreme as observed, assuming the null hypothesis is true | p < 0.05 is conventionally "statistically significant" but does NOT mean clinically important. A tiny, clinically irrelevant difference can be "significant" in a large study. |
| Confidence Interval (CI) | Range within which the true population parameter is expected to lie (usually 95% CI) | More informative than p-values. If the 95% CI crosses 1.0 (for ratios) or 0 (for differences), the result is not statistically significant. Wide CIs indicate imprecision (small sample size). |
| Relative Risk (RR) | Risk of outcome in intervention group Γ· risk in control group | RR = 0.75 means a 25% relative risk reduction. But the absolute benefit depends on baseline risk β an important distinction for shared decision-making. |
| Odds Ratio (OR) | Odds of outcome in exposed Γ· odds in unexposed | Common in case-control studies. For rare outcomes, OR β RR. For common outcomes, OR overestimates the RR. |
| Absolute Risk Reduction (ARR) | Risk in control group β risk in intervention group | The actual reduction in risk. More clinically meaningful than relative risk for individual patient discussions. |
| Number Needed to Treat (NNT) | 1 Γ· ARR | The number of patients who need to receive the intervention for one additional patient to benefit. Lower NNT = more effective treatment. |
| Number Needed to Harm (NNH) | 1 Γ· Absolute Risk Increase (ARI) | The number of patients treated for one additional patient to experience an adverse event. Higher NNHT = safer treatment. |
Critical Appraisal of Research
Critical appraisal is the systematic process of evaluating research evidence for its trustworthiness, value, and relevance. For Australian GPs, who are expected to manage uncertainty and apply the best available evidence across a vast clinical scope, critical appraisal is not an academic exercise β it is a core clinical skill.
The Three Fundamental Questions
Every critical appraisal should address three domains, as articulated in the JAMA Users' Guides series and the CASP (Critical Appraisal Skills Programme) checklists:
Types of Bias
Bias is a systematic error that distorts study findings. Recognising bias is perhaps the single most important critical appraisal skill. The three broad categories are:
Selection Bias
Occurs when the way participants are selected or allocated leads to systematic differences between groups. Examples include:
- Allocation bias: Non-random assignment to treatment groups (e.g., sicker patients getting the intervention).
- Volunteer bias: Healthier, more motivated individuals self-selecting into studies.
- Berkson's bias: Hospital-based case-control studies over-representing conditions associated with hospitalisation.
- Attrition bias: Differential loss to follow-up between groups (e.g., patients with side effects dropping out of the treatment arm).
Minimised by: Randomisation, allocation concealment, intention-to-treat analysis, and high follow-up rates (>80%).
Information Bias (Measurement Bias)
Occurs when data on exposure or outcome are measured inaccurately. Examples include:
- Recall bias: Cases (e.g., mothers of children with birth defects) recall exposures more completely than controls.
- Interviewer bias: The researcher's expectations influence data collection.
- Measurement bias: Inconsistent or inaccurate measurement instruments.
Minimised by: Blinding (double-blind preferred), validated measurement instruments, standardised data collection protocols, and objective outcome measures.
Confounding
A confounder is a variable that is associated with both the exposure and the outcome, but is not on the causal pathway. Classic examples include:
- Smoking confounds the relationship between coffee drinking and lung cancer.
- Socioeconomic status confounds the relationship between diet and cardiovascular disease.
- Age confounds almost everything in medicine.
Controlled by: Randomisation (RCTs), restriction, matching, stratified analysis, and multivariable regression (observational studies).
CASP Checklists
The Critical Appraisal Skills Programme (CASP) provides free, structured checklists for appraising different study types. These are widely used in Australian GP training:
| Checklist | Key Questions |
|---|---|
| CASP RCT Checklist | Was assignment randomised? Was allocation concealed? Were groups similar at baseline? Were patients, clinicians, and outcome assessors blinded? Was follow-up complete? Was intention-to-treat analysis used? |
| CASP Cohort Checklist | Was the cohort recruited appropriately? Was exposure measured accurately? Were confounders controlled? Was follow-up long and complete enough? |
| CASP Case-Control Checklist | Were cases and controls selected appropriately? Were exposures measured similarly in both groups? Were confounders addressed? |
| CASP Qualitative Checklist | Was there a clear research question? Was qualitative methodology appropriate? Was the research design appropriate? Was recruitment strategy appropriate to the aims? Was data saturation achieved? |
| CASP Systematic Review Checklist | Did the review address a focused question? Were appropriate criteria used to select articles? Was the quality of included studies assessed? Were the results combined and how? How precise are the results? |
Appraising Specific Study Types
Appraising an RCT
- Randomisation: Was the allocation sequence truly random (computer-generated, random number tables)? Was allocation concealment maintained (sealed opaque envelopes, central randomisation)?
- Blinding: Were patients, care providers, outcome assessors, and data analysts blinded? Double-blinding (patient + clinician) is the minimum standard for drug trials.
- Intention-to-treat (ITT) analysis: Were all randomised patients analysed in the group to which they were assigned, regardless of compliance? Per-protocol analysis alone introduces bias.
- Follow-up: Was loss to follow-up < 20%? Was it balanced between groups?
- Baseline comparability: Were groups similar at baseline? Any imbalances suggest failure of randomisation.
- External validity: Were inclusion/exclusion criteria so strict that the study population doesn't resemble your patients?
Appraising a Diagnostic Accuracy Study
- Was there an independent, blinded comparison with a valid reference ("gold") standard?
- Did the study population include a clinically relevant spectrum of disease (not just severe cases vs. healthy controls)?
- Were sensitivity, specificity, likelihood ratios, and predictive values reported with confidence intervals?
- Was the test evaluated in a population similar to your own (pre-test probability)?
Appraising a Systematic Review
- Was the research question clearly defined using PICO?
- Was the search strategy comprehensive (multiple databases, grey literature, hand-searching reference lists)?
- Was the quality of included studies assessed independently by two reviewers?
- Was there significant heterogeneity between studies? If so, was a random-effects model used, and were subgroup analyses or meta-regression performed?
- Was publication bias assessed (funnel plots, Egger's test)?
Clinical vs. Statistical Significance
Practical Tips for Time-Poor GPs
- Start with the abstract and conclusions, then go to methods: If the methods are flawed, the conclusions are untrustworthy regardless of how positive they sound.
- Read the "Limitations" section: Authors are required to disclose study limitations. If they don't, be suspicious.
- Check the funding source and conflicts of interest: Industry-funded studies are more likely to report favourable outcomes for the sponsor's product.
- Use pre-appraised resources when available: Cochrane Clinical Answers, BMJ Best Practice, NPS MedicineWise RADAR, and the RACGP's AJGP "Clinical Practice" summaries.
- Apply the "back of the envelope" test: Can you explain the key finding to a patient in one sentence? If not, the evidence may not be mature enough to change practice.
- Discuss with colleagues: Journal clubs, peer review groups, and RACGP local groups provide invaluable critical appraisal support.
Applying Evidence in Australian General Practice
The final β and arguably most important β step in EBM is the translation of research findings into clinical decisions that benefit individual patients. In Australian general practice, this involves navigating several contextual factors:
Patient-Centred Care & Shared Decision-Making
EBM explicitly includes patient values and preferences as one of its three pillars. Shared decision-making involves presenting evidence in an accessible format, exploring the patient's goals and concerns, and arriving at a management plan that respects both the evidence and the patient's autonomy. Decision aids (available from organisations such as the Wiser Healthcare collaboration and the Consumers Health Forum of Australia) can support this process.
Australian Regulatory & Funding Context
- Pharmaceutical Benefits Scheme (PBS): Not all evidence-based treatments are PBS-listed. GPs must consider cost to the patient and whether authority or streamlined authority is required.
- Medicare Benefits Schedule (MBS): Evidence-based investigations and procedures must align with MBS item descriptors for Medicare rebates to apply.
- Therapeutic Goods Administration (TGA): Medications must be TGA-registered for their indication; off-label use (supported by evidence but not TGA-approved for that indication) requires careful documentation and informed consent.
- National Immunisation Program (NIP): Immunisation recommendations are informed by ATAGI (Australian Technical Advisory Group on Immunisation) and reflect the best available evidence adapted to Australian epidemiology.
Implementing Evidence-Based Guidelines
Clinical practice guidelines (CPGs) synthesise evidence into actionable recommendations. In Australia, major guideline developers include the RACGP (Red Book, Green Book, Clinical Guidelines), Cancer Council Australia, ASCIA, Kidney Health Australia, Lung Foundation Australia, and the National Heart Foundation of Australia. GPs should critically evaluate guidelines using the AGREE II (Appraisal of Guidelines for Research and Evaluation) instrument, considering:
- Rigour of development β Was systematic evidence review conducted? Was the strength of evidence graded?
- Applicability to Australian primary care β Were local epidemiology, healthcare system factors, and resource constraints considered?
- Conflicts of interest β Were guideline panel members free from pharmaceutical industry conflicts?
- Currency β When was the guideline last updated? Evidence changes rapidly.
When Evidence Is Insufficient
Audit & Quality Improvement
The NSQHS Standards (ACSQHC) require health service organisations to use evidence to drive continuous quality improvement. GPs can apply evidence-practice gap analysis through clinical audit, PDSA (Plan-Do-Study-Act) cycles, and practice-based quality improvement activities. The RACGP's Practice Accreditation standards (Standards for General Practices, 5th edition) mandate systematic approaches to evidence-based care delivery.
Special Populations & Research Equity
Certain populations are systematically underrepresented in clinical research, yet they constitute a significant proportion of Australian GP consultations. GPs must be aware of how evidence limitations affect care for these groups.
Paediatrics
Pregnancy & Lactation
Older Australians
Renal Impairment
Immunocompromised Patients
Rural & Remote Populations
Aboriginal and Torres Strait Islander Health Considerations
Aboriginal and Torres Strait Islander Australians experience a disproportionate burden of chronic disease, lower life expectancy (approximately 8 years less than non-Indigenous Australians), and significant barriers to healthcare access. Evidence-based practice in this context requires particular attention to research equity, cultural safety, and the application of evidence within a framework of self-determination and community control.
Applying EBM with Aboriginal and Torres Strait Islander Patients
Key Australian Resources for Indigenous Health Evidence
- NHMRC guidelines on ethical conduct in Aboriginal and Torres Strait Islander health research (2018)
- RHDAustralia β Clinical guidelines for rheumatic fever and rheumatic heart disease
- NACCHO β National policy, workforce development, and clinical resources
- Lowitja Institute β Australia's National Institute for Aboriginal and Torres Strait Islander Health Research
- AIHW Indigenous health reports β Epidemiological data and performance monitoring
- RACGP Specific Interests β Aboriginal and Torres Strait Islander Health
π References
- 1. Sackett DL, Rosenberg WMC, Gray JAM, Haynes RB, Richardson WS. Evidence based medicine: what it is and what it isn't. BMJ. 1996;312(7023):71β72.
- 2. Straus SE, Glasziou P, Richardson WS, Haynes RB. Evidence-Based Medicine: How to Practice and Teach EBM. 5th ed. Edinburgh: Elsevier; 2018.
- 3. National Health and Medical Research Council (NHMRC). NHMRC additional levels of evidence and grades for recommendations for developers of guidelines. Canberra: NHMRC; 2009.
- 4. Greenhalgh T. How to Read a Paper: The Basics of Evidence-Based Medicine. 6th ed. Chichester: Wiley-Blackwell; 2019.
- 5. Grimes DA, Schulz KF. An overview of clinical research: the lay of the land. Lancet. 2002;359(9300):57β61.
- 6. Critical Appraisal Skills Programme (CASP). CASP Checklists. Oxford: CASP; 2018. Available at: casp-uk.net.
- 7. Guyatt GH, Rennie D, Meade MO, Cook DJ. Users' Guides to the Medical Literature: A Manual for Evidence-Based Clinical Practice. 3rd ed. New York: McGraw-Hill Education; 2015.
- 8. Royal Australian College of General Practitioners (RACGP). Standards for General Practices. 5th ed. Melbourne: RACGP; 2020.
- 9. Australian Institute of Health and Welfare (AIHW). Aboriginal and Torres Strait Islander Health Performance Framework. Canberra: AIHW; 2023.
- 10. National Health and Medical Research Council (NHMRC). Keeping Research on Track II: A companion document to the NHMRC ethical guidelines for research involving Aboriginal and Torres Strait Islander peoples. Canberra: NHMRC; 2018.
- 11. Higgins JPT, Thomas J, Chandler J, et al. Cochrane Handbook for Systematic Reviews of Interventions. Version 6.4. London: Cochrane; 2023.
- 12. Brouwers MC, Kho ME, Browman GP, et al. AGREE II: advancing guideline development, reporting and evaluation in health care. CMAJ. 2010;182(18):E839βE842.
- 13. O'Connell NE, Wand BM, McAuley J, Marston L, Moseley GL. Intensive strength and balance training for older adults with chronic low back pain (RESOLVE trial): protocol for a randomised controlled trial. BMC Musculoskelet Disord. 2023;24:518.
- 14. Australian Commission on Safety and Quality in Health Care (ACSQHC). National Safety and Quality Health Service Standards. 2nd ed. Sydney: ACSQHC; 2021.
- 15. Burchett H, Umoquit M, Dobrow M. How do we know when research from one setting can be useful in another? A review of external validity, applicability and transferability frameworks. J Health Serv Res Policy. 2011;16(4):238β244.