Published on 16/11/2025
Root Cause Analysis for Clinical Quality: Practical Methods That Stand Up to Inspection
Why Root Cause Analysis Matters in Clinical Research: Risks, Regulators, and Real-World Stakes
Root Cause Analysis (RCA) is how clinical teams move from symptoms (“a deviation occurred”) to mechanisms (“what in our design, process, or system allowed this to happen?”). RCA is not a paperwork ritual; it is a safety and data-integrity safeguard that converts Good Clinical Practice (GCP) principles into durable improvements. Global authorities—from the International Council for Harmonisation (ICH) to national regulators such as the FDA, EMA, PMDA, and TGA—expect deviations to be investigated to root cause and addressed through effective corrective and preventive action, not merely documented.
Clinical context raises the bar. Unlike manufacturing, where defects can be scrapped, errors in trials can affect human participants and the credibility of decision-critical endpoints. RCA must therefore prioritize critical-to-quality (CtQ) factors: valid consent; accurate eligibility; on-time, correct primary endpoints; investigational product (IP)/device integrity (including temperature control and blinding); safety reporting clocks; and traceable data lineage across third parties (labs, imaging, eCOA/wearables, IRT). When an event touches a CtQ factor, your analysis and corrective action must be proportionally rigorous.
Avoid the “human error” trap. “Human error” is a starting point for inquiry, not an end state. Inspectors ask what made the error possible: ambiguous SOPs, unrealistic visit windows, capacity constraints (no weekend imaging), weak time-zone handling, firmware/app version drift, or vendor configurations that allow invalid inputs. RCA must probe the system—not just the person—so that corrective and preventive actions (CAPA) change conditions, not just training slides.
Evidence makes RCA persuasive. The story must be reconstructable from records: audit trails (who/what/when/why; prior/new values; local time + UTC offset), certified copies that preserve units and effective dates, device/software versions, courier logger PDFs, DICOM acquisition parameters, ticketing transcripts, and governance minutes. This aligns with ALCOA++ expectations (Attributable, Legible, Contemporaneous, Original, Accurate; plus Complete, Consistent, Enduring, Available, and Traceable) and convinces reviewers that conclusions are evidence-based.
Blinding and privacy are constraints, not afterthoughts. RCA cannot leak treatment assignment or expose unnecessary personal data. Keep randomization keys and kit mappings behind firewalls and use arm-agnostic language in the main file. Align access and disclosures with HIPAA (U.S.) and GDPR/UK-GDPR (EU/UK). If unblinding is medically required, follow pre-approved scripts and document who, when, why, and the analysis impact.
Where RCA is most often needed in trials.
- Consent integrity (wrong version, incorrect timing, missing pages).
- Eligibility accuracy (criterion misapplied, unit conversion errors, absent evidence).
- Endpoint timing (missed windows; heaping at edges; tele-visit failures).
- IP/device integrity (temperature excursions; reconciliation gaps; arm-revealing packaging).
- Data integrity (audit trail gaps; algorithm/version drift; mapping errors across EDC↔LIMS/imaging/eCOA).
- Privacy/security (overexposed PHI during remote reviews; cross-border transfer oversights).
Outcome to aim for. A well-run RCA yields a precise problem statement, an evidence-backed mechanism of failure, and a CAPA package with owners, deadlines, and measurable effectiveness checks tied to CtQ outcomes—e.g., primary endpoint on-time ≥95% sustained, “0 use of superseded consent,” 100% audit-trail retrieval success for sampled systems.
Choosing the Right Tools: 5 Whys, Fishbone, Fault Tree, and Friends
5 Whys is a fast, hypothesis-generating method that repeatedly asks “Why?” until the mechanism is exposed. It is ideal for straightforward single-chain failures (e.g., use of a superseded consent). Example: Why was the wrong version used? Old paper stock was on hand. Why was it still available? No withdrawal process. Why no process? An SOP gap. Why the gap? No owner for consent inventory. Why no owner? RACI not defined. The likely corrective action is not “retrain,” but stock control, assigned ownership, and eConsent hard-stops.
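A chain like this can even be kept as structured data so the candidate systemic cause stays explicit in the case file. A minimal sketch, using the consent-version example; every string below is illustrative, not a real SOP reference:

```python
# A 5 Whys chain as a list of (question, answer) pairs — illustrative only.
whys = [
    ("Why was the wrong consent version used?", "Old paper stock was on hand"),
    ("Why was old stock still available?", "No withdrawal process existed"),
    ("Why was there no withdrawal process?", "SOP gap"),
    ("Why the SOP gap?", "No owner for consent inventory"),
    ("Why no owner?", "RACI not defined"),
]

def root_cause(chain):
    """The last answer in the chain is the candidate systemic cause."""
    return chain[-1][1]

for question, answer in whys:
    print(f"{question} -> {answer}")
print("Candidate root cause:", root_cause(whys))
```

Keeping the chain as data (rather than free text) makes it easy to audit that the analysis stopped at a system condition, not at a person.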
Fishbone (Ishikawa) diagrams help when multiple causes interplay. Typical “bones” include Methods (SOPs, windows), Machines (EDC/eCOA/IRT, scanners, firmware), Materials (labels, kits, reference ranges), Manpower (competency, staffing, rater drift), Measurement (units, calibration), and Environment (clinic hours, heatwaves). Fishbone is excellent for endpoint timing issues where capacity constraints, scheduling logic, travel support, and reminder cadence all contribute.
Fault Tree Analysis (FTA) is suited for safety-critical logic where combinations of events cause failure (AND/OR gates). For example, “Participant dosed in error” may require both an eligibility gate failure and an IRT configuration gap. FTA clarifies which barriers must simultaneously fail and points to systemic fixes (eligibility sign-off before activation, configuration hard-stops).
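The AND/OR gate logic can be expressed directly in code to check which barrier combinations allow the top event. A minimal sketch of the “participant dosed in error” example; the gate structure and event names are illustrative assumptions, not a validated model of any real IRT configuration:

```python
# Minimal fault-tree evaluation with AND/OR gates over basic events.
from typing import Callable, Dict

Gate = Callable[[Dict[str, bool]], bool]

def AND(*children: Gate) -> Gate:
    # Top event fires only if every child condition holds.
    return lambda events: all(c(events) for c in children)

def OR(*children: Gate) -> Gate:
    # Any one child condition is enough.
    return lambda events: any(c(events) for c in children)

def basic(name: str) -> Gate:
    # A basic event; absent keys default to "did not occur".
    return lambda events: events.get(name, False)

# "Dosed in error" requires BOTH barriers to fail (AND), where the
# eligibility barrier can fail in either of two ways (OR).
dosed_in_error = AND(
    OR(basic("eligibility_gate_bypassed"), basic("pi_signoff_missing")),
    basic("irt_config_gap"),
)

# One failed barrier alone is not enough:
print(dosed_in_error({"pi_signoff_missing": True, "irt_config_gap": False}))  # False
# Both barriers failing lets the top event occur:
print(dosed_in_error({"pi_signoff_missing": True, "irt_config_gap": True}))   # True
```

Walking the tree this way makes visible which single fixes (e.g., an IRT hard-stop) break every path to the top event.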
Barrier Analysis maps preventive and detective barriers along a process timeline (e.g., consent → screening → IRT activation → dosing). It asks, “What barrier should have stopped this, and why didn’t it?” Pair with Swiss cheese thinking to visualize holes aligning (staffing shortage + scanner downtime + no weekend slots → endpoint misses).
Change Analysis compares “when it worked” vs “when it failed.” This is powerful for sudden KRI shifts (e.g., diary adherence drops after an app update; imaging failures after parameter template revision; courier excursions after summer schedule changes). Tie changes to release notes, firmware versions, parameter locks, and time stamps.
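A before/after comparison can be as simple as splitting a metric at the release time stamp. A minimal sketch with invented adherence records around a hypothetical app go-live date:

```python
# Change analysis: diary adherence before vs after an app update.
# All dates and outcomes below are invented illustrations.
from datetime import date

release = date(2025, 4, 1)  # hypothetical go-live from the release notes

# (visit_date, diary_completed) records
records = [
    (date(2025, 3, 20), True), (date(2025, 3, 25), True),
    (date(2025, 3, 28), True), (date(2025, 4, 3), False),
    (date(2025, 4, 7), True),  (date(2025, 4, 10), False),
]

def adherence(rows):
    return sum(1 for _, ok in rows if ok) / len(rows)

before = [r for r in records if r[0] < release]
after = [r for r in records if r[0] >= release]
print(f"before: {adherence(before):.0%}, after: {adherence(after):.0%}")
```

The point is the split on the documented change time stamp: if the drop coincides with the release, the change record becomes primary evidence in the RCA.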
Human Factors methods examine workload, usability, and environment. They avoid blaming individuals by focusing on interface design, cognitive load, alarm fatigue, unreadable labels, and interruptions. In decentralized trials, consider participant burden (connectivity, device charging, screen readability, caregiver support) as potential root causes for missing endpoints.
Pareto Analysis (80/20) prioritizes the few causes producing most deviations (e.g., three eligibility criteria generate 70% of misclassifications). Focus CAPA where it moves CtQ outcomes most.
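The 80/20 ranking falls out of a simple tally. A sketch with invented deviation counts, keeping the smallest set of causes that covers roughly 80% of events:

```python
# Pareto analysis: find the "vital few" causes. Counts are illustrative.
cause_counts = {
    "inclusion criterion misread": 35,
    "unit conversion (mg/dL vs mmol/L)": 22,
    "missing lab evidence": 13,
    "visit window miscalculated": 6,
    "other": 4,
}

def pareto_vital_few(counts, threshold=0.80):
    total = sum(counts.values())
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    vital, cumulative = [], 0
    for cause, n in ranked:
        vital.append(cause)
        cumulative += n
        if cumulative / total >= threshold:
            break
    return vital

print(pareto_vital_few(cause_counts))
```

Here three of five causes cross the 80% line, so CAPA effort concentrates there first.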
Data sources for every tool. Regardless of method, RCAs should be fueled by: audit trails across systems (with local time + UTC offset), scheduler exports, logger PDFs (shipping/storage), DICOM parameter reports and phantom checks, LIMS accession-to-result timelines, eCOA adherence and “time-last-synced,” IRT configuration snapshots, and access-control changes. Without these, conclusions are opinions, not evidence.
Tool selection matrix (practical heuristics).
- Single-chain, obvious symptom → 5 Whys.
- Many plausible contributors → Fishbone + Pareto.
- Logical combinations/barrier failures → FTA + Barrier Analysis.
- Sudden performance change → Change Analysis.
- Usability/cognitive issues → Human Factors review.
Keep blinding protected while analyzing. If logs or tickets could reveal treatment, use blinded-safe aliases in the main RCA pack and keep unblinded keys in a restricted repository. Document who accessed unblinded data, when, and why.
An Inspection-Ready RCA Workflow: From Signal to Evidence-Backed Conclusions
1) Start with containment and clarity. Before analysis, stabilize the clinical situation: pause at-risk procedures, re-consent, quarantine product, reschedule endpoints within window, or initiate privacy containment. Record event time and awareness time with local time and UTC offset. Open the case in the log and assign a case manager.
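Recording the local clock reading together with its UTC offset keeps time stamps unambiguous across sites. A small sketch using Python's standard datetime module; the times and offset below are hypothetical:

```python
# Record event and awareness times with an explicit UTC offset.
from datetime import datetime, timezone, timedelta

site_tz = timezone(timedelta(hours=-5))  # hypothetical site offset (UTC-05:00)

event_time = datetime(2025, 3, 15, 14, 30, tzinfo=site_tz)
awareness_time = datetime(2025, 3, 16, 9, 5, tzinfo=site_tz)

# ISO 8601 keeps the local reading AND the offset in one string, so the
# record stays unambiguous across time zones and DST transitions.
print(event_time.isoformat())  # 2025-03-15T14:30:00-05:00
# The same instant can be re-expressed in UTC for cross-system comparison:
print(awareness_time.astimezone(timezone.utc).isoformat())
```

Storing the offset (rather than a bare local time) is what later allows audit trails from different systems to be lined up on one timeline.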
2) Frame the problem precisely. Write a single-sentence statement that names the CtQ factor, affected sites/participants, scale, and time window. Example: “Out-of-window tumor assessments rose from 4% to 12% at Sites 103/106 between 15 March–30 April; heaping on last day observed; weekend imaging unavailable.” Avoid vague phrasing; ambiguity fuels weak CAPA.
3) Build a timeline with sources. Combine scheduler exports, eCOA prompts, IRT transactions, courier scans, and site notes to reconstruct what happened. Screenshots and certified copies should preserve metadata (units, effective dates, time zones, device/software versions). For wearables, include “time-last-synced.”
4) Gather the right evidence once. Pull audit trails from EDC, eSource/EMR interfaces, eCOA, IRT, imaging portals, LIMS, and safety databases. Export point-in-time datasets where available (e.g., configuration at failure date). Capture scanner make/model, parameter templates, phantom logs; temperature logger PDFs with unique IDs; LIMS accession numbers with collection and result times; and help-desk transcripts. File certified copies in TMF/ISF.
5) Choose and apply the method. For endpoint timing: Fishbone + Barrier Analysis to surface capacity, reminders, travel support, and parameter locks; Pareto to prioritize the biggest contributors. For consent version drift: 5 Whys + Change Analysis (new amendment; old stock not withdrawn; eConsent not enabled for this site). For temperature excursions: FTA (packout, courier lane, weather, logger) + Barrier Analysis to identify missing or failed controls.
6) Validate hypotheses with data. Don’t stop at plausible stories. Confirm with quantitative checks: proportion of visits on weekend; help-desk response times; logger alarm rates per lane; version adoption curves post-release; time from amendment approval to re-consent; unit/reference-range changes versus mapping dates. Engage statistics when an estimand or analysis set could be impacted.
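One such quantitative check, sketched with invented dates, is the share of missed-window visits whose target date fell on a weekend (testing the no-weekend-imaging hypothesis):

```python
# Quantify the weekend-capacity hypothesis. Dates are illustrative only.
from datetime import date

missed_visit_targets = [
    date(2025, 4, 5), date(2025, 4, 6), date(2025, 4, 12),
    date(2025, 4, 15), date(2025, 4, 19), date(2025, 4, 26),
]

# weekday(): Monday=0 ... Saturday=5, Sunday=6
weekend = [d for d in missed_visit_targets if d.weekday() >= 5]
share = len(weekend) / len(missed_visit_targets)
print(f"{share:.0%} of missed targets fell on weekends")
```

If most missed targets cluster on weekends, the capacity hypothesis is supported; if they spread evenly, keep looking.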
7) Protect blinding and privacy throughout. Segregate unblinded materials (kit mappings, randomization lists, pharmacy notes) from the main pack. Use arm-agnostic language in all communications that enter the blinded file. If unblinding occurred for medical need, document justification, timing, and analysis consequences.
8) Decide on regulatory/ethics notifications. Check your jurisdictional matrix: does the event meet “serious breach,” device vigilance, or privacy-breach criteria? Identify the responsible sender and the reporting clock. Align content with the expectations of the relevant authorities (e.g., FDA, EMA, PMDA, TGA) and ethics bodies, informed by WHO guidance.
9) Conclude with mechanisms, not labels. A defensible RCA states the mechanism (“Eligibility gate bypassed because IRT was configured to accept randomization without PI sign-off after a role change”) and evidence (configuration snapshot; access log; audit trails), not just “staff forgot.” Mechanisms point directly to system changes (gates, capacity, version locks) that will be verified later.
10) Record decisions and quality evidence in the TMF/ISF. The case dossier should contain: problem statement; timeline; evidence library; method artifacts (fishbone, FTA); conclusions; risk assessment to rights/safety and endpoints; notification records; and the CAPA package (corrections, corrective/preventive actions, owners, due dates, and effectiveness checks with metrics and observation windows). Include meeting minutes that show governance decisions, owners, deadlines, and rationale.
Illustrative mini-cases (abbreviated).
- Consent version drift: 5 Whys reveals lack of stock control and missing eConsent at one site. Actions: destroy old stock; enable eConsent with hard-stops; add pre-randomization consent check; measure “0 use of superseded forms” (QTL).
- Endpoint imaging heaping: Fishbone shows absence of weekend slots, travel reimbursement delays, and late reminder cadence. Actions: add weekend capacity; revise reminders; pre-book scans; track ≥95% on-time and <10% last-day concentration for 8 weeks.
- Temperature excursions: FTA points to courier lane through a hot hub + packout not validated for summer. Actions: re-qualify lanes; add data-loggers; change dispatch cut-offs; excursion ≤1 per 100 storage/shipping days with 100% scientific disposition files.
- Privacy incident: Barrier analysis shows remote EMR view exceeded minimum-necessary. Actions: redaction workflows; certified copies for monitors; access profile review; notifications within HIPAA/GDPR clocks; zero repeat incidents in 90 days.
From RCA to Lasting Change: CAPA, Metrics, and Governance That Stick
Design CAPA from the mechanism up. Each action should reflect the specific failure path. For capacity issues, add resources and scheduling rules; for configuration gaps, add system gates and version locks; for mapping/unit problems, lock units and file reference-range effective dates; for privacy risks, restrict views and require certified-copy workflows; for time-zone errors, mandate local time + UTC offset and device sync.
Structure the CAPA package.
- Corrections (immediate): re-consent, reschedule within window, quarantine product, issue corrected safety reports.
- Corrective actions (remove the cause): eConsent hard-stops; PI sign-off gate in IRT; weekend imaging; route redesign for couriers; parameter locks in scanners; change help-desk SLAs.
- Preventive actions (reduce recurrence): SOP updates; role-based access changes and same-day deactivation; monitoring KRI thresholds; stress tests (table-top exercises for outages, heatwaves, time changes).
Make success measurable. Define objective effectiveness checks, data sources, and observation windows tied to CtQ outcomes:
- Consent integrity: “0 use of superseded forms” (QTL); ≥98% comprehension check completion; re-consent cycle time ≤10 business days.
- Eligibility precision: ≤2% misclassification; 0 ineligible randomized; PI sign-off documented pre-randomization for 100% of cases in audit sample.
- Endpoint timing: ≥95% on-time; <10% visits on final window day; device “time-last-synced” recorded; time-zone fields complete.
- IP/device integrity: excursions ≤1 per 100 storage/shipping days; 100% scientific disposition documentation; reconciliation discrepancies closed ≤1 business day.
- Auditability: 100% audit-trail retrieval success for sampled systems (without vendor engineering help); point-in-time exports available.
- Privacy/security: containment <24 h; legal notifications within clocks; zero repeat remote-access scope violations in 90 days.
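Checks like these can be evaluated automatically against their thresholds so CAPA closure is a computation, not an opinion. A minimal sketch; the observed values below are invented illustrations, and the thresholds mirror the list above:

```python
# Evaluate effectiveness checks against CtQ thresholds. Observed values
# are illustrative; thresholds follow the metrics listed in the text.
checks = [
    # (metric, observed value, pass criterion)
    ("endpoint on-time rate", 0.96, lambda v: v >= 0.95),
    ("last-window-day visit share", 0.08, lambda v: v < 0.10),
    ("superseded consent uses", 0, lambda v: v == 0),
    ("excursions per 100 storage/shipping days", 0.7, lambda v: v <= 1.0),
]

results = {name: passes(value) for name, value, passes in checks}
all_pass = all(results.values())
print("CAPA effectiveness demonstrated:", all_pass)
```

Keep the case open until every check passes for the full observation window; a single failing metric means the mechanism has not been fully addressed.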
Integrate with RBQM. Convert root causes into Key Risk Indicators (KRIs) and adjust Quality Tolerance Limits (QTLs) when warranted. Example: after temperature issues, add “excursions per 100 storage/shipping days” as a KRI and keep the ≤1 threshold as a QTL. Centralized monitoring should watch for relapse and trigger for-cause reviews if signals recur.
Vendor alignment matters. If the mechanism sits in a vendor system (eCOA algorithm, imaging portal parameters, courier routes), embed actions into the Quality Agreement: audit-trail/point-in-time export obligations, change control and release notes, help-desk metrics, uptime SLAs, and subcontractor flow-downs. File validation summaries and configuration snapshots in the TMF.
Document what changed and why. Use change control artifacts (requirements, risk assessment, test scripts/results, approvals, go-live time stamps) for any system or parameter changes. Link micro-training to the change (“what changed and why”) and gate system access until competence is demonstrated. Reconcile the training matrix with Delegation of Duties and user access lists.
Governance that closes the loop. Operate a cross-functional Risk Review Board (operations, data management/biostats, pharmacovigilance, supply/pharmacy, privacy/security, vendor management). Minutes must show signals → decisions → actions → effectiveness. When a study-level QTL is breached (e.g., “0 use of superseded consent” violated, primary endpoint on-time <95%, audit-trail retrieval failure), convene within the predefined window, perform RCA beyond “human error,” implement system changes, and keep the case open until metrics demonstrate sustained improvement without new failure modes.
Common pitfalls—and durable fixes.
- Labeling “human error” and retraining only → add structural fixes (gates, capacity, version locks, lane re-qualification) and verify with metrics.
- Unclear time handling → mandate local time + UTC offset in source and logs; sync devices; verify via audit-trail sampling.
- Vendor black boxes → require exportable logs and configuration snapshots; rehearse retrieval; keep certified samples in TMF.
- Blinding leaks → segregate unblinded repositories; arm-agnostic templates; access logs for any randomization-key view.
- Fragmented evidence → maintain a “rapid-pull” RCA bundle: problem statement, timeline, method artifacts, evidence, conclusions, CAPA, effectiveness checks, and governance minutes.
Bottom line. RCA is the bridge from detection to dependable improvement. When you apply the right methods, ground conclusions in auditable evidence, protect blinding and privacy, and wire results into CAPA, RBQM, and vendor agreements, your clinical program protects participants and produces data that withstand scrutiny from FDA, EMA, PMDA, TGA, the ICH, and the WHO.