Cleaning Validation for Reusable Surgical Instruments: Soil, Residue, and Worst Case
Build cleaning validation protocols for reusable surgical instruments, covering test soils, residue endpoints, worst-case devices, ST98 criteria, sampling, and reports.
Why Cleaning Validation Demands Its Own Protocol
Cleaning is the foundational step in every reprocessing workflow. If residual soil remains on a reusable surgical instrument after cleaning, downstream disinfection or sterilization may be compromised — organic material can shield microorganisms from chemical and thermal kill mechanisms. FDA, EU MDR, and international standards treat cleaning validation as a separate study that must stand on its own evidence, not as a footnote to disinfection or sterilization validation.
This guide provides the protocol-level detail manufacturers need: test soil selection, analyte endpoints, worst-case device selection, manual vs automated methods, sample size, acceptance criteria, and report structure. It complements the broader ISO 17664 reprocessing validation article by going deeper into cleaning-specific study design and analytical methods.
Regulatory and Standards Framework
| Standard / Guidance | Scope | Key Contribution to Cleaning Validation |
|---|---|---|
| ANSI/AAMI ST98:2022 | Cleaning validation of reusable medical devices | Defines study design, analyte selection, acceptance criteria, sample size, and reporting |
| ISO 15883-5:2021 | Washer-disinfector cleaning efficacy | Performance requirements and test methods for automated cleaning |
| ASTM F3208-20 | Test soil selection | Guide for choosing clinically relevant test soils by device type |
| ISO 17664-1:2021 | IFU content for critical/semi-critical devices | Specifies what reprocessing instructions manufacturers must provide |
| FDA Guidance (March 2015) | Reprocessing in health care settings | Defines labeling content, validation expectations, and premarket submission requirements |
| AAMI TIR30:2011 (superseded by ST98) | Compendium of test methods | Historical reference; ST98 now contains updated acceptance criteria |
| DGKH/DGSV/AKI Guideline (2017) | Automated cleaning and thermal disinfection (DACH region) | Validation and routine monitoring of washer-disinfectors |
| ISO 20417:2026 | Medical device labeling | Updated IFU readability requirements including measurable lay-user comprehension |
The Cleaning Validation Workflow
Cleaning validation follows a structured sequence. Skipping or combining steps is the most common source of FDA additional information requests and notified body findings.
Step 1: Device Grouping and Worst-Case Selection
Devices are grouped by design features that affect cleanability, not by product code or marketing name. Within each group, the worst-case device — the one most challenging to clean — is selected for testing.
Grouping criteria:
- Clinical use and soil type: Devices contacting blood vs. mucous membranes vs. skin
- Material compatibility: Stainless steel, aluminum, polymer, coated surfaces
- Design complexity: Lumens, hinges, serrations, blind holes, matted surfaces, spring-loaded mechanisms
- Surface area: Total surface area determines extraction volume and analyte concentration calculations
Worst-case device identification:
Select the device within each family with the greatest number of hard-to-clean features. Common worst-case features include:
| Feature | Cleaning Challenge | Example Devices |
|---|---|---|
| Narrow lumens (≤ 2 mm) | Limited fluid access, protein adhesion to walls | Arthroscopic shavers, suction cannulas, laparoscopic trocars |
| Hinges and box locks | Soil trapped in crevices during actuation | Hemostatic clamps, needle holders, scissors |
| Serrated jaws | Organic material wedged between teeth | Bone rongeurs, towel clips, forceps |
| Blind holes | No through-flow possible; relies on immersion and agitation | Drill bits with depth stops, reamer handles |
| Matted/braided surfaces | Soil woven into fiber matrix | Electrosurgical return electrodes, some cable assemblies |
| Spring-loaded mechanisms | Soil compressed into internal cavity during use | Bipolar forceps, articulating instruments |
Step 2: End-of-Life Conditioning
Before cleaning validation begins, devices undergo simulated-use conditioning to represent their maximum claimed reprocessing life. Per AAMI ST98 and FDA expectations:
- A minimum of six simulated-use cycles are performed (soiling, drying, cleaning, disinfection/sterilization)
- Each cycle includes full soiling, worst-case drying time, and complete reprocessing per the IFU
- Functional testing after conditioning confirms the device remains operational
- The last cycle is the validation cycle — soiled but not yet cleaned, to test the cleaning protocol at end of life
Step 3: Test Soil Selection
The test soil must simulate clinically relevant contamination. ASTM F3208-20 provides a framework for selection based on the device's intended clinical use.
| Clinical Use Category | Recommended Test Soil | Primary Analytes |
|---|---|---|
| Blood-contacting instruments | Defibrinated sheep blood or mixed soil (blood + protein) | Protein, hemoglobin |
| Tissue-contacting instruments | Mixed organic soil (protein + carbohydrate + lipid) | Protein, TOC |
| Endoscopes (GI, respiratory) | Simulated gastrointestinal soil | Protein, carbohydrate |
| Arthroscopic/laparoscopic instruments | Mixed blood-tissue soil | Protein, hemoglobin, TOC |
| General surgical instruments | ASTM F3208 default mixed soil | Protein + one additional analyte |
Soiling procedure:
- Apply soil to all device surfaces, focusing on hard-to-clean areas identified in Step 1
- Actuate movable parts during soiling to simulate clinical use
- Immersion or targeted application depending on clinical scenario
- Allow soil to dry for the maximum clinically relevant delay (often 30–120 minutes at room temperature) to simulate worst-case transport time
Step 4: Cleaning Under Worst-Case Conditions
The cleaning protocol tested must represent the least rigorous implementation of the IFU — shortest time, lowest temperature, weakest concentration, minimum mechanical action. This is a critical FDA expectation.
Manual cleaning worst-case parameters:
| Parameter | Worst-Case Choice |
|---|---|
| Detergent concentration | Minimum recommended dilution |
| Water temperature | Minimum specified temperature |
| Soaking time | Shortest specified soak duration |
| Brushing action | Minimum brush strokes per surface |
| Rinse volume | Minimum specified rinse volume |
| User technique | Trained but instructed not to exceed minimum effort |
Automated cleaning worst-case parameters:
| Parameter | Worst-Case Choice |
|---|---|
| Cycle selection | Shortest/mildest cycle in the IFU |
| Load configuration | Maximum load per IFU |
| Detergent concentration | Minimum per IFU |
| Water quality | Minimum acceptable quality grade |
| Drying | Minimum time or omitted if optional |
Step 5: Residue Extraction and Analyte Testing
After cleaning, devices undergo extraction to recover residual soil for quantitative analysis.
Extraction methods:
- Immersion/sonication: Device submerged in extraction solution (e.g., reverse osmosis water, 1% SDS) with ultrasonic agitation at controlled temperature (typically 40–50 °C for 30 minutes)
- Flush/elution for lumened devices: Measured volume of extraction solution flushed through lumen; effluent collected
- Swabbing for surfaces: Used for specific localized areas; less common for full-device validation
Analytes and assay methods:
| Analyte | Assay Method | AAMI ST98 Acceptance Criterion | Clinical Relevance |
|---|---|---|---|
| Protein | BCA, Bradford, Lowry, or OPA assay | ≤ 6.4 µg/cm² | Universal soil marker; dominant in blood and tissue |
| Hemoglobin | Spectrophotometric (absorbance at 540 nm) or HPLC | ≤ 2.2 µg/cm² | Blood-specific marker for blood-contacting devices |
| Total Organic Carbon (TOC) | Combustion or persulfate TOC analyzer | ≤ 12 µg/cm² | Broad organic residue measure |
| Carbohydrate | Phenol-sulfuric acid or enzymatic assay | ≤ 1.8 µg/cm² | Relevant for GI/respiratory contact devices |
| ATP | Bioluminescence assay | No formal ST98 limit; used as supplementary | Rapid cleanliness indicator |
Per AAMI ST98, critical and semi-critical devices require visual inspection plus at least two quantitative, clinically relevant analytes. Protein is routinely the first; the second is selected based on clinical use (e.g., hemoglobin for blood-contacting devices, TOC for mixed soil).
ISO 15883-5 alert and action levels for protein:
| Level | Threshold (as BSA equivalent) | Significance |
|---|---|---|
| Alert level | ≥ 3 µg/cm² | Investigate; may indicate process drift |
| Action level | ≥ 6.4 µg/cm² | Process failure; corrective action required |
Step 6: Visual Inspection
All devices must be visually inspected after cleaning. Visual inspection is a required pass/fail criterion regardless of quantitative results.
- Method: Trained analyst under controlled lighting (minimum 1,000 lux), with magnification (5–10×) for complex features
- Borescope/endoscope inspection for lumened devices to detect internal residue
- Acceptance: No visible soil on any surface, joint, lumen, or crevice
Step 7: Detergent Residue Evaluation
After quantitative analyte testing, cleaned devices should be evaluated for detergent residuals. Per AAMI ST98 and FDA expectations, detergent must be reduced to safe, non-toxic levels. This is typically demonstrated through:
- ISO cytotoxicity testing (ISO 10993-5) on the cleaned device extract — confirms residual detergent is below cytotoxic threshold
- Specific detergent assays if the extraction method may interfere with downstream testing
- This step is separate from soil removal validation and must not be omitted
Step 8: Controls
| Control Type | Purpose | Implementation |
|---|---|---|
| Positive device control | Confirm soiling was effective; validate extraction recovery | Soiled, extracted without cleaning |
| Negative device control | Confirm no background contamination | Clean, never-soiled device extracted |
| Extraction blank | Verify extraction solution is free of interfering analytes | Extraction solution processed without device |
| Method validation | Confirm extraction efficiency | Spike-and-recovery studies per analyte |
Sample Size Justification
AAMI ST98 specifies sample size requirements that differ from earlier TIR30 guidance:
- A minimum of three data points per endpoint test
- All data points must be within stated acceptance criteria
- The standard deviation of the data set is calculated and added to the highest data point; the result must not exceed the acceptance criterion
- If the sum exceeds the criterion, the sample size must be increased
| Study Element | Minimum |
|---|---|
| Devices per group | 3 test devices + 3 positive controls + 1 negative control |
| Extraction replicates | Per method validation (typically 1 per device) |
| Simulated-use cycles | Minimum 6 (per AAMI ST98 / FDA) |
Manual vs Automated Cleaning Validation
The choice between manual and automated cleaning affects study design significantly.
| Factor | Manual Cleaning | Automated Cleaning |
|---|---|---|
| Primary standard | AAMI ST98 | AAMI ST98 + ISO 15883-5 |
| Variability source | User technique, brush selection, soak time | Cycle parameters, load configuration, detergent dosing |
| Worst-case simulation | Minimum effort by trained user | Minimum cycle parameters per IFU |
| Reproducibility | Lower; requires tighter protocol control | Higher; machine-controlled |
| Typical deficiencies | Incomplete brushing of hinges/lumens, insufficient soak time | Incorrect load configuration, channel blockage detection failures |
| Recommended for | Devices without IFU for automated reprocessing | Devices with validated washer-disinfector cycle |
Key point: If the IFU offers both manual and automated options, each must be validated separately.
Common Deficiencies and How to Avoid Them
| Deficiency | Root Cause | Prevention |
|---|---|---|
| Soil selection does not match clinical use | Generic soil used instead of clinically relevant one | Reference ASTM F3208-20; justify soil choice based on clinical use |
| Only one analyte tested | Protocol written before ST98 required two | Plan for protein + one additional analyte from the start |
| No positive/negative controls | Study design oversight | Include controls in protocol template |
| Extraction method not validated | Assumed lab's standard method is sufficient | Perform spike-and-recovery before validation |
| Worst-case device not justified | Grouping rationale not documented | Document grouping criteria and worst-case selection in the protocol |
| Acceptance criteria not justified for non-standard analytes | Using ST98 limits for analytes not in ST98 | Justify with published literature or internal studies |
| No visual inspection protocol | Quantitative only | Add visual inspection criteria and lighting requirements |
| Sample size insufficient | Three devices tested but high variability | Calculate SD + max; increase sample size if needed |
Cleaning Validation Report Structure
A well-structured report facilitates FDA review and notified body audit. Recommended sections:
| Section | Content |
|---|---|
| 1. Protocol reference | Protocol number, revision, approval dates |
| 2. Device description | Trade name, model, materials, design features, claimed reprocessing life |
| 3. Device grouping rationale | Grouping criteria, family members, worst-case selection justification |
| 4. Test soil justification | Clinical use basis, soil composition, reference to ASTM F3208-20 |
| 5. Simulated-use conditioning | Number of cycles, cycle details, functional testing results |
| 6. Cleaning procedure | Complete IFU text reproduced, worst-case parameter table |
| 7. Extraction method | Solution, volume, time, temperature, sonication parameters, validation data |
| 8. Analytical methods | Assay type, calibration, limits of detection/quantitation |
| 9. Controls results | Positive control recovery, negative control, extraction blank |
| 10. Results per device | Per-device analyte concentrations (µg/cm²), visual inspection findings |
| 11. Statistical analysis | Mean, SD, SD + max calculation, sample size adequacy assessment |
| 12. Conclusion | Pass/fail per acceptance criterion; statement of validation status |
| 13. Deviations | Any protocol deviations with impact assessment |
| 14. Appendices | Raw data, calibration certificates, photographs, extraction method validation |
Decision Tree: Which Cleaning Validation Path Applies
Is the device reusable?
├── No → Cleaning validation not required
└── Yes
├── Is it supplied non-sterile requiring initial processing?
│ └── Yes → Validate initial cleaning per IFU
├── Is it critical (contacts sterile tissue)?
│ └── Yes → ST98: visual + ≥2 analytes + worst-case
├── Is it semi-critical (contacts mucous membranes)?
│ └── Yes → ST98: visual + ≥2 analytes + worst-case
└── Is it non-critical (contacts intact skin only)?
└── Yes → ST98 allows reduced rigor (visual + 1 analyte may suffice)
├── Manual cleaning in IFU?
│ └── Yes → Validate manual path separately
└── Automated cleaning in IFU?
└── Yes → Validate automated path per ISO 15883-5
Submission Evidence: What Goes Into Your 510(k) or Technical File
| Submission Type | FDA Expectation | EU MDR Expectation |
|---|---|---|
| 510(k) | Summary of cleaning validation; protocol and report (Appendix E devices require full report) | N/A |
| PMA / De Novo | Full protocol and test report | N/A |
| EU MDR Technical Documentation | N/A | Full validation report in Annex II technical file |
| Design History File | Documentation of tests per 21 CFR 820.30(j) | Design verification evidence |
| Device Master Record | Per 21 CFR 820.181 | N/A |
For FDA submissions, Emergo by UL (March 2026) notes that reprocessing is one of three areas where FDA expectations are "usually significantly different from most other regions of the world" — specifically, FDA expects device-based cleaning test data (summary or full report), whereas most other regulators require only sterilization validation.
Key Takeaways
- AAMI ST98:2022 is now the primary standard for cleaning validation, replacing TIR30 with updated acceptance criteria including TOC ≤ 12 µg/cm²
- Critical and semi-critical devices require visual inspection plus at least two quantitative analytes (protein + hemoglobin, TOC, or carbohydrate)
- Worst-case device selection is based on design complexity, not product code; document the rationale
- End-of-life conditioning (minimum 6 cycles) must precede the validation cleaning cycle
- Test soil must be clinically relevant per ASTM F3208-20
- Manual and automated cleaning paths require separate validation studies
- Sample size adequacy is determined by the SD + max calculation, not just minimum device count
Sources
- ANSI/AAMI ST98:2022 — Cleaning validation of health care products
- ISO 15883-5:2021 — Washer-disinfectors, Part 5: Performance requirements and test method criteria for demonstrating cleaning efficacy
- ASTM F3208-20 — Standard Guide for Selecting Test Soils for Validation of Cleaning Methods for Reusable Medical Devices
- ISO 17664-1:2021 — Processing of health care products, Part 1: Critical and semi-critical medical devices
- FDA Guidance — Reprocessing Medical Devices in Health Care Settings: Validation Methods and Labeling (March 2015)
- Emergo by UL — US FDA Reprocessing Expectations (March 2026)
- Nelson Labs — How to Perform Cleaning Validations for Semi-Critical and Critical Devices According to ANSI/AAMI ST98
- AAMI ST98:2022 acceptance criteria: Protein ≤ 6.4 µg/cm², Hemoglobin ≤ 2.2 µg/cm², TOC ≤ 12 µg/cm², Carbohydrate ≤ 1.8 µg/cm²
- ISO 20417:2026 — Medical device labeling requirements
- TÜV SÜD — Reprocessing Validation for Reusable Medical Devices (whitepaper)
- LexaMed — Cleaning Validation for Reusable Medical Devices (March 2025)