Cleaning Validation for Reusable Surgical Instruments: Soil, Residue, Worst-Case Devices, and Acceptance Criteria
Protocol-level guide to cleaning validation for reusable surgical instruments — covering artificial soil selection, protein/hemoglobin/TOC endpoints, worst-case device families, manual vs automated cleaning, ANSI/AAMI ST98 acceptance criteria, sample size justification, and report structure.
Why Cleaning Validation Demands Its Own Protocol
Cleaning is the foundational step in every reprocessing workflow. If residual soil remains on a reusable surgical instrument after cleaning, downstream disinfection or sterilization may be compromised — organic material can shield microorganisms from chemical and thermal kill mechanisms. FDA, EU MDR, and international standards treat cleaning validation as a separate study that must stand on its own evidence, not as a footnote to disinfection or sterilization validation.
This guide provides the protocol-level detail manufacturers need: test soil selection, analyte endpoints, worst-case device selection, manual vs automated methods, sample size, acceptance criteria, and report structure. It complements the broader ISO 17664 reprocessing validation article by going deeper into cleaning-specific study design and analytical methods.
Regulatory and Standards Framework
| Standard / Guidance | Scope | Key Contribution to Cleaning Validation |
|---|---|---|
| ANSI/AAMI ST98:2022 | Cleaning validation of reusable medical devices | Defines study design, analyte selection, acceptance criteria, sample size, and reporting |
| ISO 15883-5:2021 | Washer-disinfector cleaning efficacy | Performance requirements and test methods for automated cleaning |
| ASTM F3208-20 | Test soil selection | Guide for choosing clinically relevant test soils by device type |
| ISO 17664-1:2021 | IFU content for critical/semi-critical devices | Specifies what reprocessing instructions manufacturers must provide |
| FDA Guidance (March 2015) | Reprocessing in health care settings | Defines labeling content, validation expectations, and premarket submission requirements |
| AAMI TIR30:2011 (superseded by ST98) | Compendium of test methods | Historical reference; ST98 now contains updated acceptance criteria |
| DGKH/DGSV/AKI Guideline (2017) | Automated cleaning and thermal disinfection (DACH region) | Validation and routine monitoring of washer-disinfectors |
| ISO 20417:2026 | Medical device labeling | Updated IFU readability requirements including measurable lay-user comprehension |
The Cleaning Validation Workflow
Cleaning validation follows a structured sequence. Skipping or combining steps is the most common source of FDA additional information requests and notified body findings.
Step 1: Device Grouping and Worst-Case Selection
Devices are grouped by design features that affect cleanability, not by product code or marketing name. Within each group, the worst-case device — the one most challenging to clean — is selected for testing.
Grouping criteria:
- Clinical use and soil type: Devices contacting blood vs. mucous membranes vs. skin
- Material compatibility: Stainless steel, aluminum, polymer, coated surfaces
- Design complexity: Lumens, hinges, serrations, blind holes, matted surfaces, spring-loaded mechanisms
- Surface area: Total surface area determines extraction volume and analyte concentration calculations
Worst-case device identification:
Select the device within each family with the greatest number of hard-to-clean features. Common worst-case features include:
| Feature | Cleaning Challenge | Example Devices |
|---|---|---|
| Narrow lumens (≤ 2 mm) | Limited fluid access, protein adhesion to walls | Arthroscopic shavers, suction cannulas, laparoscopic trocars |
| Hinges and box locks | Soil trapped in crevices during actuation | Hemostatic clamps, needle holders, scissors |
| Serrated jaws | Organic material wedged between teeth | Bone rongeurs, towel clips, forceps |
| Blind holes | No through-flow possible; relies on immersion and agitation | Drill bits with depth stops, reamer handles |
| Matted/braided surfaces | Soil woven into fiber matrix | Electrosurgical return electrodes, some cable assemblies |
| Spring-loaded mechanisms | Soil compressed into internal cavity during use | Bipolar forceps, articulating instruments |
Step 2: End-of-Life Conditioning
Before cleaning validation begins, devices undergo simulated-use conditioning to represent their maximum claimed reprocessing life. Per AAMI ST98 and FDA expectations:
- A minimum of six simulated-use cycles are performed (soiling, drying, cleaning, disinfection/sterilization)
- Each cycle includes full soiling, worst-case drying time, and complete reprocessing per the IFU
- Functional testing after conditioning confirms the device remains operational
- The last cycle is the validation cycle — soiled but not yet cleaned, to test the cleaning protocol at end of life
Step 3: Test Soil Selection
The test soil must simulate clinically relevant contamination. ASTM F3208-20 provides a framework for selection based on the device's intended clinical use.
| Clinical Use Category | Recommended Test Soil | Primary Analytes |
|---|---|---|
| Blood-contacting instruments | Defibrinated sheep blood or mixed soil (blood + protein) | Protein, hemoglobin |
| Tissue-contacting instruments | Mixed organic soil (protein + carbohydrate + lipid) | Protein, TOC |
| Endoscopes (GI, respiratory) | Simulated gastrointestinal soil | Protein, carbohydrate |
| Arthroscopic/laparoscopic instruments | Mixed blood-tissue soil | Protein, hemoglobin, TOC |
| General surgical instruments | ASTM F3208 default mixed soil | Protein + one additional analyte |
Soiling procedure:
- Apply soil to all device surfaces, focusing on hard-to-clean areas identified in Step 1
- Actuate movable parts during soiling to simulate clinical use
- Immersion or targeted application depending on clinical scenario
- Allow soil to dry for the maximum clinically relevant delay (often 30–120 minutes at room temperature) to simulate worst-case transport time
Step 4: Cleaning Under Worst-Case Conditions
The cleaning protocol tested must represent the least rigorous implementation of the IFU — shortest time, lowest temperature, weakest concentration, minimum mechanical action. This is a critical FDA expectation.
Manual cleaning worst-case parameters:
| Parameter | Worst-Case Choice |
|---|---|
| Detergent concentration | Minimum recommended dilution |
| Water temperature | Minimum specified temperature |
| Soaking time | Shortest specified soak duration |
| Brushing action | Minimum brush strokes per surface |
| Rinse volume | Minimum specified rinse volume |
| User technique | Trained but instructed not to exceed minimum effort |
Automated cleaning worst-case parameters:
| Parameter | Worst-Case Choice |
|---|---|
| Cycle selection | Shortest/mildest cycle in the IFU |
| Load configuration | Maximum load per IFU |
| Detergent concentration | Minimum per IFU |
| Water quality | Minimum acceptable quality grade |
| Drying | Minimum time or omitted if optional |
Step 5: Residue Extraction and Analyte Testing
After cleaning, devices undergo extraction to recover residual soil for quantitative analysis.
Extraction methods:
- Immersion/sonication: Device submerged in extraction solution (e.g., reverse osmosis water, 1% SDS) with ultrasonic agitation at controlled temperature (typically 40–50 °C for 30 minutes)
- Flush/elution for lumened devices: Measured volume of extraction solution flushed through lumen; effluent collected
- Swabbing for surfaces: Used for specific localized areas; less common for full-device validation
Analytes and assay methods:
| Analyte | Assay Method | AAMI ST98 Acceptance Criterion | Clinical Relevance |
|---|---|---|---|
| Protein | BCA, Bradford, Lowry, or OPA assay | ≤ 6.4 µg/cm² | Universal soil marker; dominant in blood and tissue |
| Hemoglobin | Spectrophotometric (absorbance at 540 nm) or HPLC | ≤ 2.2 µg/cm² | Blood-specific marker for blood-contacting devices |
| Total Organic Carbon (TOC) | Combustion or persulfate TOC analyzer | ≤ 12 µg/cm² | Broad organic residue measure |
| Carbohydrate | Phenol-sulfuric acid or enzymatic assay | ≤ 1.8 µg/cm² | Relevant for GI/respiratory contact devices |
| ATP | Bioluminescence assay | No formal ST98 limit; used as supplementary | Rapid cleanliness indicator |
Per AAMI ST98, critical and semi-critical devices require visual inspection plus at least two quantitative, clinically relevant analytes. Protein is routinely the first; the second is selected based on clinical use (e.g., hemoglobin for blood-contacting devices, TOC for mixed soil).
ISO 15883-5 alert and action levels for protein:
| Level | Threshold (as BSA equivalent) | Significance |
|---|---|---|
| Alert level | ≥ 3 µg/cm² | Investigate; may indicate process drift |
| Action level | ≥ 6.4 µg/cm² | Process failure; corrective action required |
Step 6: Visual Inspection
All devices must be visually inspected after cleaning. Visual inspection is a required pass/fail criterion regardless of quantitative results.
- Method: Trained analyst under controlled lighting (minimum 1,000 lux), with magnification (5–10×) for complex features
- Borescope/endoscope inspection for lumened devices to detect internal residue
- Acceptance: No visible soil on any surface, joint, lumen, or crevice
Step 7: Detergent Residue Evaluation
After quantitative analyte testing, cleaned devices should be evaluated for detergent residuals. Per AAMI ST98 and FDA expectations, detergent must be reduced to safe, non-toxic levels. This is typically demonstrated through:
- ISO cytotoxicity testing (ISO 10993-5) on the cleaned device extract — confirms residual detergent is below cytotoxic threshold
- Specific detergent assays if the extraction method may interfere with downstream testing
- This step is separate from soil removal validation and must not be omitted
Step 8: Controls
| Control Type | Purpose | Implementation |
|---|---|---|
| Positive device control | Confirm soiling was effective; validate extraction recovery | Soiled, extracted without cleaning |
| Negative device control | Confirm no background contamination | Clean, never-soiled device extracted |
| Extraction blank | Verify extraction solution is free of interfering analytes | Extraction solution processed without device |
| Method validation | Confirm extraction efficiency | Spike-and-recovery studies per analyte |
Sample Size Justification
AAMI ST98 specifies sample size requirements that differ from earlier TIR30 guidance:
- A minimum of three data points per endpoint test
- All data points must be within stated acceptance criteria
- The standard deviation of the data set is calculated and added to the highest data point; the result must not exceed the acceptance criterion
- If the sum exceeds the criterion, the sample size must be increased
| Study Element | Minimum |
|---|---|
| Devices per group | 3 test devices + 3 positive controls + 1 negative control |
| Extraction replicates | Per method validation (typically 1 per device) |
| Simulated-use cycles | Minimum 6 (per AAMI ST98 / FDA) |
Manual vs Automated Cleaning Validation
The choice between manual and automated cleaning affects study design significantly.
| Factor | Manual Cleaning | Automated Cleaning |
|---|---|---|
| Primary standard | AAMI ST98 | AAMI ST98 + ISO 15883-5 |
| Variability source | User technique, brush selection, soak time | Cycle parameters, load configuration, detergent dosing |
| Worst-case simulation | Minimum effort by trained user | Minimum cycle parameters per IFU |
| Reproducibility | Lower; requires tighter protocol control | Higher; machine-controlled |
| Typical deficiencies | Incomplete brushing of hinges/lumens, insufficient soak time | Incorrect load configuration, channel blockage detection failures |
| Recommended for | Devices without IFU for automated reprocessing | Devices with validated washer-disinfector cycle |
Key point: If the IFU offers both manual and automated options, each must be validated separately.
Common Deficiencies and How to Avoid Them
| Deficiency | Root Cause | Prevention |
|---|---|---|
| Soil selection does not match clinical use | Generic soil used instead of clinically relevant one | Reference ASTM F3208-20; justify soil choice based on clinical use |
| Only one analyte tested | Protocol written before ST98 required two | Plan for protein + one additional analyte from the start |
| No positive/negative controls | Study design oversight | Include controls in protocol template |
| Extraction method not validated | Assumed lab's standard method is sufficient | Perform spike-and-recovery before validation |
| Worst-case device not justified | Grouping rationale not documented | Document grouping criteria and worst-case selection in the protocol |
| Acceptance criteria not justified for non-standard analytes | Using ST98 limits for analytes not in ST98 | Justify with published literature or internal studies |
| No visual inspection protocol | Quantitative only | Add visual inspection criteria and lighting requirements |
| Sample size insufficient | Three devices tested but high variability | Calculate SD + max; increase sample size if needed |
Cleaning Validation Report Structure
A well-structured report facilitates FDA review and notified body audit. Recommended sections:
| Section | Content |
|---|---|
| 1. Protocol reference | Protocol number, revision, approval dates |
| 2. Device description | Trade name, model, materials, design features, claimed reprocessing life |
| 3. Device grouping rationale | Grouping criteria, family members, worst-case selection justification |
| 4. Test soil justification | Clinical use basis, soil composition, reference to ASTM F3208-20 |
| 5. Simulated-use conditioning | Number of cycles, cycle details, functional testing results |
| 6. Cleaning procedure | Complete IFU text reproduced, worst-case parameter table |
| 7. Extraction method | Solution, volume, time, temperature, sonication parameters, validation data |
| 8. Analytical methods | Assay type, calibration, limits of detection/quantitation |
| 9. Controls results | Positive control recovery, negative control, extraction blank |
| 10. Results per device | Per-device analyte concentrations (µg/cm²), visual inspection findings |
| 11. Statistical analysis | Mean, SD, SD + max calculation, sample size adequacy assessment |
| 12. Conclusion | Pass/fail per acceptance criterion; statement of validation status |
| 13. Deviations | Any protocol deviations with impact assessment |
| 14. Appendices | Raw data, calibration certificates, photographs, extraction method validation |
Decision Tree: Which Cleaning Validation Path Applies
Is the device reusable?
├── No → Cleaning validation not required
└── Yes
├── Is it supplied non-sterile requiring initial processing?
│ └── Yes → Validate initial cleaning per IFU
├── Is it critical (contacts sterile tissue)?
│ └── Yes → ST98: visual + ≥2 analytes + worst-case
├── Is it semi-critical (contacts mucous membranes)?
│ └── Yes → ST98: visual + ≥2 analytes + worst-case
└── Is it non-critical (contacts intact skin only)?
└── Yes → ST98 allows reduced rigor (visual + 1 analyte may suffice)
├── Manual cleaning in IFU?
│ └── Yes → Validate manual path separately
└── Automated cleaning in IFU?
└── Yes → Validate automated path per ISO 15883-5
Submission Evidence: What Goes Into Your 510(k) or Technical File
| Submission Type | FDA Expectation | EU MDR Expectation |
|---|---|---|
| 510(k) | Summary of cleaning validation; protocol and report (Appendix E devices require full report) | N/A |
| PMA / De Novo | Full protocol and test report | N/A |
| EU MDR Technical Documentation | N/A | Full validation report in Annex II technical file |
| Design History File | Documentation of tests per 21 CFR 820.30(j) | Design verification evidence |
| Device Master Record | Per 21 CFR 820.181 | N/A |
For FDA submissions, Emergo by UL (March 2026) notes that reprocessing is one of three areas where FDA expectations are "usually significantly different from most other regions of the world" — specifically, FDA expects device-based cleaning test data (summary or full report), whereas most other regulators require only sterilization validation.
Key Takeaways
- AAMI ST98:2022 is now the primary standard for cleaning validation, replacing TIR30 with updated acceptance criteria including TOC ≤ 12 µg/cm²
- Critical and semi-critical devices require visual inspection plus at least two quantitative analytes (protein + hemoglobin, TOC, or carbohydrate)
- Worst-case device selection is based on design complexity, not product code; document the rationale
- End-of-life conditioning (minimum 6 cycles) must precede the validation cleaning cycle
- Test soil must be clinically relevant per ASTM F3208-20
- Manual and automated cleaning paths require separate validation studies
- Sample size adequacy is determined by the SD + max calculation, not just minimum device count
Sources
- ANSI/AAMI ST98:2022 — Cleaning validation of health care products
- ISO 15883-5:2021 — Washer-disinfectors, Part 5: Performance requirements and test method criteria for demonstrating cleaning efficacy
- ASTM F3208-20 — Standard Guide for Selecting Test Soils for Validation of Cleaning Methods for Reusable Medical Devices
- ISO 17664-1:2021 — Processing of health care products, Part 1: Critical and semi-critical medical devices
- FDA Guidance — Reprocessing Medical Devices in Health Care Settings: Validation Methods and Labeling (March 2015)
- Emergo by UL — US FDA Reprocessing Expectations (March 2026)
- Nelson Labs — How to Perform Cleaning Validations for Semi-Critical and Critical Devices According to ANSI/AAMI ST98
- AAMI ST98:2022 acceptance criteria: Protein ≤ 6.4 µg/cm², Hemoglobin ≤ 2.2 µg/cm², TOC ≤ 12 µg/cm², Carbohydrate ≤ 1.8 µg/cm²
- ISO 20417:2026 — Medical device labeling requirements
- TÜV SÜD — Reprocessing Validation for Reusable Medical Devices (whitepaper)
- LexaMed — Cleaning Validation for Reusable Medical Devices (March 2025)