MedDeviceGuideMedDeviceGuide
Back

Cleaning Validation for Reusable Surgical Instruments: Soil, Residue, Worst-Case Devices, and Acceptance Criteria

Protocol-level guide to cleaning validation for reusable surgical instruments — covering artificial soil selection, protein/hemoglobin/TOC endpoints, worst-case device families, manual vs automated cleaning, ANSI/AAMI ST98 acceptance criteria, sample size justification, and report structure.

Ran Chen
Ran Chen
Global MedTech Expert | 10× MedTech Global Access
2026-04-3014 min read

Why Cleaning Validation Demands Its Own Protocol

Cleaning is the foundational step in every reprocessing workflow. If residual soil remains on a reusable surgical instrument after cleaning, downstream disinfection or sterilization may be compromised — organic material can shield microorganisms from chemical and thermal kill mechanisms. FDA, EU MDR, and international standards treat cleaning validation as a separate study that must stand on its own evidence, not as a footnote to disinfection or sterilization validation.

This guide provides the protocol-level detail manufacturers need: test soil selection, analyte endpoints, worst-case device selection, manual vs automated methods, sample size, acceptance criteria, and report structure. It complements the broader ISO 17664 reprocessing validation article by going deeper into cleaning-specific study design and analytical methods.

Regulatory and Standards Framework

Standard / Guidance Scope Key Contribution to Cleaning Validation
ANSI/AAMI ST98:2022 Cleaning validation of reusable medical devices Defines study design, analyte selection, acceptance criteria, sample size, and reporting
ISO 15883-5:2021 Washer-disinfector cleaning efficacy Performance requirements and test methods for automated cleaning
ASTM F3208-20 Test soil selection Guide for choosing clinically relevant test soils by device type
ISO 17664-1:2021 IFU content for critical/semi-critical devices Specifies what reprocessing instructions manufacturers must provide
FDA Guidance (March 2015) Reprocessing in health care settings Defines labeling content, validation expectations, and premarket submission requirements
AAMI TIR30:2011 (superseded by ST98) Compendium of test methods Historical reference; ST98 now contains updated acceptance criteria
DGKH/DGSV/AKI Guideline (2017) Automated cleaning and thermal disinfection (DACH region) Validation and routine monitoring of washer-disinfectors
ISO 20417:2026 Medical device labeling Updated IFU readability requirements including measurable lay-user comprehension

The Cleaning Validation Workflow

Cleaning validation follows a structured sequence. Skipping or combining steps is the most common source of FDA additional information requests and notified body findings.

Step 1: Device Grouping and Worst-Case Selection

Devices are grouped by design features that affect cleanability, not by product code or marketing name. Within each group, the worst-case device — the one most challenging to clean — is selected for testing.

Grouping criteria:

  • Clinical use and soil type: Devices contacting blood vs. mucous membranes vs. skin
  • Material compatibility: Stainless steel, aluminum, polymer, coated surfaces
  • Design complexity: Lumens, hinges, serrations, blind holes, matted surfaces, spring-loaded mechanisms
  • Surface area: Total surface area determines extraction volume and analyte concentration calculations

Worst-case device identification:

Select the device within each family with the greatest number of hard-to-clean features. Common worst-case features include:

Feature Cleaning Challenge Example Devices
Narrow lumens (≤ 2 mm) Limited fluid access, protein adhesion to walls Arthroscopic shavers, suction cannulas, laparoscopic trocars
Hinges and box locks Soil trapped in crevices during actuation Hemostatic clamps, needle holders, scissors
Serrated jaws Organic material wedged between teeth Bone rongeurs, towel clips, forceps
Blind holes No through-flow possible; relies on immersion and agitation Drill bits with depth stops, reamer handles
Matted/braided surfaces Soil woven into fiber matrix Electrosurgical return electrodes, some cable assemblies
Spring-loaded mechanisms Soil compressed into internal cavity during use Bipolar forceps, articulating instruments

Step 2: End-of-Life Conditioning

Before cleaning validation begins, devices undergo simulated-use conditioning to represent their maximum claimed reprocessing life. Per AAMI ST98 and FDA expectations:

  • A minimum of six simulated-use cycles are performed (soiling, drying, cleaning, disinfection/sterilization)
  • Each cycle includes full soiling, worst-case drying time, and complete reprocessing per the IFU
  • Functional testing after conditioning confirms the device remains operational
  • The last cycle is the validation cycle — soiled but not yet cleaned, to test the cleaning protocol at end of life

Step 3: Test Soil Selection

The test soil must simulate clinically relevant contamination. ASTM F3208-20 provides a framework for selection based on the device's intended clinical use.

Clinical Use Category Recommended Test Soil Primary Analytes
Blood-contacting instruments Defibrinated sheep blood or mixed soil (blood + protein) Protein, hemoglobin
Tissue-contacting instruments Mixed organic soil (protein + carbohydrate + lipid) Protein, TOC
Endoscopes (GI, respiratory) Simulated gastrointestinal soil Protein, carbohydrate
Arthroscopic/laparoscopic instruments Mixed blood-tissue soil Protein, hemoglobin, TOC
General surgical instruments ASTM F3208 default mixed soil Protein + one additional analyte

Soiling procedure:

  • Apply soil to all device surfaces, focusing on hard-to-clean areas identified in Step 1
  • Actuate movable parts during soiling to simulate clinical use
  • Immersion or targeted application depending on clinical scenario
  • Allow soil to dry for the maximum clinically relevant delay (often 30–120 minutes at room temperature) to simulate worst-case transport time

Step 4: Cleaning Under Worst-Case Conditions

The cleaning protocol tested must represent the least rigorous implementation of the IFU — shortest time, lowest temperature, weakest concentration, minimum mechanical action. This is a critical FDA expectation.

Manual cleaning worst-case parameters:

Parameter Worst-Case Choice
Detergent concentration Minimum recommended dilution
Water temperature Minimum specified temperature
Soaking time Shortest specified soak duration
Brushing action Minimum brush strokes per surface
Rinse volume Minimum specified rinse volume
User technique Trained but instructed not to exceed minimum effort

Automated cleaning worst-case parameters:

Parameter Worst-Case Choice
Cycle selection Shortest/mildest cycle in the IFU
Load configuration Maximum load per IFU
Detergent concentration Minimum per IFU
Water quality Minimum acceptable quality grade
Drying Minimum time or omitted if optional

Step 5: Residue Extraction and Analyte Testing

After cleaning, devices undergo extraction to recover residual soil for quantitative analysis.

Extraction methods:

  • Immersion/sonication: Device submerged in extraction solution (e.g., reverse osmosis water, 1% SDS) with ultrasonic agitation at controlled temperature (typically 40–50 °C for 30 minutes)
  • Flush/elution for lumened devices: Measured volume of extraction solution flushed through lumen; effluent collected
  • Swabbing for surfaces: Used for specific localized areas; less common for full-device validation

Analytes and assay methods:

Analyte Assay Method AAMI ST98 Acceptance Criterion Clinical Relevance
Protein BCA, Bradford, Lowry, or OPA assay ≤ 6.4 µg/cm² Universal soil marker; dominant in blood and tissue
Hemoglobin Spectrophotometric (absorbance at 540 nm) or HPLC ≤ 2.2 µg/cm² Blood-specific marker for blood-contacting devices
Total Organic Carbon (TOC) Combustion or persulfate TOC analyzer ≤ 12 µg/cm² Broad organic residue measure
Carbohydrate Phenol-sulfuric acid or enzymatic assay ≤ 1.8 µg/cm² Relevant for GI/respiratory contact devices
ATP Bioluminescence assay No formal ST98 limit; used as supplementary Rapid cleanliness indicator

Per AAMI ST98, critical and semi-critical devices require visual inspection plus at least two quantitative, clinically relevant analytes. Protein is routinely the first; the second is selected based on clinical use (e.g., hemoglobin for blood-contacting devices, TOC for mixed soil).

ISO 15883-5 alert and action levels for protein:

Level Threshold (as BSA equivalent) Significance
Alert level ≥ 3 µg/cm² Investigate; may indicate process drift
Action level ≥ 6.4 µg/cm² Process failure; corrective action required

Step 6: Visual Inspection

All devices must be visually inspected after cleaning. Visual inspection is a required pass/fail criterion regardless of quantitative results.

  • Method: Trained analyst under controlled lighting (minimum 1,000 lux), with magnification (5–10×) for complex features
  • Borescope/endoscope inspection for lumened devices to detect internal residue
  • Acceptance: No visible soil on any surface, joint, lumen, or crevice

Step 7: Detergent Residue Evaluation

After quantitative analyte testing, cleaned devices should be evaluated for detergent residuals. Per AAMI ST98 and FDA expectations, detergent must be reduced to safe, non-toxic levels. This is typically demonstrated through:

  • ISO cytotoxicity testing (ISO 10993-5) on the cleaned device extract — confirms residual detergent is below cytotoxic threshold
  • Specific detergent assays if the extraction method may interfere with downstream testing
  • This step is separate from soil removal validation and must not be omitted

Step 8: Controls

Control Type Purpose Implementation
Positive device control Confirm soiling was effective; validate extraction recovery Soiled, extracted without cleaning
Negative device control Confirm no background contamination Clean, never-soiled device extracted
Extraction blank Verify extraction solution is free of interfering analytes Extraction solution processed without device
Method validation Confirm extraction efficiency Spike-and-recovery studies per analyte
Recommended Reading
Washer-Disinfector Validation for Medical Devices: ISO 15883, Thermal Disinfection, Process Parameters, and Routine Monitoring
Quality Systems Regulatory2026-04-30 · 14 min read

Sample Size Justification

AAMI ST98 specifies sample size requirements that differ from earlier TIR30 guidance:

  1. A minimum of three data points per endpoint test
  2. All data points must be within stated acceptance criteria
  3. The standard deviation of the data set is calculated and added to the highest data point; the result must not exceed the acceptance criterion
  4. If the sum exceeds the criterion, the sample size must be increased
Study Element Minimum
Devices per group 3 test devices + 3 positive controls + 1 negative control
Extraction replicates Per method validation (typically 1 per device)
Simulated-use cycles Minimum 6 (per AAMI ST98 / FDA)

Manual vs Automated Cleaning Validation

The choice between manual and automated cleaning affects study design significantly.

Factor Manual Cleaning Automated Cleaning
Primary standard AAMI ST98 AAMI ST98 + ISO 15883-5
Variability source User technique, brush selection, soak time Cycle parameters, load configuration, detergent dosing
Worst-case simulation Minimum effort by trained user Minimum cycle parameters per IFU
Reproducibility Lower; requires tighter protocol control Higher; machine-controlled
Typical deficiencies Incomplete brushing of hinges/lumens, insufficient soak time Incorrect load configuration, channel blockage detection failures
Recommended for Devices without IFU for automated reprocessing Devices with validated washer-disinfector cycle

Key point: If the IFU offers both manual and automated options, each must be validated separately.

Common Deficiencies and How to Avoid Them

Deficiency Root Cause Prevention
Soil selection does not match clinical use Generic soil used instead of clinically relevant one Reference ASTM F3208-20; justify soil choice based on clinical use
Only one analyte tested Protocol written before ST98 required two Plan for protein + one additional analyte from the start
No positive/negative controls Study design oversight Include controls in protocol template
Extraction method not validated Assumed lab's standard method is sufficient Perform spike-and-recovery before validation
Worst-case device not justified Grouping rationale not documented Document grouping criteria and worst-case selection in the protocol
Acceptance criteria not justified for non-standard analytes Using ST98 limits for analytes not in ST98 Justify with published literature or internal studies
No visual inspection protocol Quantitative only Add visual inspection criteria and lighting requirements
Sample size insufficient Three devices tested but high variability Calculate SD + max; increase sample size if needed
Recommended Reading
Coordinated Vulnerability Disclosure for Medical Devices: Building a Post-Market Cybersecurity Program
Cybersecurity Regulatory2026-04-30 · 11 min read

Cleaning Validation Report Structure

A well-structured report facilitates FDA review and notified body audit. Recommended sections:

Section Content
1. Protocol reference Protocol number, revision, approval dates
2. Device description Trade name, model, materials, design features, claimed reprocessing life
3. Device grouping rationale Grouping criteria, family members, worst-case selection justification
4. Test soil justification Clinical use basis, soil composition, reference to ASTM F3208-20
5. Simulated-use conditioning Number of cycles, cycle details, functional testing results
6. Cleaning procedure Complete IFU text reproduced, worst-case parameter table
7. Extraction method Solution, volume, time, temperature, sonication parameters, validation data
8. Analytical methods Assay type, calibration, limits of detection/quantitation
9. Controls results Positive control recovery, negative control, extraction blank
10. Results per device Per-device analyte concentrations (µg/cm²), visual inspection findings
11. Statistical analysis Mean, SD, SD + max calculation, sample size adequacy assessment
12. Conclusion Pass/fail per acceptance criterion; statement of validation status
13. Deviations Any protocol deviations with impact assessment
14. Appendices Raw data, calibration certificates, photographs, extraction method validation

Decision Tree: Which Cleaning Validation Path Applies

Is the device reusable?
├── No → Cleaning validation not required
└── Yes
    ├── Is it supplied non-sterile requiring initial processing?
    │   └── Yes → Validate initial cleaning per IFU
    ├── Is it critical (contacts sterile tissue)?
    │   └── Yes → ST98: visual + ≥2 analytes + worst-case
    ├── Is it semi-critical (contacts mucous membranes)?
    │   └── Yes → ST98: visual + ≥2 analytes + worst-case
    └── Is it non-critical (contacts intact skin only)?
        └── Yes → ST98 allows reduced rigor (visual + 1 analyte may suffice)
            ├── Manual cleaning in IFU?
            │   └── Yes → Validate manual path separately
            └── Automated cleaning in IFU?
                └── Yes → Validate automated path per ISO 15883-5

Submission Evidence: What Goes Into Your 510(k) or Technical File

Submission Type FDA Expectation EU MDR Expectation
510(k) Summary of cleaning validation; protocol and report (Appendix E devices require full report) N/A
PMA / De Novo Full protocol and test report N/A
EU MDR Technical Documentation N/A Full validation report in Annex II technical file
Design History File Documentation of tests per 21 CFR 820.30(j) Design verification evidence
Device Master Record Per 21 CFR 820.181 N/A

For FDA submissions, Emergo by UL (March 2026) notes that reprocessing is one of three areas where FDA expectations are "usually significantly different from most other regions of the world" — specifically, FDA expects device-based cleaning test data (summary or full report), whereas most other regulators require only sterilization validation.

Recommended Reading
ISO 17664 Reprocessing Validation for Reusable Medical Devices: Standards, FDA Expectations, and Submission Evidence
Quality Systems Regulatory2026-04-30 · 12 min read

Key Takeaways

  • AAMI ST98:2022 is now the primary standard for cleaning validation, replacing TIR30 with updated acceptance criteria including TOC ≤ 12 µg/cm²
  • Critical and semi-critical devices require visual inspection plus at least two quantitative analytes (protein + hemoglobin, TOC, or carbohydrate)
  • Worst-case device selection is based on design complexity, not product code; document the rationale
  • End-of-life conditioning (minimum 6 cycles) must precede the validation cleaning cycle
  • Test soil must be clinically relevant per ASTM F3208-20
  • Manual and automated cleaning paths require separate validation studies
  • Sample size adequacy is determined by the SD + max calculation, not just minimum device count

Sources

  • ANSI/AAMI ST98:2022 — Cleaning validation of health care products
  • ISO 15883-5:2021 — Washer-disinfectors, Part 5: Performance requirements and test method criteria for demonstrating cleaning efficacy
  • ASTM F3208-20 — Standard Guide for Selecting Test Soils for Validation of Cleaning Methods for Reusable Medical Devices
  • ISO 17664-1:2021 — Processing of health care products, Part 1: Critical and semi-critical medical devices
  • FDA Guidance — Reprocessing Medical Devices in Health Care Settings: Validation Methods and Labeling (March 2015)
  • Emergo by UL — US FDA Reprocessing Expectations (March 2026)
  • Nelson Labs — How to Perform Cleaning Validations for Semi-Critical and Critical Devices According to ANSI/AAMI ST98
  • AAMI ST98:2022 acceptance criteria: Protein ≤ 6.4 µg/cm², Hemoglobin ≤ 2.2 µg/cm², TOC ≤ 12 µg/cm², Carbohydrate ≤ 1.8 µg/cm²
  • ISO 20417:2026 — Medical device labeling requirements
  • TÜV SÜD — Reprocessing Validation for Reusable Medical Devices (whitepaper)
  • LexaMed — Cleaning Validation for Reusable Medical Devices (March 2025)