A 2025 Retrospective for Investigations of a Dog

If the history of pharmaceutical quality management were written as a geological timeline, 2025 would hopefully mark the end of the Holocene of Compliance—a long, stable epoch where “following the procedure” was sufficient to ensure survival—and the beginning of the Anthropocene of Complexity.

For decades, our industry has operated under a tacit social contract. We agreed to pretend that “compliance” was synonymous with “quality.” We agreed to pretend that a validated method would work forever because we proved it worked once in a controlled protocol three years ago. We agreed to pretend that “zero deviations” meant “perfect performance,” rather than “blind surveillance.” We agreed to pretend that if we wrote enough documents, reality would conform to them.

If I had my wish, 2025 would be the year that contract finally dissolved.

Throughout the year—across dozens of posts, technical analyses, and industry critiques on this blog—I have tried to dismantle the comfortable illusions of “Compliance Theater” and show how this theater collides violently with the unforgiving reality of complex systems.

The connecting thread running through every one of these developments is the concept I have returned to obsessively this year: Falsifiable Quality.

This Year in Review is not merely a summary of blog posts. It is an attempt to synthesize the fragmented lessons of 2025 into a coherent argument. The argument is this: A quality system that cannot be proven wrong is a quality system that cannot be trusted.

If our systems—our validation protocols, our risk assessments, our environmental monitoring programs—are designed only to confirm what we hope is true (the “Happy Path”), they are not quality systems at all. They are comfort blankets. And 2025 was the year we finally started pulling the blanket off.

The Philosophy of Doubt

(Reflecting on: The Effectiveness Paradox, Sidney Dekker, and Gerd Gigerenzer)

Before we dissect the technical failures of 2025, let me first establish the philosophical framework that defined this year’s analysis.

In August, I published “The Effectiveness Paradox: Why ‘Nothing Bad Happened’ Doesn’t Prove Your Quality System Works.” It became one of the most discussed posts of the year because it attacked the most sacred metric in our industry: the trend line that stays flat.

We are conditioned to view stability as success. If Environmental Monitoring (EM) data shows zero excursions for six months, we throw a pizza party. If a method validation passes all acceptance criteria on the first try, we commend the development team. If a year goes by with no Critical deviations, we pay out bonuses.

But through the lens of Falsifiable Quality—a concept heavily influenced by the philosophy of Karl Popper, the challenging insights of Deming, and the safety science of Sidney Dekker, whom we discussed in November—these “successes” look suspiciously like failures of inquiry.

The Problem with Unfalsifiable Systems

Karl Popper famously argued that a theory is only scientific if it makes predictions that can be tested and, in principle, proven false. “All swans are white” is a scientific statement because finding one black swan falsifies it. “God is love” is not, because no empirical observation can disprove it.

In 2025, I argued that most Pharmaceutical Quality Systems (PQS) are designed to be unfalsifiable.

The Unfalsifiable Alert Limit: We set alert limits based on historical averages + 3 standard deviations. This ensures that we only react to statistical outliers, effectively blinding us to gradual drift or systemic degradation that remains “within the noise.”
The Unfalsifiable Robustness Study: We design validation protocols that test parameters we already know are safe (e.g., pH +/- 0.1), avoiding the “cliff edges” where the method actually fails. We prove the method works where it works, rather than finding where it breaks.
The Unfalsifiable Risk Assessment: We write FMEAs where the conclusion (“The risk is acceptable”) is decided in advance, and the RPN scores are reverse-engineered to justify it.
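
To make the alert-limit point concrete, here is a minimal sketch. The CFU counts are invented for illustration, and the run rule (eight consecutive results above the historical mean) is one common control-chart convention that I am assuming here, not something prescribed by any guidance discussed in this post. Treating counts with a mean and standard deviation is also a simplification.

```python
from statistics import mean, stdev

def three_sigma_alerts(baseline, new_data):
    """Indices of new points above mean + 3*SD of the baseline."""
    limit = mean(baseline) + 3 * stdev(baseline)
    return [i for i, x in enumerate(new_data) if x > limit]

def run_rule_alerts(baseline, new_data, run_length=8):
    """Indices where `run_length` consecutive points sit above the baseline mean."""
    center = mean(baseline)
    alerts, run = [], 0
    for i, x in enumerate(new_data):
        run = run + 1 if x > center else 0
        if run >= run_length:
            alerts.append(i)
    return alerts

# Hypothetical CFU counts: a stable baseline, then a slow upward creep.
baseline = [2, 3, 1, 4, 2, 3, 2, 5, 1, 3, 2, 4]
drifting = [3, 3, 4, 4, 4, 5, 5, 5, 6, 6]

print("3-sigma alerts:", three_sigma_alerts(baseline, drifting))
print("run-rule alerts:", run_rule_alerts(baseline, drifting))
```

With these made-up numbers the 3-sigma limit never fires, even though every new result sits above the historical mean. The run rule flags the creep from the eighth point onward. The drift was always in the data; the limit was simply built not to see it.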

This is “Safety Theater,” a term Dekker uses to describe the rituals organizations perform to look safe rather than be safe.

Safety-I vs. Safety-II

In November’s post “Sidney Dekker: The Safety Scientist Who Influences How I Think About Quality,” I explored the distinction Dekker builds on, first drawn by Erik Hollnagel, between Safety-I (minimizing things that go wrong) and Safety-II (understanding how things usually go right).

Traditional Quality Assurance is obsessed with Safety-I. We count deviations. We count OOS results. We count complaints. When those counts are low, we assume the system is healthy.
But as the LeMaitre Vascular warning letter showed us this year (discussed in Part III), a system can have “zero deviations” simply because it has stopped looking for them. LeMaitre had excellent water data—because they were cleaning the valves before they sampled them. They were measuring their ritual, not their water.

Falsifiable Quality is the bridge to Safety-II. It demands that we treat every batch record not as a compliance artifact, but as a hypothesis test.

Hypothesis: “The contamination control strategy is effective.”
Test: Aggressive monitoring in worst-case locations, not just the “representative” center of the room.
Result: If we find nothing, the hypothesis survives another day. If we find something, we have successfully falsified the hypothesis—which is a good thing because it reveals reality.

The shift from “fearing the deviation” to “seeking the falsification” is a cultural pivot point of 2025.

The Epistemological Crisis in the Lab (Method Validation)

(Reflecting on: USP <1225>, Method Qualification vs. Validation, and Lifecycle Management)

Nowhere was the battle for Falsifiable Quality fought more fiercely in 2025 than in the analytical laboratory.

The proposed revision to USP <1225> Validation of Compendial Procedures (published in Pharmacopeial Forum 51(6)) arrived late in the year, but it serves as the perfect capstone to the arguments I’ve been making since January.

For forty years, analytical validation has been the ultimate exercise in “Validation as an Event.” You develop a method. You write a protocol. You execute the protocol over three days with your best analyst and fresh reagents. You print the report. You bind it. You never look at it again.

This model is unfalsifiable. It assumes that because the method worked in the “Work-as-Imagined” conditions of the validation study, it will work in the “Work-as-Done” reality of routine QC for the next decade.

The Reportable Result: Validating Decisions, Not Signals

The revised USP <1225>, aligned with ICH Q14 (Analytical Procedure Development) and USP <1220> (The Lifecycle Approach), destroys this assumption. It introduces concepts that force falsifiability into the lab.

The most critical of these is the Reportable Result.

Historically, we validated “the instrument” or “the measurement.” We proved that the HPLC could inject the same sample ten times with < 1.0% RSD.

But the Reportable Result is the final value used for decision-making—the value that appears on the Certificate of Analysis. It is the product of a complex chain: Sampling -> Transport -> Storage -> Preparation -> Dilution -> Injection -> Integration -> Calculation -> Averaging.

Validating the injection precision (the end of the chain) tells us nothing about the sampling variability (the beginning of the chain).

By shifting focus to the Reportable Result, USP <1225> forces us to ask: “Does this method generate decisions we can trust?”

The Replication Strategy: Validating “Work-as-Done”

The new guidance insists that validation must mimic the replication strategy of routine testing.
If your SOP says “We report the average of 3 independent preparations,” then your validation must evaluate the precision and accuracy of that average, not of the individual preparations.

This seems subtle, but it is revolutionary. It prevents the common trick of “averaging away” variability during validation to pass the criteria, only to face OOS results in routine production because the routine procedure doesn’t use the same averaging scheme.

It forces the validation study to mirror the messy reality of the “Work-as-Done,” making the validation data a falsifiable predictor of routine performance, rather than a theoretical maximum capability.
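
The arithmetic behind the replication argument can be sketched in a few lines. This assumes a simple nested variance model (preparation variability plus injection variability), and the two standard deviations are hypothetical values chosen for illustration, not figures from USP <1225>.

```python
import math

def reportable_sd(sd_prep, sd_inj, n_preps, n_inj_per_prep):
    """SD of a reportable result defined as the mean of n_preps preparations,
    each being the mean of n_inj_per_prep injections (simple nested model)."""
    variance = sd_prep**2 / n_preps + sd_inj**2 / (n_preps * n_inj_per_prep)
    return math.sqrt(variance)

# Hypothetical variance components, in % of label claim.
sd_prep, sd_inj = 1.2, 0.4

# The scheme exercised in validation vs. a leaner scheme used in routine QC.
validated = reportable_sd(sd_prep, sd_inj, n_preps=3, n_inj_per_prep=2)
routine = reportable_sd(sd_prep, sd_inj, n_preps=1, n_inj_per_prep=1)

print(f"mean of 3 preps x 2 injections: SD = {validated:.2f}%")
print(f"single prep, single injection:  SD = {routine:.2f}%")
```

Under these assumptions the validated scheme shows an SD of roughly 0.7%, while the single-preparation routine scheme sits near 1.3%. Same method, same lab, nearly double the spread in the number that reaches the Certificate of Analysis. That gap is what the replication requirement is meant to expose.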

Method Qualification vs. Validation: The June Distinction

In June, I wrote “Method Qualification and Validation,” clarifying a distinction that often confuses the industry.

Qualification is the “discovery phase” where we explore the method’s limits. It is inherently falsifiable—we want to find where the method breaks.
Validation has traditionally been the “confirmation phase” where we prove it works.

The danger, as I noted in that post, is when we skip the falsifiable Qualification step and go straight to Validation. We write the protocol based on hope, not data.

USP <1225> essentially argues that Validation must retain the falsifiable spirit of Qualification. It is not a coronation; it is a stress test.

The Death of “Method Transfer” as We Know It

In a Falsifiable Quality system, a method is never “done.” The Analytical Target Profile (ATP)—a concept from ICH Q14 that permeates the new thinking—is a standing hypothesis: “This method measures Potency within +/- 2%.”

Every time we run a system suitability check, every time we run a control standard, we are testing that hypothesis.

If the method starts drifting—even if it still passes broad system suitability limits—a falsifiable system flags the drift. An unfalsifiable system waits for the OOS.
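
As a rough illustration of what "flagging the drift" could look like, the sketch below runs an exponentially weighted moving average (EWMA) over control standard recoveries. The recoveries, the target of 100%, the historical SD of 0.25%, and the 98.0 to 102.0% suitability window are all hypothetical, and EWMA is only one of several trending tools that could serve here.

```python
def ewma_drift(values, target, sigma, lam=0.2, width=3.0):
    """Return the index of the first point where the EWMA leaves its
    control band around `target`, or None if it never does."""
    z = target
    for i, x in enumerate(values, start=1):
        z = lam * x + (1 - lam) * z
        # Exact EWMA variance after i observations.
        var = sigma**2 * (lam / (2 - lam)) * (1 - (1 - lam) ** (2 * i))
        if abs(z - target) > width * var**0.5:
            return i - 1
    return None

# Hypothetical control standard recoveries (%), creeping upward.
recoveries = [100.1, 99.9, 100.2, 100.0, 100.3, 100.4,
              100.5, 100.6, 100.7, 100.8, 100.9, 101.0]

in_suitability = all(98.0 <= r <= 102.0 for r in recoveries)
first_signal = ewma_drift(recoveries, target=100.0, sigma=0.25)

print("all within suitability window:", in_suitability)
print("EWMA first signals at index:", first_signal)
```

Every one of these invented results passes the suitability window, yet the EWMA signals at the eighth run. A falsifiable system would open an inquiry there; an unfalsifiable one would keep filing passing results until the first OOS.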

The draft revision of USP <1225> is a call to arms. It asks us to stop treating validation as a “ticket to ride”—a one-time toll we pay to enter GMP compliance—and start treating it as a “ticket to doubt.” Validation gives us permission to use the method, but only as long as the data continues to support the hypothesis of fitness.

The Reality Check (The “Unholy Trinity” of Warning Letters)

Philosophy and guidelines are fine, but in 2025, reality kicked in the door. The regulatory year was defined by three critical warning letters—Sanofi, LeMaitre, and Rechon—that collectively dismantled the industry’s illusions of control.

It began, as these things often do, with a ghost from the past.

Sanofi Framingham: The Pendulum Swings Back

(Reflecting on: Failure to Investigate Critical Deviations and The Sanofi Warning Letter)

The year opened with a shock. On January 15, 2025, the FDA issued a warning letter to Sanofi’s Framingham facility, the sister site to the legacy Genzyme Allston Landing plant, whose consent decree defined an entire generation of biotech compliance and of my career.

In my January analysis (“Failure to Investigate Critical Deviations: A Cautionary Tale”), I noted that the FDA’s primary citation was a failure to “thoroughly investigate any unexplained discrepancy.”

This is the cardinal sin of Falsifiable Quality.

An “unexplained discrepancy” is a signal from reality. It is the system telling you, “Your hypothesis about this process is wrong.”

The Falsifiable Response: You dive into the discrepancy. You assume your control strategy missed something. You use Causal Reasoning (the topic of my May post) to find the mechanism of failure.
The Sanofi Response: As the warning letter detailed, they frequently attributed failures to “isolated incidents” or superficial causes without genuine evidence.

This is the “Refusal to Falsify.” By failing to investigate thoroughly, the firm protects the comfortable status quo. They choose to believe the “Happy Path” (the process is robust) over the evidence (the discrepancy).

The Pendulum of Compliance

In my companion post (“Sanofi Warning Letter”), I discussed the “pendulum of compliance.” The Framingham site was supposed to be the fortress of quality, built on the lessons of the Genzyme crisis.

The failure at Sanofi wasn’t a lack of SOPs; it was a lack of curiosity.

The investigators likely had checklists, templates, and timelines (Compliance Theater), but they lacked the mandate, or perhaps the Expertise, to actually solve the problem.

This set the thematic stage for the rest of 2025. Sanofi showed us that “closing the deviation” is not the same as fixing the problem. This insight led directly into my August argument in “The Effectiveness Paradox”: You can close 100% of your deviations on time and still have a manufacturing process that is spinning out of control.

If Sanofi was the failure of investigation (looking back), Rechon and LeMaitre were failures of surveillance (looking forward). Together, they form a complete picture of why unfalsifiable systems fail.

(Reflecting on: Rechon Life Science and LeMaitre Vascular)

Two warning letters in 2025—Rechon Life Science (September) and LeMaitre Vascular (August)—provided brutal case studies in what happens when “representative sampling” is treated as a buzzword rather than a statistical requirement.

Rechon Life Science: The Map vs. The Territory

The Rechon Life Science warning letter was a significant regulatory signal of 2025 regarding sterile manufacturing. It wasn’t just a list of observations; it was an indictment of unfalsifiable Contamination Control Strategies (CCS).

We spent 2023 and 2024 writing massive CCS documents to satisfy Annex 1. Hundreds of pages detailing airflows, gowning procedures, and material flows. We felt good about them. We felt “compliant.”

Then the FDA walked into Rechon and essentially asked: “If your CCS is so good, why does your smoke study show turbulence over the open vials?”

The warning letter highlighted a disconnect I’ve called “The Map vs. The Territory.”

The Map: The CCS document says the airflow is unidirectional and protects the product.
The Territory: The smoke study video shows air eddying backward from the operator to the sterile core.

In an unfalsifiable system, we ignore the smoke study (or film it from a flattering angle) because it contradicts the CCS. We prioritize the documentation (the claim) over the observation (the evidence).

In a falsifiable system, the smoke study is the test. If the smoke shows turbulence, the CCS is falsified. We don’t defend the CCS; we rewrite it. We redesign the line.

The FDA’s critique of Rechon’s “dynamic airflow visualization” was devastating because it showed that Rechon was using the smoke study as a marketing video, not a diagnostic tool. They filmed “representative” operations that were carefully choreographed to look clean, rather than the messy reality of interventions.

LeMaitre Vascular: The Sin of “Aspirational Data”

If Rechon was about air, LeMaitre Vascular (analyzed in my August post “When Water Systems Fail“) was about water. And it contained an even more egregious sin against falsifiability.

The FDA observed that LeMaitre’s water sampling procedures required cleaning and purging the sample valves before taking the sample.

Let’s pause and consider the epistemology of this.

The Goal: To measure the quality of the water used in manufacturing.
The Reality: Manufacturing operators do not purge and sanitize the valve for 10 minutes before filling the tank. They open the valve and use the water.
The Sample: By sanitizing the valve before sampling, LeMaitre was measuring the quality of the sampling process, not the quality of the water system.

I call this “Aspirational Data.” It is data that reflects the system as we wish it existed, not as it actually exists. It is the ultimate unfalsifiable metric. You can never find biofilm in a valve if you scrub the valve with alcohol before you open it.

The FDA’s warning letter was clear: “Sampling… must include any pathway that the water travels to reach the process.”

LeMaitre also performed an unauthorized “Sterilant Switcheroo,” changing their sanitization agent without change control or biocompatibility assessment. This is the hallmark of an unfalsifiable culture: making changes based on convenience, assuming they are safe, and never designing the study to check if that assumption is wrong.

The “Representative” Trap

Both warning letters pivot on the misuse of the word “representative.”

Firms love to claim their EM sampling locations are “representative.” But representative of what? Usually, they are representative of the average condition of the room—the clean, empty spaces where nothing happens.

But contamination is not an “average” event. It is a specific, localized failure. A falsifiable EM program places probes in the “worst-case” locations: near the door, near the operator’s hands, near the crimping station. It tries to find contamination. It tries to falsify the claim that the zone is sterile, aseptic, or bioburden-reducing.

When Rechon and LeMaitre failed to justify their sampling locations, they were guilty of designing an unfalsifiable experiment. They placed the “microscope” where they knew they wouldn’t find germs.

2025 taught us that regulators are no longer impressed by the thickness of the CCS binder. They are looking for the logic of control. They are testing your hypothesis. And if you haven’t tested it yourself, you will fail.

The Investigation as Evidence

(Reflecting on: The Golden Start to a Deviation Investigation, Causal Reasoning, Take-the-Best Heuristics, and The Catalent Case)

If Rechon, LeMaitre, and Sanofi teach us anything, it is that the quality system’s ability to discover failure is more important than its ability to prevent failure.

A perfect manufacturing process that no one is looking at is indistinguishable from a collapsing process disguised by poor surveillance. But a mediocre process that is rigorously investigated, understood, and continuously improved is a path toward genuine control.

The investigation itself—how we respond to a deviation, how we reason about causation, how we design corrective actions—is where falsifiable quality either succeeds or fails.

The Golden Day: When Theory Meets Work-as-Done

In April, I published “The Golden Start to a Deviation Investigation,” which made a deceptively simple argument: The first 24 hours after a deviation is discovered are where your quality system either commits to discovering truth or retreats into theater.

This argument sits at the heart of falsifiable quality.

When a deviation occurs, you have a narrow window—what I call the “Golden Day”—where evidence is fresh, memories are intact, and the actual conditions that produced the failure still exist. If you waste this window with vague problem statements and abstract discussions, you permanently lose the ability to test causal hypotheses later.

The post outlined a structured protocol:

First, crystallize the problem. Not “potency was low”—but “Lot X234, potency measured at 87% on January 15th at 14:32, three hours after completion of blending in Vessel C-2.” Precision matters because only specific, bounded statements can be falsified. A vague problem statement can always be “explained away.”

Second, go to the Gemba. This is the antidote to “work-as-imagined” investigation. The SOP says the temperature controller should maintain 37°C +/- 2°C. But the Gemba walk reveals that the probe is positioned six inches from the heating element, the data logger is in a recessed pocket where humidity accumulates, and the operator checks it every four hours despite a requirement to check hourly. These are the facts that predict whether the deviation will recur.

Third, interview with cognitive discipline. Most investigations fail not because investigators lack information, but because they extract information poorly. Cognitive interviewing, developed by psychologists Ronald Fisher and Edward Geiselman and used by investigative bodies such as the FBI and the National Transportation Safety Board, relies on mental reinstatement, multiple perspectives, and sequential reordering to access accurate recall rather than confabulated narrative. The investigator asks the operator to walk through the event in different orders, from different viewpoints, each time triggering different memory pathways. This is not a “soft” technique; it is a mechanism for generating falsifiable evidence.

The Golden Day post makes it clear: You do not investigate deviations to document compliance. You investigate deviations to gather evidence about whether your understanding of the process is correct.

Causal Reasoning: Moving Beyond “What Was Missing”

Most investigation tools fail not because they are flawed, but because they are applied with the wrong mindset. In my May post “Causal Reasoning: A Transformative Approach to Root Cause Analysis,” I argued that pharmaceutical investigations are often trapped in “negative reasoning.”

Negative reasoning asks: “What barrier was missing? What should have been done but wasn’t?” This mindset leads to unfalsifiable conclusions like “Procedure not followed” or “Training was inadequate.” These are dead ends because they describe the absence of an ideal, not the presence of a cause.

Causal reasoning flips the script. It asks: “What was present in the system that made the observed outcome inevitable?”

Instead of settling for “human error,” causal reasoning demands we ask: What environmental cues made the action sensible to the operator at that moment? Were the instructions ambiguous? Did competing priorities make compliance impossible? Was the process design fragile?

This shift transforms the investigation from a compliance exercise into a scientific inquiry.

Consider the LeMaitre example:

Negative Reasoning: “Why didn’t they sample the true condition?” Answer: “Because they didn’t follow the intent of the sampling plan.”
Causal Reasoning: “What made the pre-cleaning practice sensible to them?” Answer: “They believed it ensured sample validity by removing valve residue.”

By understanding the why, we identify a knowledge gap that can be tested and corrected, rather than a negligence gap that can only be punished.

In September, “Take-the-Best Heuristic for Causal Investigation” provided a practical framework for this. Instead of listing every conceivable cause—a process that often leads to paralysis—the “Take-the-Best” heuristic directs investigators to focus on the most information-rich discriminators. These are the factors that, if different, would have prevented the deviation. This approach focuses resources where they matter most, turning the investigation into a targeted search for truth.

CAPA: Predictions, Not Promises

The Sanofi warning letter—analyzed in January—showed the destination of unfalsifiable investigation: CAPAs that exist mainly as paperwork.

Sanofi had investigation reports. They had “corrective actions.” But the FDA noted that deviations recurred in similar patterns, suggesting that the investigation had identified symptoms, not mechanisms, and that the “corrective” action had not actually addressed causation.

This is the sin of treating CAPA as a promise rather than a hypothesis.

A falsifiable CAPA is structured as an explicit prediction: “If we implement X change, then Y undesirable outcome will not recur under conditions Z.”

This can be tested. If it fails the test, the CAPA itself becomes evidence—not of failure, but of incomplete causal understanding. Which is valuable.
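
One way to picture a CAPA written as a prediction is as a small record with an explicit effectiveness check. The class name, fields, and thresholds below are hypothetical; this is a sketch of the idea, not a template from any regulation or from the warning letters discussed here.

```python
from dataclasses import dataclass

@dataclass
class CapaPrediction:
    change: str            # X: what we implement
    outcome: str           # Y: the failure we predict will not recur
    conditions: str        # Z: the conditions under which the prediction holds
    max_recurrences: int   # recurrences the prediction can tolerate
    min_batches: int       # evidence needed before "survived" means anything

    def evaluate(self, batches_observed, recurrences):
        """Classify the prediction against what actually happened."""
        if recurrences > self.max_recurrences:
            return "falsified"
        if batches_observed < self.min_batches:
            return "insufficient evidence"
        return "survived"

# Hypothetical CAPA for illustration only.
capa = CapaPrediction(
    change="Relocate temperature probe away from heating element",
    outcome="Low-potency result traced to overheating",
    conditions="Vessel C-2, all blending campaigns",
    max_recurrences=0,
    min_batches=20,
)

print(capa.evaluate(batches_observed=8, recurrences=0))
print(capa.evaluate(batches_observed=25, recurrences=0))
print(capa.evaluate(batches_observed=12, recurrences=1))
```

The design choice worth noticing is the ordering inside `evaluate`: a recurrence falsifies the prediction even before the minimum batch count is reached, while the absence of recurrence counts for nothing until enough batches have been observed. "Nothing bad happened yet" is not allowed to pass as effectiveness.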

In the Rechon analysis, this showed up concretely: The FDA’s real criticism was not just that contamination was found; it was that Rechon’s Contamination Control Strategy had no mechanism to falsify itself. If the CCS said “unidirectional airflow protects the product,” and smoke studies showed bidirectional eddies, the CCS had been falsified. But Rechon treated the falsification as an anomaly to be explained away, rather than evidence that the CCS hypothesis was wrong.

A falsifiable organization would say: “Our CCS predicted that Grade A in an isolator with this airflow pattern would remain sterile. The smoke study proves that prediction wrong. Therefore, the CCS is false. We redesign.”

Instead, they filmed from a different angle and said the aerodynamics were “acceptable.”

Knowledge Integration: When Deviations Become the Curriculum

The final piece of falsifiable investigation is what I call “knowledge integration.” A single deviation is a data point. But across the organization, deviations should form a curriculum about how systems actually fail.

Sanofi’s failure was not that they investigated each deviation badly (though they did). It was that they investigated them in isolation. Each deviation closed on its own. Each CAPA addressed its own batch. There was no organizational learning—no mechanism for a pattern of similar deviations to trigger a hypothesis that the control strategy itself was fundamentally flawed.

This is where the Catalent case study, analyzed in September’s “When 483s Reveal Zemblanity,” becomes instructive. Zemblanity is the opposite of serendipity: the seemingly random recurrence of the same failure through different paths. Catalent’s 483 observations were not isolated mistakes; they formed a pattern that revealed a systemic assumption (about equipment capability, about environmental control, about material consistency) that was false across multiple products and locations.

A falsifiable quality system catches zemblanity early by:

Treating each deviation as a test of organizational hypotheses, not as an isolated incident.
Trending deviation patterns to detect when the same causal mechanism is producing failures across different products, equipment, or operators.
Revising control strategies when patterns falsify the original assumptions, rather than tightening parameters at the margins.
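
The trending step in that list can be sketched very simply, assuming each closed investigation records a causal mechanism in a consistent vocabulary (which is itself a large assumption). The deviation records, field names, and mechanism labels below are invented for illustration.

```python
from collections import defaultdict

def recurring_mechanisms(deviations, min_contexts=2):
    """Map each causal mechanism to the distinct products it appeared in,
    keeping only mechanisms seen in at least `min_contexts` products."""
    seen = defaultdict(set)
    for dev in deviations:
        seen[dev["mechanism"]].add(dev["product"])
    return {m: sorted(p) for m, p in seen.items() if len(p) >= min_contexts}

# Hypothetical deviation log: each record closed on its own.
deviations = [
    {"id": "DEV-001", "product": "A", "mechanism": "gasket wear"},
    {"id": "DEV-002", "product": "B", "mechanism": "ambiguous hold-time instruction"},
    {"id": "DEV-003", "product": "C", "mechanism": "gasket wear"},
    {"id": "DEV-004", "product": "A", "mechanism": "ambiguous hold-time instruction"},
    {"id": "DEV-005", "product": "B", "mechanism": "label printer jam"},
]

print(recurring_mechanisms(deviations))
```

Viewed one at a time, each of these five records looks like an isolated incident. Grouped by mechanism, two of them turn out to be the same failure arriving by different routes, which is the pattern that should put the control strategy itself on trial.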

The Digital Hallucination (CSA, AI, and the Expertise Crisis)

(Reflecting on: CSA: The Emperor’s New Clothes, Annex 11, and The Expertise Crisis)

While we battled microbes in the cleanroom, a different battle was raging in the server room. 2025 was the year the industry tried to “modernize” validation through Computer Software Assurance (CSA) and AI, and in many ways, it was the year we tried to automate our way out of thinking.

CSA: The Emperor’s New Validation Clothes

In September, I published “Computer System Assurance: The Emperor’s New Validation Clothes,” a critique of the contortions being made around the FDA’s guidance. The narrative sold by consultants for years was that traditional Computer System Validation (CSV) was “broken” (too much documentation, too much testing) and that CSA was a revolutionary new paradigm of “critical thinking.”

My analysis showed that this narrative is historically illiterate.

The principles of CSA—risk-based testing, leveraging vendor audits, focusing on intended use—are not new. They are the core principles of GAMP5 and have been applied for decades now.

The industry didn’t need a new guidance to tell us to use critical thinking; we had simply chosen not to use the critical thinking tools we already had. We had chosen to apply “one-size-fits-all” templates because they were safe (unfalsifiable).

The CSA guidance is effectively the FDA saying: “Please read the GAMP5 guide you claimed to be following for the last 15 years.”

The danger of the “CSA Revolution” narrative is that it encourages a swing to the opposite extreme: “Unscripted Testing” that becomes “No Testing.”

In a falsifiable system, “unscripted testing” is highly rigorous—it is an expert trying to break the software (“Ad Hoc testing”). But in an unfalsifiable system, “unscripted testing” becomes “I clicked around for 10 minutes and it looked fine.”

The Expertise Crisis: AI and the Death of the Apprentice

This leads directly to the Expertise Crisis. In September, I wrote “The Expertise Crisis: Why AI’s War on Entry-Level Jobs Threatens Quality’s Future.” This was perhaps the most personal topic I covered this year, because it touches on the very survival of our profession.

We are rushing to integrate Artificial Intelligence (AI) into quality systems. We have AI writing deviations, AI drafting SOPs, AI summarizing regulatory changes. The efficiency gains are undeniable. But the cost is hidden, and it is epistemological.

Falsifiability requires expertise.
To falsify a claim—to look at a draft investigation report and say, “No, that conclusion doesn’t follow from the data”—you need deep, intuitive knowledge of the process. You need to know what a “normal” pH curve looks like so you can spot the “abnormal” one that the AI smoothed over.

Where does that intuition come from? It comes from the “grunt work.” It comes from years of reviewing batch records, years of interviewing operators, years of struggling to write a root cause analysis statement.

The Expertise Crisis is this: If we give all the entry-level work to AI, where will the next generation of Quality Leaders come from?

The Junior Associate doesn’t review the raw data; the AI summarizes it.
The Junior Associate doesn’t write the deviation; the AI generates the text.
Therefore, the Junior Associate never builds the mental models necessary to critique the AI.

The Loop of Unfalsifiable Hallucination

We are creating a closed loop of unfalsifiability.

The AI generates a plausible-sounding investigation report.
The human reviewer (who has been “de-skilled” by years of AI reliance) lacks the deep expertise to spot the subtle logical flaw or the missing data point.
The report is approved.
The “hallucination” becomes the official record.

In a falsifiable quality system, the human must remain the adversary of the algorithm. The human’s job is to try to break the AI’s logic, to check the citations, to verify the raw data.
But in 2025, we saw the beginnings of a “Compliance Autopilot”—a desire to let the machine handle the “boring stuff.”

My warning in September remains urgent: Efficiency without expertise is just accelerated incompetence. If we lose the ability to falsify our own tools, we are no longer quality professionals; we are just passengers in a car driven by a statistical model that doesn’t know what “truth” is.

My post “The Missing Middle in GMP Decision Making: How Annex 22 Redefines Human-Machine Collaboration in Pharmaceutical Quality Assurance” goes a lot deeper here.

Annex 11 and Data Governance

In August, I analyzed the draft Annex 11 (Computerised Systems) in the post “Data Governance Systems: A Fundamental Shift.”

The Europeans are ahead of the FDA here. While the FDA talks about “Assurance” (testing less), the EU is talking about “Governance” (controlling more). The new Annex 11 makes it clear: You cannot validate a system if you do not control the data lifecycle. Validation is not a test script; it is a state of control.

This aligns perfectly with USP <1225> and <1220>. Whether it’s a chromatograph or an ERP system, the requirement is the same: Prove that the data is trustworthy, not just that the software is installed.

The Process as a Hypothesis (CPV & Cleaning)

(Reflecting on: Continuous Process Verification and Hypothesis Formation)

The final frontier of validation we explored in 2025 was the manufacturing process itself.

CPV: Continuous Falsification

In March, I published “Continuous Process Verification (CPV) Methodology and Tool Selection.”
CPV is the ultimate expression of Falsifiable Quality in manufacturing.

Traditional Validation (3 Batches): “We made 3 good batches, therefore the process is perfect forever.” (Unfalsifiable extrapolation).
CPV: “We made 3 good batches, so we have a license to manufacture, but we will statistically monitor every subsequent batch to detect drift.” (Continuous hypothesis testing).

The challenge with CPV, as discussed in the post, is that it requires statistical literacy. You cannot implement CPV if your quality unit doesn’t understand the difference between Cpk and Ppk, or between control limits and specification limits.
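
For readers who want the distinction in numbers, here is a minimal sketch using individual batch results. It assumes the common convention of estimating short-term (within) sigma from the average moving range divided by 1.128, and overall sigma from the plain sample standard deviation. The assay values and the 98.0 to 102.0 specification are hypothetical.

```python
from statistics import mean, stdev

def capability(values, lsl, usl):
    """Cpk (short-term sigma), Ppk (overall sigma), and control limits
    for individual results."""
    mu = mean(values)
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    sigma_within = mean(moving_ranges) / 1.128   # d2 constant for n = 2
    sigma_overall = stdev(values)

    def index(sigma):
        return min(usl - mu, mu - lsl) / (3 * sigma)

    return {
        "Cpk": index(sigma_within),
        "Ppk": index(sigma_overall),
        "control_limits": (mu - 3 * sigma_within, mu + 3 * sigma_within),
    }

# Hypothetical assay results (%): a step shift occurs after the sixth batch.
assay = [99.8, 100.1, 99.9, 100.2, 100.0, 99.9,
         101.0, 101.2, 100.9, 101.1, 101.0, 101.2]

result = capability(assay, lsl=98.0, usl=102.0)
print(f"Cpk = {result['Cpk']:.2f}, Ppk = {result['Ppk']:.2f}")
print("control limits:", tuple(round(x, 2) for x in result["control_limits"]))
```

With this invented data set Cpk comes out well above 1 while Ppk falls below it, because the step shift inflates the overall spread but barely touches the batch-to-batch moving ranges. The control limits also sit well inside the specification limits: the first describes what the process does, the second what the patient needs. A dashboard that reports only one of these numbers, read by someone who cannot tell them apart, is exactly the risk described above.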

This circles back to the Expertise Crisis. We are implementing complex statistical tools (CPV software) at the exact moment we are de-skilling the workforce. We risk creating a “CPV Dashboard” that turns red, but no one knows why or what to do about it.

Cleaning Validation: The Science of Residue

In August, I tried to apply falsifiability to one of the most stubborn areas of dogma: Cleaning Validation.

In “Building Decision-Making with Structured Hypothesis Formation,” I argued that cleaning validation should not be about “proving it’s clean.” It should be about “understanding why it gets dirty.”

Traditional Approach: Swab 10 spots. If they pass, we are good.
Hypothesis Approach: “We hypothesize that the gasket on the bottom valve is the hardest to clean. We predict that if we reduce rinse time by 1 minute, that gasket will fail.”

By testing the boundaries—by trying to make the cleaning fail—we understand the Design Space of the cleaning process.

We discussed the “Visual Inspection” paradox in cleaning: If you can see the residue, it failed. But if you can’t see it, does it pass?

Only if you have scientifically determined the Visible Residue Limit (VRL). Using “visually clean” without a validated VRL is—you guessed it—unfalsifiable.

The Plastic Paradox (Single-Use Systems and the E&L Mirage)

If the Rechon and LeMaitre warning letters were about the failure to control biological contaminants we can find, the industry’s struggle with Single-Use Systems (SUS) in 2025 was about the chemical contaminants we choose not to find.

We have spent the last decade aggressively swapping stainless steel for plastic. The value proposition was irresistible: Eliminate cleaning validation, eliminate cross-contamination, increase flexibility. We traded the “devil we know” (cleaning residue) for the “devil we don’t” (Extractables and Leachables).

But in 2025, with the enforcement reality of USP <665> (Plastic Components and Systems) settling in, we had to confront the uncomfortable truth: Most E&L risk assessments are unfalsifiable.

The Vendor Data Trap

The standard industry approach to E&L is the ultimate form of “Compliance Theater.”

We buy a single-use bag.
We request the vendor’s regulatory support package (the “Map”).
We see that the vendor extracted the film with aggressive solvents (ethanol, hexane) for 7 days.
We conclude: “Our process uses water for 24 hours; therefore, we are safe.”

This logic is epistemologically bankrupt. It assumes that the Vendor’s Model (aggressive solvents, a fixed extraction time) maps perfectly to the User’s Reality (complex buffers, actual hold times, specific surfactants).

It ignores the fact that plastics are dynamic systems. Polymers age. Gamma irradiation initiates free radical cascades that evolve over months. A bag manufactured in January might have a different leachable profile than a bag manufactured in June, especially if the resin supplier made a “minor” change that didn’t trigger a notification.

By relying solely on the vendor’s static validation package, we are choosing not to falsify our safety hypothesis. We are effectively saying, “If the vendor says it’s clean, we will not look for dirt.”

USP <665>: A Baseline, Not a Ceiling

The full adoption of USP <665> was supposed to bring standardization. And it has—it provides a standard set of extraction conditions. But standards can become ceilings.

In 2025, I observed a troubling trend of “Compliance by Citation.” Firms are citing USP <665> compliance as proof of absence of risk, stopping the inquiry there.

A Falsifiable E&L Strategy goes further. It asks:

“What if the vendor data is irrelevant to my specific surfactant?”
“What if the gamma irradiation dose varied?”
“What if the interaction between the tubing and the connector creates a new species?”

The Invisible Process Aid

We must stop viewing Single-Use Systems as inert piping. They are active process components. They are chemically reactive vessels that participate in our reaction kinetics.

When we treat them as inert, we are engaging in the same “Aspirational Thinking” that LeMaitre used on their water valves. We are modeling the system we want (pure, inert plastic), not the system we have (a complex soup of antioxidants, slip agents, and degradants).

The lesson of 2025 is that Material Qualification cannot be a paper exercise. If you haven’t done targeted simulation studies that mimic your actual “Work-as-Done” conditions, you haven’t validated the system. You’ve just filed the receipt.

The Mandate for 2026

As we look toward 2026, the path is clear. We cannot go back to the comfortable fiction of the pre-2025 era.

The regulatory environment (Annex 1, ICH Q14, USP <1225>, Annex 11) is explicitly demanding evidence of control, not just evidence of compliance. The technological environment (AI) is demanding that we sharpen our human expertise to avoid becoming obsolete. The physical environment (contamination, supply chain complexity) is demanding systems that are robust, not just rigid.

The mandate for the coming year is to build Falsifiable Quality Systems.

What does that look like practically?

In the Lab: Implement USP <1225> logic now. Don’t wait for the official date. Validate your reportable results. Add “challenge tests” to your routine monitoring.
In the Plant: Redesign your Environmental Monitoring to hunt for contamination, not to avoid it. If you have a “perfect” record in a Grade C area, move the plates until you find the dirt.
In the Office: Treat every investigation as a chance to falsify the control strategy. If a deviation occurs that the control strategy said was impossible, update the control strategy.
In the Culture: Reward the messenger. The person who finds the crack in the system is not a troublemaker; they are the most valuable asset you have. They just falsified a false sense of security.
In Design: Embrace the “Elegant Quality System” (discussed in May). Complexity is the enemy of falsifiability. Complex systems hide failures; simple, elegant systems reveal them.

2025 was the year we stopped pretending. 2026 must be the year we start building. We must build systems that are honest enough to fail, so that we can build processes that are robust enough to endure.

Thank you for reading, challenging, and thinking with me this year. The investigation continues.

Computer System Assurance: The Emperor’s New Validation Clothes

How the Quality Industry Repackaged Existing Practices and Called Them Revolutionary

As someone who has spent decades implementing computer system validation practices across multiple regulated environments, I consistently find myself skeptical of the breathless excitement surrounding Computer System Assurance (CSA). The pharmaceutical quality community’s enthusiastic embrace of CSA as a revolutionary departure from traditional Computer System Validation (CSV) represents a troubling case study in how our industry allows consultants to rebrand established practices as breakthrough innovations, selling back to us concepts we’ve been applying for over two decades.

The truth is both simpler and more disappointing than the CSA evangelists would have you believe: there is nothing fundamentally new in computer system assurance that wasn’t already embedded in risk-based validation approaches, GAMP5 principles, or existing regulatory guidance. What we’re witnessing is not innovation, but sophisticated marketing—a coordinated effort to create artificial urgency around “modernizing” validation practices that were already fit for purpose.

The Historical Context: Why We Need to Remember Where We Started

To understand why CSA represents more repackaging than revolution, we must revisit the regulatory and industry context from which our current validation practices emerged. Computer system validation didn’t develop in a vacuum—it arose from genuine regulatory necessity in response to real-world failures that threatened patient safety and product quality.

The origins of systematic software validation in regulated industries trace back to military applications in the 1960s, specifically independent verification and validation (IV&V) processes developed for critical defense systems. The pharmaceutical industry’s adoption of these concepts began in earnest during the 1970s as computerized systems became more prevalent in drug manufacturing and quality control operations.

The regulatory foundation for what we now call computer system validation was established through a series of FDA guidance documents throughout the 1980s and 1990s. The 1983 FDA “Guide to Inspection of Computerized Systems in Drug Processing” represented the first systematic approach to ensuring the reliability of computer-based systems in pharmaceutical manufacturing. This was followed by increasingly sophisticated guidance, culminating in 21 CFR Part 11 in 1997 and the “General Principles of Software Validation” in 2002.

These regulations didn’t emerge from academic theory—they were responses to documented failures. The FDA’s analysis of 3,140 medical device recalls between 1992 and 1998 revealed that 242 (7.7%) were attributable to software failures, with 192 of those (79%) caused by defects introduced during software changes after initial deployment. Computer system validation developed as a systematic response to these real-world risks, not as an abstract compliance exercise.

The GAMP Evolution: Building Risk-Based Practices from the Ground Up

Perhaps no single development better illustrates how the industry has already solved the problems CSA claims to address than the evolution of the Good Automated Manufacturing Practice (GAMP) guidelines. GAMP didn’t start as a theoretical framework; it emerged from practical necessity when FDA inspectors began raising concerns about computer system validation during inspections of UK pharmaceutical facilities in 1991.

The GAMP community’s response was methodical and evidence-based. Rather than creating bureaucratic overhead, GAMP sought to provide a practical framework that would satisfy regulatory requirements while enabling business efficiency. Each revision of GAMP incorporated lessons learned from real-world implementations:

GAMP 1 (1994) focused on standardizing validation activities for computerized systems, addressing the inconsistency that characterized early validation efforts.

GAMP 2 and 3 (1995-1998) introduced early concepts of risk-based approaches and expanded scope to include IT infrastructure, recognizing that validation needed to be proportional to risk rather than uniformly applied.

GAMP 4 (2001) emphasized a full system lifecycle model and defined clear validation deliverables, establishing the structured approach that remains fundamentally unchanged today.

GAMP 5 (2008) represented a decisive shift toward risk-based validation, promoting scalability and efficiency while maintaining regulatory compliance. This version explicitly recognized that validation effort should be proportional to the system’s impact on product quality, patient safety, and data integrity.

The GAMP 5 software categorization system (Categories 1, 3, 4, and 5, with Category 2 eliminated as obsolete) provided the risk-based framework that CSA proponents now claim as innovative. A Category 1 infrastructure software requires minimal validation beyond verification of installation and version control, while a Category 5 custom application demands comprehensive lifecycle validation including detailed functional and design specifications. This isn’t just risk-based thinking—it’s risk-based practice that has been successfully implemented across thousands of systems for over fifteen years.

The Risk-Based Spectrum: What GAMP Already Taught Us

One of the most frustrating aspects of CSA advocacy is how it presents risk-based validation as a novel concept. The pharmaceutical industry has been applying risk-based approaches to computer system validation since the early 2000s, not as a revolutionary breakthrough, but as basic professional competence.

The foundation of risk-based validation rests on a simple principle: validation rigor should be proportional to the potential impact on product quality, patient safety, and data integrity. This principle was explicitly articulated in ICH Q9 (Quality Risk Management) and embedded throughout GAMP 5, creating what is effectively a validation spectrum rather than a binary validated/not-validated state.

At the lower end of this spectrum, we find systems with minimal GMP impact—infrastructure software, standard office applications used for non-GMP purposes, and simple monitoring tools that generate no critical data. For these systems, validation consists primarily of installation verification and fitness-for-use confirmation, with minimal documentation requirements.

In the middle of the spectrum are configurable commercial systems—LIMS, ERP modules, and manufacturing execution systems that require configuration to meet specific business needs. These systems demand functional testing of configured elements, user acceptance testing, and ongoing change control, but can leverage supplier documentation and industry standard practices to streamline validation efforts.

At the high end of the spectrum are custom applications and systems with direct impact on batch release decisions, patient safety, or regulatory submissions. These systems require comprehensive validation including detailed functional specifications, extensive testing protocols, and rigorous change control procedures.

The elegance of this approach is that it scales validation effort appropriately while maintaining consistent quality outcomes. A risk assessment determines where on the spectrum a particular system falls, and validation activities align accordingly. This isn’t theoretical—it’s been standard practice in well-run validation programs for over a decade.
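The spectrum described above can be sketched as a simple lookup. This is an illustration under stated assumptions, not GAMP’s own decision logic: the `Rigor` tiers and the `validation_rigor` function are hypothetical names, the activities are paraphrased from the three paragraphs above, and placing Category 3 at the low end absent direct GMP impact is my assumption.

```python
from enum import Enum


class Rigor(Enum):
    """Three illustrative points on the validation spectrum (hypothetical tiers)."""
    LOW = "installation verification and fitness-for-use confirmation"
    MEDIUM = "functional testing of configured elements, user acceptance testing, change control"
    HIGH = "detailed functional specifications, extensive testing protocols, rigorous change control"


VALID_CATEGORIES = {1, 3, 4, 5}  # GAMP 5 software categories; Category 2 was retired


def validation_rigor(gamp_category: int, direct_gmp_impact: bool) -> Rigor:
    """Place a system on the spectrum from its category and its GMP impact."""
    if gamp_category not in VALID_CATEGORIES:
        raise ValueError(f"not a GAMP 5 software category: {gamp_category}")
    # Custom applications, or anything touching batch release, patient safety,
    # or regulatory submissions, sit at the high end.
    if gamp_category == 5 or direct_gmp_impact:
        return Rigor.HIGH
    # Configurable commercial systems (LIMS, ERP modules, MES) sit in the middle.
    if gamp_category == 4:
        return Rigor.MEDIUM
    # Assumption: infrastructure and non-configured software without direct
    # GMP impact sit at the low end.
    return Rigor.LOW


for category, impact in [(1, False), (4, False), (4, True), (5, False)]:
    tier = validation_rigor(category, impact)
    print(f"Category {category}, direct impact={impact}: {tier.name} ({tier.value})")
```

In practice the risk assessment behind `direct_gmp_impact` is where the real judgment lives; the lookup only records its outcome.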

The 2003 FDA Guidance: The CSA Framework Hidden in Plain Sight

Perhaps the most damning evidence that CSA represents repackaging rather than innovation lies in the 2003 FDA guidance “Part 11, Electronic Records; Electronic Signatures — Scope and Application.” This guidance, issued over twenty years ago, contains virtually every principle that CSA advocates now present as revolutionary insights.

The 2003 guidance established several critical principles that directly anticipate CSA approaches:

Narrow Scope Interpretation: The FDA explicitly stated that Part 11 would only be enforced for records required to be kept where electronic versions are used in lieu of paper, avoiding the over-validation that characterized early Part 11 implementations.
Risk-Based Enforcement: Rather than treating Part 11 as a checklist, the FDA indicated that enforcement priorities would be risk-based, focusing on systems where failures could compromise data integrity or patient safety.
Legacy System Pragmatism: The guidance exercised discretion for systems implemented before 1997, provided they were fit for purpose and maintained data integrity.
Focus on Predicate Rules: Companies were encouraged to focus on fulfilling underlying regulatory requirements rather than treating Part 11 as an end in itself.
Innovation Encouragement: The guidance explicitly stated that “innovation should not be stifled” by fear of Part 11, encouraging adoption of new technologies provided they maintained appropriate controls.

These principles—narrow scope, risk-based approach, pragmatic implementation, focus on underlying requirements, and innovation enablement—constitute the entire conceptual framework that CSA now claims as its contribution to validation thinking. The 2003 guidance didn’t just anticipate CSA; it embodied CSA principles in FDA policy over two decades before the “Computer Software Assurance” marketing campaign began.

The EU Annex 11 Evolution: Proof That the System Was Already Working

The evolution of EU GMP Annex 11 provides another powerful example of how existing regulatory frameworks have continuously incorporated the principles that CSA now claims as innovations. The current Annex 11, dating from 2011, already included most elements that CSA advocates present as breakthrough thinking.

The original Annex 11 established several key principles that remain relevant today:

Risk-Based Validation: Clause 1 requires that “Risk management should be applied throughout the lifecycle of the computerised system taking into account patient safety, data integrity and product quality”—a clear articulation of risk-based thinking.
Supplier Assessment: The regulation required assessment of suppliers and their quality systems, anticipating the “trusted supplier” concepts that CSA emphasizes.
Lifecycle Management: Annex 11 required that systems be validated and maintained in a validated state throughout their operational life.
Change Control: The regulation established requirements for managing changes to validated systems.
Data Integrity: Electronic records requirements anticipated many of the data integrity concerns that now drive validation practices.

The 2025 draft revision of Annex 11 represents evolution, not revolution. While the document has expanded significantly, most additions address technological developments—cloud computing, artificial intelligence, cybersecurity—rather than fundamental changes in validation philosophy. The core principles remain unchanged: risk-based validation, lifecycle management, supplier oversight, and data integrity protection.

Importantly, the draft Annex 11 demonstrates regulatory convergence rather than divergence. The revision aligns more closely with FDA CSA guidance, GAMP 5 second edition, ICH Q9, and ISO 27001. This alignment doesn’t validate CSA as revolutionary—it demonstrates that global regulators recognize the maturity and effectiveness of existing validation approaches.

The FDA CSA Final Guidance: Official Release and the Repackaging of Established Principles

On September 24, 2025, the FDA officially published its final guidance on “Computer Software Assurance for Production and Quality System Software,” marking the culmination of a three-year journey from draft to final policy. This final guidance, while presented as a modernization breakthrough by consulting industry advocates, provides perhaps the clearest evidence yet that CSA represents sophisticated rebranding rather than genuine innovation.

The Official Position: Supplement, Not Revolution

The FDA’s own language reveals the evolutionary rather than revolutionary nature of CSA. The guidance explicitly states that it “supplements FDA’s guidance, ‘General Principles of Software Validation’” with one notable exception: “this guidance supersedes Section 6: Validation of Automated Process Equipment and Quality System Software of the Software Validation guidance.”

This measured approach directly contradicts the consulting industry narrative that positions CSA as a wholesale replacement for traditional validation approaches. The FDA is not abandoning established software validation principles—it is refining their application to production and quality system software while maintaining the fundamental framework that has served the industry effectively for over two decades.

What Actually Changed: Evolutionary Refinement

The final guidance incorporates several refinements that demonstrate the FDA’s commitment to practical implementation rather than theoretical innovation:

Risk-Based Framework Formalization: The guidance provides explicit criteria for determining “high process risk” versus “not high process risk” software functions, creating a binary classification system that simplifies risk assessment while maintaining proportionate validation effort. However, this risk-based thinking merely formalizes the spectrum approach that mature GAMP implementations have applied for years.

Cloud Computing Integration: The guidance addresses Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS) deployments, providing clarity on when cloud-based systems require validation. This represents adaptation to technological evolution rather than philosophical innovation—the same risk-based principles apply regardless of deployment model.

Unscripted Testing Validation: The guidance explicitly endorses “unscripted testing” as an acceptable validation approach, encouraging “exploratory, ad hoc, and unscripted testing methods” when appropriate. This acknowledgment of testing methods that experienced practitioners have used for years represents regulatory catch-up rather than breakthrough thinking.

Digital Evidence Acceptance: The guidance states that “FDA recommends incorporating the use of digital records and digital signature capabilities rather than duplicating results already digitally retained,” providing regulatory endorsement for practices that reduce documentation burden. Again, this formalizes efficiency measures that sophisticated organizations have implemented within existing frameworks.

The Definitional Games: CSA Versus CSV

The final guidance provides perhaps the most telling evidence of CSA’s repackaging nature through its definition of Computer Software Assurance: “a risk-based approach for establishing and maintaining confidence that software is fit for its intended use”. This definition could have been applied to effective computer system validation programs throughout the past two decades without modification.

The guidance emphasizes that CSA “follows a least-burdensome approach, where the burden of validation is no more than necessary to address the risk”. This principle was explicitly articulated in ICH Q9 (Quality Risk Management) published in 2005 and embedded in GAMP 5 guidance from 2008. The FDA is not introducing least-burdensome thinking—it is providing regulatory endorsement for principles that the industry has applied successfully for over fifteen years.

More significantly, the guidance acknowledges that CSA “establishes and maintains that the software used in production or the quality system is in a state of control throughout its life cycle (‘validated state’)”. The concept of maintaining validated state through lifecycle management represents core computer system validation thinking that predates CSA by decades.

Practical Examples: Repackaged Wisdom

The final guidance includes four detailed examples in Appendix A that demonstrate CSA application to real-world scenarios: Nonconformance Management Systems, Learning Management Systems, Business Intelligence Applications, and Software as a Service (SaaS) Product Life Cycle Management Systems. These examples provide valuable practical guidance, but they illustrate established validation principles rather than innovative approaches.

Consider the Nonconformance Management System example, which demonstrates risk assessment, supplier evaluation, configuration testing, and ongoing monitoring. Each element represents standard GAMP-based validation practice:

Risk Assessment: Determining that failure could impact product quality aligns with established risk-based validation principles
Supplier Evaluation: Assessing vendor development practices and quality systems follows GAMP supplier guidance
Configuration Testing: Verifying that system configuration meets business requirements represents basic user acceptance testing
Ongoing Monitoring: Maintaining validated state through change control and periodic review embodies lifecycle management concepts

The Business Intelligence Applications example similarly demonstrates established practices repackaged with CSA terminology. The guidance recommends focusing validation effort on “data integrity, accuracy of calculations, and proper access controls”—core concerns that experienced validation professionals have addressed routinely using GAMP principles.

The Regulatory Timing: Why Now?

The timing of the final CSA guidance publication reveals important context about regulatory motivation. The guidance development began in earnest in 2022, coinciding with increasing industry pressure to address digital transformation challenges, cloud computing adoption, and artificial intelligence integration in manufacturing environments.

However, the three-year development timeline suggests careful consideration rather than urgent need for wholesale validation reform. If existing validation approaches were fundamentally inadequate, we would expect more rapid regulatory response to address patient safety concerns. Instead, the measured development process indicates that the FDA recognized the adequacy of existing approaches while seeking to provide clearer guidance for emerging technologies.

The final guidance explicitly states that FDA “believes that applying a risk-based approach to computer software used as part of production or the quality system would better focus manufacturers’ quality assurance activities to help ensure product quality while helping to fulfill validation requirements”. This language acknowledges that existing approaches fulfill regulatory requirements—the guidance aims to optimize resource allocation rather than address compliance failures.

The Consulting Industry’s Role in Manufacturing Urgency

To understand why CSA has gained traction despite offering little genuine innovation, we must examine the economic incentives that drive consulting industry behavior. The computer system validation consulting market represents hundreds of millions of dollars annually, with individual validation projects ranging from tens of thousands to millions of dollars depending on system complexity and organizational scope.

This market faces a fundamental problem: mature practices don’t generate consulting revenue. If organizations understand that their current GAMP-based validation approaches are fundamentally sound and regulatory-compliant, they’re less likely to engage consultants for expensive “modernization” projects. CSA provides the solution to this problem by creating artificial urgency around practices that were already fit for purpose.

The CSA marketing campaign follows a predictable pattern that the consulting industry has used repeatedly across different domains:

Step 1: Problem Creation. Traditional CSV is portrayed as outdated, burdensome, and potentially non-compliant with evolving regulatory expectations. This creates anxiety among quality professionals who fear falling behind industry best practices.

Step 2: Solution Positioning. CSA is presented as the modern, efficient, risk-based alternative that leading organizations are already adopting. Early adopters are portrayed as innovative leaders, while traditional practitioners risk being perceived as laggards.

Step 3: Urgency Amplification. Regulatory changes (like the Annex 11 revision) are leveraged to suggest that traditional approaches may become non-compliant, requiring immediate action.

Step 4: Capability Marketing. Consulting firms position themselves as experts in the “new” CSA approach, offering training, assessment services, and implementation support for organizations seeking to “modernize” their validation practices.

This pattern is particularly insidious because it exploits legitimate professional concerns. Quality professionals genuinely want to ensure their practices remain current and effective. However, the CSA campaign preys on these concerns by suggesting that existing practices are inadequate when, in fact, they remain perfectly sufficient for regulatory compliance and business effectiveness.

The False Dichotomy: CSV Versus CSA

Perhaps the most misleading aspect of CSA promotion is the suggestion that organizations must choose between “traditional CSV” and “modern CSA” approaches. This creates a false dichotomy that obscures the reality: well-implemented GAMP-based validation programs already incorporate every principle that CSA advocates as innovative.

Consider the claimed distinctions between CSV and CSA:

Critical Thinking Over Documentation: CSA proponents suggest that traditional CSV focuses on documentation production rather than system quality. However, GAMP 5 has emphasized risk-based thinking and proportionate documentation for over fifteen years. Organizations producing excessive documentation were implementing GAMP poorly, not following its actual guidance.
Testing Over Paperwork: The claim that CSA prioritizes testing effectiveness over documentation completeness misrepresents both approaches. GAMP has always emphasized that validation should provide confidence in system performance, not just documentation compliance. The GAMP software categories explicitly scale testing requirements to risk levels.
Automation and Modern Technologies: CSA advocates present automation and advanced testing methods as CSA innovations. However, Annex 11 Clause 4.7 has required consideration of automated testing tools since 2011, and GAMP 5 second edition explicitly addresses agile development, cloud computing, and artificial intelligence.
Risk-Based Resource Allocation: The suggestion that CSA introduces risk-based resource allocation ignores decades of GAMP implementation where validation effort is explicitly scaled to system risk and business impact.
Supplier Leverage: CSA emphasis on leveraging supplier documentation and testing is presented as innovative thinking. However, GAMP has advocated supplier assessment and documentation leverage since its early versions, with detailed guidance on when and how to rely on supplier work.

The reality is that organizations with mature, well-implemented validation programs are already applying CSA principles without recognizing them as such. They conduct risk assessments, scale validation activities appropriately, leverage supplier documentation effectively, and focus resources on high-impact systems. They didn’t need CSA to tell them to think critically—they were already applying critical thinking to validation challenges.

The Spectrum Reality: Quality as a Continuous Variable

One of the most important concepts that both GAMP and effective validation practice have always recognized is that system quality exists on a spectrum, not as a binary state. Systems aren’t simply “validated” or “not validated”—they exist at various points along a continuum of validation rigor that corresponds to their risk profile and business impact.

This spectrum concept directly contradicts the CSA marketing message that suggests traditional validation approaches treat all systems identically. In reality, experienced validation professionals have always applied different approaches to different system types.

This spectrum approach enables organizations to allocate validation resources effectively while maintaining appropriate controls. A simple email archiving system doesn’t receive the same validation rigor as a batch manufacturing execution system—not because we’re cutting corners, but because the risks are fundamentally different.

CSA doesn’t introduce this spectrum concept—it restates principles that have been embedded in GAMP guidance for over a decade. The suggestion that traditional validation approaches lack risk-based thinking demonstrates either ignorance of GAMP principles or deliberate misrepresentation of current practices.

Regulatory Convergence: Proof of Existing Framework Maturity

The convergence of global regulatory approaches around risk-based validation principles provides compelling evidence that existing frameworks were already effective and didn’t require CSA “modernization.” The 2025 draft Annex 11 revision demonstrates this convergence clearly.

Key aspects of the draft revision align closely with established GAMP principles:

Risk Management Integration: Section 6 requires risk management throughout the system lifecycle, aligning with ICH Q9 and existing GAMP guidance.
Lifecycle Perspective: Section 4 emphasizes lifecycle management from planning through retirement, consistent with GAMP lifecycle models.
Supplier Oversight: Section 7 requires supplier qualification and ongoing assessment, building on existing GAMP supplier guidance.
Security Integration: Section 15 addresses cybersecurity as a GMP requirement, reflecting technological evolution rather than philosophical change.
Periodic Review: Section 14 mandates periodic system review, formalizing practices that mature organizations already implement.

This alignment doesn’t validate CSA as revolutionary—it demonstrates that global regulators recognize the effectiveness of existing risk-based validation approaches and are codifying them more explicitly. The fact that CSA principles align with regulatory evolution proves that these principles were already embedded in effective validation practice.

The finalized FDA guidance fits into this by providing educational clarity for validation professionals who have struggled to apply risk-based principles effectively. The detailed examples and explicit risk classification criteria offer practical guidance that can improve validation program implementation. This is not a call by the FDA for radical change; it is an educational moment on the current consensus.

The Technical Reality: What Actually Drives System Quality

Beneath the consulting industry rhetoric about CSA lies a more fundamental question: what actually drives computer system quality in regulated environments? The answer has remained consistent across decades of validation practice and won’t change regardless of whether we call our approach CSV, CSA, or any other acronym.

System quality derives from several key factors that transcend validation methodology:

Requirements Definition: Systems must be designed to meet clearly defined user requirements that align with business processes and regulatory obligations. Poor requirements lead to poor systems regardless of validation approach.
Supplier Competence: The quality of the underlying software depends fundamentally on the supplier’s development practices, quality systems, and technical expertise. Validation can detect defects but cannot create quality that wasn’t built into the system.
Configuration Control: Proper configuration of commercial systems requires deep understanding of both the software capabilities and the business requirements. Poor configuration creates risks that no amount of validation testing can eliminate.
Change Management: System quality degrades over time without effective change control processes that ensure modifications maintain validated status. This requires ongoing attention regardless of initial validation approach.
User Competence: Even perfectly validated systems fail if users lack adequate training, motivation, or procedural guidance. Human factors often determine system effectiveness more than technical validation.
Operational Environment: Systems must be maintained within their designed operational parameters—appropriate hardware, network infrastructure, security controls, and environmental conditions. Environmental failures can compromise even well-validated systems.

These factors have driven system quality throughout the history of computer system validation and will continue to do so regardless of methodological labels. CSA doesn’t address any of these fundamental quality drivers differently than GAMP-based approaches—it simply rebrands existing practices with contemporary terminology.

The Economics of Validation: Why Efficiency Matters

One area where CSA advocates make legitimate points involves the economics of validation practice. Poor validation implementations can indeed create excessive costs and time delays that provide minimal risk reduction benefit. However, these problems result from poor implementation, not inherent methodological limitations.

Effective validation programs have always balanced several economic considerations:

Resource Allocation: Validation effort should be concentrated on systems with the highest risk and business impact. Organizations that validate all systems identically are misapplying GAMP principles, not following them.
Documentation Efficiency: Validation documentation should support business objectives rather than existing for its own sake. Excessive documentation often results from misunderstanding regulatory requirements rather than regulatory over-reach.
Testing Effectiveness: Validation testing should build confidence in system performance rather than simply following predetermined scripts. Effective testing combines scripted protocols with exploratory testing, automated validation, and ongoing monitoring.
Lifecycle Economics: The total cost of validation includes initial validation plus ongoing maintenance throughout the system lifecycle. Front-end investment in robust validation often reduces long-term operational costs.
Opportunity Cost: Resources invested in validation could be applied to other quality improvements. Effective validation programs consider these opportunity costs and optimize overall quality outcomes.

These economic principles aren’t CSA innovations—they’re basic project management applied to validation activities. Organizations experiencing validation inefficiencies typically suffer from poor implementation of established practices rather than inadequate methodological guidance.

The Agile Development Challenge: Old Wine in New Bottles

One area where CSA advocates claim particular expertise involves validating systems developed using agile methodologies, continuous integration/continuous deployment (CI/CD), and other modern software development approaches. This represents a more legitimate consulting opportunity because these development methods do create genuine challenges for traditional validation approaches.

However, the validation industry’s response to agile development demonstrates both the adaptability of existing frameworks and the consulting industry’s tendency to oversell new approaches as revolutionary breakthroughs.

GAMP 5 second edition, published in 2022, explicitly addresses agile development challenges and provides guidance for validating systems developed using modern methodologies. The core principles remain unchanged—validation should provide confidence that systems are fit for their intended use—but the implementation approaches adapt to different development lifecycles.

Key adaptations for agile development include:

Iterative Validation: Rather than conducting validation at the end of development, validation activities occur throughout each development sprint, allowing for earlier defect detection and correction.
Automated Testing Integration: Automated testing tools become part of the validation approach rather than separate activities, leveraging the automated testing that agile development teams already implement.
Risk-Based Prioritization: User stories and system features are prioritized based on risk assessment, ensuring that high-risk functionality receives appropriate validation attention.
Continuous Documentation: Documentation evolves continuously rather than being produced as discrete deliverables, aligning with agile documentation principles.
Supplier Collaboration: Validation activities are integrated with supplier development processes rather than conducted independently, leveraging the transparency that agile methods provide.

These adaptations represent evolutionary improvements, often slight, in validation practice rather than revolutionary breakthroughs. They address genuine challenges created by modern development methods while maintaining the fundamental goal of ensuring system fitness for intended use.
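
As an illustration of how risk-based prioritization and automated testing can work together inside a sprint, consider the sketch below. The `UserStory` and `AutomatedResult` structures, the story IDs, and the three-level risk scale are hypothetical; nothing here is prescribed by GAMP 5. The sketch orders stories by assessed risk and flags high-risk stories that do not yet have passing automated evidence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UserStory:
    story_id: str
    summary: str
    risk: str  # "high", "medium", or "low" from the team's risk assessment


@dataclass(frozen=True)
class AutomatedResult:
    story_id: str  # the story this automated test traces to
    passed: bool


RISK_ORDER = {"high": 0, "medium": 1, "low": 2}


def prioritize(stories):
    """Order stories so high-risk functionality is reviewed first.

    sorted() is stable, so stories with equal risk keep their backlog order.
    """
    return sorted(stories, key=lambda s: RISK_ORDER[s.risk])


def unverified_high_risk(stories, results):
    """Return IDs of high-risk stories with no passing automated test."""
    covered = {r.story_id for r in results if r.passed}
    return [
        s.story_id
        for s in stories
        if s.risk == "high" and s.story_id not in covered
    ]


backlog = [
    UserStory("US-1", "reformat label print preview", "low"),
    UserStory("US-2", "electronic signature on batch record", "high"),
    UserStory("US-3", "yield calculation for batch release", "high"),
]
sprint_results = [
    AutomatedResult("US-2", True),
    AutomatedResult("US-3", False),
]

print([s.story_id for s in prioritize(backlog)])
print(unverified_high_risk(backlog, sprint_results))
```

A check like this could run at the end of each sprint, so that the validation question ("do we have evidence for the risky parts?") is asked continuously rather than once at the end of development.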

The Cloud Computing Reality: Infrastructure Versus Application

Another area where CSA advocates claim particular relevance involves cloud-based systems and Software as a Service (SaaS) applications. This represents a more legitimate area of methodological development because cloud computing does create genuine differences in validation approach compared to traditional on-premises systems.

However, the core validation challenges remain unchanged: organizations must ensure that cloud-based systems are fit for their intended use, maintain data integrity, and comply with applicable regulations. The differences lie in implementation details rather than fundamental principles.

Key considerations for cloud-based system validation include:

Shared Responsibility Models: Cloud providers and customers share responsibility for different aspects of system security and compliance. Validation approaches must clearly delineate these responsibilities and ensure appropriate controls at each level.
Supplier Assessment: Cloud providers require more extensive assessment than traditional software suppliers because they control critical infrastructure components that customers cannot directly inspect.
Data Residency and Transfer: Cloud systems often involve data transfer across geographic boundaries and storage in multiple locations. Validation must address these data handling practices and their regulatory implications.
Service Level Agreements: Cloud services operate under different availability and performance models than on-premises systems. Validation approaches must adapt to these service models.
Continuous Updates: Cloud providers often update their services more frequently than traditional software suppliers. Change control processes must adapt to this continuous update model.

These considerations require adaptation of validation practices but don’t invalidate existing principles. Organizations can validate cloud-based systems using GAMP principles with appropriate modification for cloud-specific characteristics. CSA doesn’t provide fundamentally different guidance—it repackages existing adaptation strategies with cloud-specific terminology.
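
The shared responsibility idea lends itself to a simple data structure. The sketch below is a hypothetical responsibility matrix for a SaaS application; the control names and owner assignments are invented for illustration and would come from the actual contract and supplier assessment in practice. It surfaces two things: controls nobody owns, and controls where the customer depends on provider evidence.

```python
VALID_OWNERS = {"provider", "customer", "shared"}

# Hypothetical controls and owners; None marks a control nobody has claimed.
RESPONSIBILITY_MATRIX = {
    "physical data centre security": "provider",
    "infrastructure patching": "provider",
    "backup and restore": "shared",
    "user access administration": "customer",
    "configuration of GxP workflows": "customer",
    "audit trail review": None,
}


def find_gaps(matrix):
    """Return (unassigned controls, controls needing provider evidence)."""
    for control, owner in matrix.items():
        if owner is not None and owner not in VALID_OWNERS:
            raise ValueError(f"unknown owner {owner!r} for {control!r}")
    unassigned = sorted(c for c, o in matrix.items() if o is None)
    needs_evidence = sorted(
        c for c, o in matrix.items() if o in ("provider", "shared")
    )
    return unassigned, needs_evidence


unassigned, needs_evidence = find_gaps(RESPONSIBILITY_MATRIX)
print("Unassigned:", unassigned)
print("Needs provider evidence:", needs_evidence)
```

Nothing about this is new methodology. It is the familiar supplier assessment question, written down in a form that makes an unowned control hard to overlook.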

The Data Integrity Connection: Where Real Innovation Occurs

One area where legitimate innovation has occurred in pharmaceutical quality involves data integrity practices and their integration with computer system validation. The FDA’s data integrity guidance documents, EU data integrity guidelines, and industry best practices have evolved significantly over the past decade, creating genuine opportunities for improved validation approaches.

However, this evolution represents refinement of existing principles rather than replacement of established practices. Data integrity concepts build directly on computer system validation foundations:

ALCOA+ Principles: Attributable, Legible, Contemporaneous, Original, Accurate data requirements, plus Complete, Consistent, Enduring, and Available requirements, extend traditional validation concepts to address specific data handling challenges.
Audit Trail Requirements: Enhanced audit trail capabilities build on existing Part 11 requirements while addressing modern data manipulation risks.
System Access Controls: Improved user authentication and authorization extend traditional computer system security while addressing contemporary threats.
Data Lifecycle Management: Systematic approaches to data creation, processing, review, retention, and destruction integrate with existing system lifecycle management.
Risk-Based Data Review: Proportionate data review approaches apply risk-based thinking to data integrity challenges.

These developments represent genuine improvements in validation practice that address real regulatory and business challenges. They demonstrate how existing frameworks can evolve to address new challenges without requiring wholesale replacement of established approaches.
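
A small sketch shows how a few of the ALCOA+ attributes translate into checks on audit trail entries. The `AuditEntry` fields, the five-minute recording tolerance, and the rule that modifications need a stated reason are assumptions made for this example, not requirements quoted from Part 11 or any data integrity guidance.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass(frozen=True)
class AuditEntry:
    user_id: Optional[str]
    action: str               # e.g. "create", "modify", "delete"
    performed_at: datetime    # when the action happened
    recorded_at: datetime     # when the system logged it
    reason: Optional[str] = None


# Assumed tolerance between an action and its record.
MAX_RECORDING_DELAY = timedelta(minutes=5)


def review_entry(entry: AuditEntry) -> list:
    """Return a list of findings; an empty list means no issue was detected."""
    findings = []
    if not entry.user_id:
        findings.append("not attributable: missing user")
    if entry.recorded_at - entry.performed_at > MAX_RECORDING_DELAY:
        findings.append("not contemporaneous: recorded late")
    if entry.action == "modify" and not entry.reason:
        findings.append("incomplete: modification without reason")
    return findings


t0 = datetime(2025, 1, 1, 9, 0)
clean = AuditEntry("jdoe", "create", t0, t0 + timedelta(seconds=30))
suspect = AuditEntry(None, "modify", t0, t0 + timedelta(hours=2))

print(review_entry(clean))
print(review_entry(suspect))
```

Checks like these only cover what can be read from the record itself. Attributes such as accuracy and originality still depend on how the system and the surrounding process are designed.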

The Training and Competence Reality: Where Change Actually Matters

Perhaps the area where CSA advocates make the most legitimate points involves training and competence development for validation professionals. Traditional validation training has often focused on procedural compliance rather than risk-based thinking, creating practitioners who can follow protocols but struggle with complex risk assessment and decision-making.

This competence gap creates real problems in validation practice:

Protocol-Following Over Problem-Solving: Validation professionals trained primarily in procedural compliance may miss system risks that don’t fit predetermined testing categories.
Documentation Focus Over Quality Focus: Emphasis on documentation completeness can obscure the underlying goal of ensuring system fitness for intended use.
Risk Assessment Limitations: Many validation professionals lack the technical depth needed for effective risk assessment of complex modern systems.
Regulatory Interpretation Challenges: Understanding the intent behind regulatory requirements rather than just their literal text requires experience and training that many practitioners lack.
Technology Evolution: Rapid changes in information technology create knowledge gaps for validation professionals trained primarily on traditional systems.

These competence challenges represent genuine opportunities for improvement in validation practice. However, they result from inadequate implementation of existing approaches rather than flaws in the approaches themselves. GAMP has always emphasized risk-based thinking and proportionate validation—the problem lies in how practitioners are trained and supported, not in the methodological framework.

Effective responses to these competence challenges include:

Risk-Based Training: Education programs that emphasize risk assessment and critical thinking rather than procedural compliance.
Technical Depth Development: Training that builds understanding of information technology principles rather than just validation procedures.
Regulatory Context Education: Programs that help practitioners understand the regulatory intent behind validation requirements.
Scenario-Based Learning: Training that uses complex, real-world scenarios rather than simplified examples.
Continuous Learning Programs: Ongoing education that addresses technology evolution and regulatory changes.

These improvements can be implemented within existing GAMP frameworks without requiring adoption of any ‘new’ paradigm. They address real professional development needs while building on established validation principles.

The Measurement Challenge: How Do We Know What Works?

One of the most frustrating aspects of the CSA versus CSV debate is the lack of empirical evidence supporting claims of CSA superiority. Validation effectiveness ultimately depends on measurable outcomes: system reliability, regulatory compliance, cost efficiency, and business enablement. However, CSA advocates rarely present comparative data demonstrating improved outcomes.

Meaningful validation metrics might include:

System Reliability: Frequency of system failures, time to resolution, and impact on business operations provide direct measures of validation effectiveness.
Regulatory Compliance: Inspection findings, regulatory citations, and compliance costs indicate how well validation approaches meet regulatory expectations.
Cost Efficiency: Total cost of ownership including initial validation, ongoing maintenance, and change control activities reflects economic effectiveness.
Time to Implementation: Speed of system deployment while maintaining appropriate quality controls indicates process efficiency.
User Satisfaction: System usability, training effectiveness, and user adoption rates reflect practical validation outcomes.
Change Management Effectiveness: Success rate of system changes, time required for change implementation, and change-related defects indicate validation program maturity.

Without comparative data on these metrics, claims of CSA superiority remain unsupported marketing assertions. Organizations considering CSA adoption should demand empirical evidence of improved outcomes rather than accepting theoretical arguments about methodological superiority.
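
For readers who want to start collecting this kind of evidence, the following sketch computes three of the change management measures from a list of change records. The `ChangeRecord` fields and the sample values are hypothetical; they show the shape of the calculation, not real performance data.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class ChangeRecord:
    change_id: str
    succeeded: bool           # implemented without rollback
    defects_found: int        # change-related defects found after release
    days_to_implement: float


def summarize(changes):
    """Compute simple change management metrics for one set of records."""
    if not changes:
        raise ValueError("at least one change record is required")
    n = len(changes)
    return {
        "success_rate": sum(c.succeeded for c in changes) / n,
        "defects_per_change": sum(c.defects_found for c in changes) / n,
        "mean_days_to_implement": mean(c.days_to_implement for c in changes),
    }


# Invented sample records for illustration only.
sample = [
    ChangeRecord("C-1", True, 0, 10),
    ChangeRecord("C-2", False, 2, 20),
    ChangeRecord("C-3", True, 1, 15),
    ChangeRecord("C-4", True, 0, 15),
]

print(summarize(sample))
```

Running the same summary over records from before and after a methodology change would give the kind of side-by-side comparison that is missing from the debate, provided the two periods cover comparable systems and change types.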

The Global Regulatory Perspective: Why Consistency Matters

The pharmaceutical industry operates in a global regulatory environment where consistency across jurisdictions provides significant business value. Validation approaches that work effectively across multiple regulatory frameworks reduce compliance costs and enable efficient global operations.

GAMP-based validation approaches have demonstrated this global effectiveness through widespread adoption across major pharmaceutical markets:

FDA Acceptance: GAMP principles align with FDA computer system validation expectations and have been successfully applied in thousands of FDA-regulated facilities.
EMA/European Union Compatibility: GAMP approaches satisfy EU GMP requirements including Annex 11 and have been widely implemented across European pharmaceutical operations.
Other Regulatory Bodies: GAMP principles are compatible with Health Canada, TGA (Australia), PMDA (Japan), and other regulatory frameworks, enabling consistent global implementation.
Industry Standards Integration: GAMP integrates effectively with ISO standards, ICH guidelines, and other international frameworks that pharmaceutical companies must address.

This global consistency represents a significant competitive advantage for established validation approaches. CSA, despite alignment with FDA thinking, has not demonstrated equivalent acceptance across other regulatory frameworks. Organizations adopting CSA risk creating validation approaches that work well in FDA-regulated environments but require modification for other jurisdictions.

The regulatory convergence demonstrated by the draft Annex 11 revision suggests that global harmonization is occurring around established risk-based validation principles rather than newer CSA concepts. This convergence validates existing approaches rather than supporting wholesale methodological change.

The Practical Implementation Reality: What Actually Happens

Beyond the methodological debates and consulting industry marketing lies the practical reality of how validation programs actually function in pharmaceutical organizations. This reality demonstrates why existing GAMP-based approaches remain effective and why CSA adoption often creates more problems than it solves.

Successful validation programs, regardless of methodological label, share several common characteristics:

Senior Leadership Support: Validation programs succeed when senior management understands their business value and provides appropriate resources.
Cross-Functional Integration: Effective validation requires collaboration between quality assurance, information technology, operations, and regulatory affairs functions.
Appropriate Resource Allocation: Validation programs must be staffed with competent professionals and provided with adequate tools and budget.
Clear Procedural Guidance: Staff need clear, practical procedures that explain how to apply validation principles to specific situations.
Ongoing Training and Development: Validation effectiveness depends on continuous learning and competence development.
Metrics and Continuous Improvement: Programs must measure their effectiveness and adapt based on performance data.

These success factors operate independently of methodological labels.

The practical implementation reality also reveals why consulting industry solutions often fail to deliver promised benefits. Consultants typically focus on methodological frameworks and documentation rather than the organizational factors that actually drive validation effectiveness. An organization with poor cross-functional collaboration, inadequate resources, and weak senior management support won’t solve these problems by adopting some consultant’s version of CSA. It needs fundamental improvements in how it approaches validation as a business function.

The Future of Validation: Evolution, Not Revolution

Looking ahead, computer system validation will continue to evolve in response to technological change, regulatory development, and business needs. However, this evolution will likely occur within existing frameworks rather than through wholesale replacement of established approaches.

Several trends will shape validation practice over the coming decade:

Increased Automation: Automated testing tools, artificial intelligence applications, and machine learning capabilities will become more prevalent in validation practice, but they will augment rather than replace human judgment.
Cloud and SaaS Integration: Cloud computing and Software as a Service applications will require continued adaptation of validation approaches, but these adaptations will build on existing risk-based principles.
Data Analytics Integration: Advanced analytics capabilities will provide new insights into system performance and risk patterns, enabling more sophisticated validation approaches.
Regulatory Harmonization: Continued convergence of global regulatory approaches will simplify validation for multinational organizations.
Agile and DevOps Integration: Modern software development methodologies will require continued adaptation of validation practices, but the fundamental goals remain unchanged.

These trends represent evolutionary development rather than revolutionary change. They will require validation professionals to develop new technical competencies and adapt established practices to new contexts, but they don’t invalidate the fundamental principles that have guided effective validation for decades.

Organizations preparing for these future challenges will be best served by building strong foundational capabilities in risk assessment, technical understanding, and adaptability rather than adopting particular methodological labels. The ability to apply established validation principles to new challenges will prove more valuable than expertise in any specific framework or approach.

The Emperor’s New Validation Clothes

Computer System Assurance represents a textbook case of how the pharmaceutical consulting industry creates artificial innovation by rebranding established practices as revolutionary breakthroughs. Every principle that CSA advocates present as innovative thinking has been embedded in risk-based validation approaches, GAMP guidance, and regulatory expectations for over two decades.

The fundamental question is not whether CSA principles are sound—they generally are, because they restate established best practices. The question is whether the pharmaceutical industry benefits from treating existing practices as obsolete and investing resources in “modernization” projects that deliver minimal incremental value.

The answer should be clear to any quality professional who has implemented effective validation programs: we don’t need CSA to tell us to think critically about validation challenges, apply risk-based approaches to system assessment, or leverage supplier documentation effectively. We’ve been doing these things successfully for years using GAMP principles and established regulatory guidance.

What we do need is better implementation of existing approaches—more competent practitioners, stronger organizational support, clearer procedural guidance, and continuous improvement based on measurable outcomes. These improvements can be achieved within established frameworks without expensive consulting engagements or wholesale methodological change.

The computer system assurance emperor has no clothes—underneath the contemporary terminology and marketing sophistication lies the same risk-based, lifecycle-oriented, supplier-leveraging validation approach that mature organizations have been implementing successfully for over a decade. Quality professionals should focus their attention on implementation excellence rather than methodological fashion, building validation programs that deliver demonstrable business value regardless of what acronym appears on the procedure titles.

The choice facing pharmaceutical organizations is not between outdated CSV and modern CSA—it’s between poor implementation of established practices and excellent implementation of the same practices. Excellence is what protects patients, ensures product quality, and satisfies regulatory expectations. Everything else is just consulting industry marketing.
