Using AI in GxP — without giving up control
Yes, AI may be used in GxP environments — the question is no longer whether but how controlled. The regulatory frame is in place: EU GMP Annex 22 (draft, consultation 2025) defines requirements for AI in GMP environments for the first time, GAMP 5 Second Edition explicitly addresses AI/ML, and the FDA follows a risk-based approach with CSA. What decides the outcome is the control architecture: every AI statement needs a source, every draft remains a suggestion until a human approves it, and the audit trail documents the entire path.
Is AI even allowed to work in GxP environments?
The question still comes up in QA meetings — the regulators have long answered it. None of the relevant authorities prohibits the use of AI in regulated environments. What they do instead: they define under which conditions AI work is acceptable. Those conditions look alike across all frameworks — risk-based, traceable, with human responsibility.
That shifts the real task. It is not the policy debate that decides whether your company can use AI — it is whether your working environment structurally provides the required controls: source binding, attributable approvals, complete documentation. A generic chat interface does not provide that. A controlled working environment does.
The 2026 regulatory frame — four anchors
Four documents currently determine how AI use in GxP environments is assessed:
- EU GMP Annex 22 (draft): the first GMP annex dedicated entirely to artificial intelligence — published for consultation in July 2025, together with the Annex 11 revision. Core points: intended use, validation of static models, human oversight, explainability. Generative AI sits outside the critical scope — in non-critical, supporting applications it remains possible, with documented human review. The details: our Annex 22 analysis.
- GAMP 5 Second Edition (2022): the industry guide recognises AI/ML as part of computerised systems and puts critical thinking above schematic documentation — what changes is covered in our GAMP 5 article.
- FDA Computer Software Assurance: final since September 2025, updated in February 2026 (QMSR terminology). CSA does not prescribe a tool — it requires risk-based testing depth: CSA vs. CSV compared.
- 21 CFR Part 11 / ALCOA+: the constant: electronic records, signatures and tamper-evident audit trails. Every piece of AI-assisted work must meet these requirements in the end — regardless of which tool wrote the draft.
What stands out is the convergence: four frameworks, one direction — risk-based, source-bound, human-owned. Align your AI work with these three principles and you can answer to all four anchors.
Where AI works well today — and where it doesn't
The honest map, as of 2026:
Sensible today — supporting, document-level work where a human reviews and approves:
- Document drafting: drafts for URS, specifications, test documentation, SOPs — from controlled sources, with citations. The biggest time sink in Computer System Validation is rarely the testing — it is the writing and holding-together.
- Review preparation: finding inconsistencies, gaps and contradictions across document sets before the human review starts.
- Audit preparation: consolidating evidence, building gap lists, drafting answers with source binding — assessment stays with QA (audit readiness).
- Knowledge access: answering questions against your own controlled source space — instead of the open internet.
Not today — and not in the foreseeable future:
- Generative AI in critical GMP applications — the Annex 22 draft does not foresee its use there.
- Autonomous approvals: no framework accepts an AI as the approving instance. The signature stays human — with a name and a justification.
- Continuously learning models in validated processes: a state that keeps changing cannot be demonstrated as stable.
This boundary is not a weakness of the technology — it is what makes the sensible part inspection-proof.
Five control principles that make AI work evidencable
Whether an AI deployment holds up in an inspection is decided by the control architecture. Five principles have proven load-bearing — they are also the foundation of the traqx trust architecture:
- Source binding (citation): every regulatory statement ends on a clickable source from the controlled source space. What has no source is marked as unevidenced — not silently plausible.
- Ghost values: an AI suggestion visibly remains a suggestion until a human adopts it. Draft and approved state can never be confused.
- Human-in-the-loop: every approval carries a name and a justification. Responsibility cannot be delegated to a model — nor should it be.
- Deterministic checking: whether citations exist and statements match their source is not judged by a second language model but by a deterministic check: pass or fail.
- Complete audit trail: who decided what, when, on what basis — across the whole lifecycle, tamper-evident (21 CFR Part 11, ALCOA+).
The test for any tool — including ours: can these five principles be demonstrated structurally, or do they depend on the discipline of individual users?
Introducing AI in practice: start with one process
The successful AI introductions we see in regulated environments follow the same pattern — and it is the opposite of a major IT project:
- One team, one real process: not the whole organisation, but one bounded GxP process with real pain — an SOP revision, a validation package, an audit preparation.
- Source space first: before the first prompt comes the question of which controlled documents the AI may work from. The source space is the risk boundary.
- Stop/go criteria up front: how will you tell after four to six weeks whether it carries? Review effort, correction reasons, evidence quality — defined before, not interpreted after.
- QA at the table from day one: not as the approval body at the end, but as co-designer of the controls. That changes acceptance fundamentally.
What does not work: the big-bang rollout (“AI for everyone, starting Monday”), shadow use without a source space — and the opposite mistake of waiting two years for the final edition of every guidance while your team is already working unmanaged with public chat tools.
The most common objections — and what sits behind them
“AI hallucinates — we cannot afford that.” Correct, and that is why hallucination is an architecture problem, not a model problem: with source binding and deterministic checking, an invented reference becomes visible as unevidenced before it reaches a review. The danger is not the model — it is uncontrolled use of it.
“Our data must not train a model.” A legitimate requirement — and solvable contractually and technically: EU hosting, no model training on customer data, a defined source space. That belongs in every supplier assessment of an AI vendor.
“Do we have to validate the AI tool itself?” The tool is qualified risk-based like other software (GAMP 5 logic); what matters is that the work products stay evidenced — source, review, approval, audit trail. Responsibility for the content stays with the regulated company.
“What will an auditor say?” Auditors ask the same questions they ask about any work: where does the statement come from, who approved it, where is the trail? A controlled AI environment answers these questions faster than manual work — because the connection never breaks.
What this means for your roadmap
The regulatory frame is converging, the control principles are known, and the entry can be small. The order we recommend: pick one process, define the source space, set stop/go criteria, bring QA to the table — and assess honestly after six weeks.
That entry is exactly what traqx is built for: AI drafts, your team reviews and approves — sources, versions and the audit trail stay connected. The fastest way to see it on your process is the 30-minute live demo.
Key takeaways
- No framework prohibits AI in GxP — all four anchors (draft Annex 22, GAMP 5 2nd Ed, FDA CSA, Part 11) define the how: risk-based, source-bound, human-owned.
- Generative AI belongs in supporting, document-level work with human approval today — not in critical GMP applications and not in autonomous decisions.
- Evidencability is architecture, not discipline: source binding, ghost values, human-in-the-loop, deterministic checking and the audit trail must be built into the tool structurally.
- The proven entry: one team, one real process, a defined source space, stop/go criteria up front — QA at the table from day one.
- Waiting is the riskiest option: without a controlled environment, shadow AI use emerges — unmanaged and unevidencable.
Sources
- European Commission — EudraLex Vol. 4, draft Annex 22 “Artificial Intelligence” (consultation July–October 2025) — the first GMP annex on AI: scope, static models, human oversight, explainability.
- ISPE — GAMP 5: A Risk-Based Approach to Compliant GxP Computerized Systems, 2nd Edition (2022) — critical thinking, agile lifecycles and the treatment of AI/ML in the system lifecycle.
- FDA — Computer Software Assurance for Production and Quality Management System Software (final guidance: 24 September 2025, updated 3 February 2026) — risk-based testing depth instead of documentation volume; tool-neutral.
- EU GMP Annex 11 — Computerised Systems (EudraLex Vol. 4) — the general frame for computerised systems that Annex 22 is embedded in.
- 21 CFR Part 11 — Electronic Records; Electronic Signatures — electronic records, signatures and tamper-evident audit trails.
These analyses monthly by email
Practice-depth on CSV, CSA, audit-readiness and AI governance — no spam, unsubscribe in one click.