Resources / Insights

An AI Compliance Checklist for Regulated Industries

Comparison table showing runtime AI failing on reproducibility, versioning, audit trail and drift, while frozen code passes all four questions auditors ask

If you operate in finance, insurance, or healthcare, the question about AI is no longer whether it helps. It is whether you can put it in front of an auditor without flinching. The rules arrived faster than most compliance teams expected, and a demo that impresses a product manager is a long way from a system that survives an examiner. This is a practical map of what changed, what reviewers actually ask for, and a checklist you can run before you deploy anything.

A disclaimer worth stating plainly: this is an overview, not legal advice, and the rules below move. Where a date or status is in flux, I have said so. Confirm specifics with your own counsel before you rely on them.

The 2026 regulatory map, briefly

The EU AI Act entered into force in August 2024 and phases in over several years. Prohibited practices applied from February 2025, general-purpose model obligations from August 2025, and most high-risk system requirements were set for August 2026 (AI Act implementation timeline). One moving piece to watch: in November 2025 the European Commission proposed a "Digital Omnibus" that would push the high-risk deadline to December 2027. As of this writing that deferral is proposed, not adopted, so the original timeline still governs unless and until it passes (DLA Piper). High-risk systems carry concrete duties: risk management, data governance, technical documentation, automatic logging of events, human oversight, and accuracy and robustness.

In the US, the NIST AI Risk Management Framework remains the voluntary backbone, organized around four functions (govern, map, measure, manage), with a Generative AI Profile added in July 2024 (NIST). State law is shifting underneath everyone: Colorado repealed and replaced its original AI Act, with Governor Polis signing SB 26-189 in May 2026, effective January 2027, pivoting toward disclosure and consumer-rights obligations (Carpe Datum Law). Many older summaries still cite the repealed version, so check the date on anything you read.

Sector regulators have their own guidance. The NAIC Model Bulletin on insurers' use of AI, adopted in December 2023, asks insurers to maintain a written AI systems program and has been taken up by roughly half the states (NAIC). On the healthcare side, HHS proposed the first major HIPAA Security Rule update in about two decades in January 2025 (HIPAA Journal). The SEC has not finalized a dedicated AI rule but has already brought enforcement actions against advisers for "AI washing," overstating their AI capabilities.

What auditors and regulators actually ask

Strip away the framework names and reviewers keep returning to a short list. Can you reproduce the decision the system made on a given date, with the version that made it? Can you trace a number back to its source data? Can a qualified human review, contest, and override an automated decision, rather than rubber-stamp it? Is there a complete, tamper-evident record of what ran?

That last cluster has teeth in Europe. Under GDPR Article 22, individuals have the right not to be subject to a solely automated decision with legal or similarly significant effects, plus a right to human intervention and to meaningful information about the logic involved (ICO). "Meaningful information about the logic" is a high bar for a system whose logic is a model's behavior at a moment in time.

The certification alphabet, decoded

Buyers will ask your vendors, and your own security team will ask you, about a familiar set of attestations. Briefly, so you can tell them apart:

A note on our own posture, to be straight with you: Dittah does not claim to hold any of these certifications, and you should be skeptical of any vendor that waves a logo without handing you the report. What the architecture below does is make the underlying audit questions easier to answer, which is a different thing from a certificate.

Why runtime generative AI struggles in an audit

The hardest compliance problem with a model in the runtime loop is that it cannot reliably reproduce its own decisions. When outputs vary on identical inputs, reconstructing and auditing a specific decision becomes very difficult. Research on financial AI systems makes the point bluntly: in regulated settings, accuracy that cannot be reproduced cannot really be audited, and structured output can degrade meaningfully across runs (arXiv, 2026). Add silent model drift, where the provider retunes the model and your behavior shifts without a changelog, and you have a system that fails the reproducibility test by construction.

How frozen code maps to the audit asks

This is where a design-time-only approach earns its keep. With Dittah, AI helps you build the workflow, then publishing freezes it into versioned, immutable code, and production runs that code with no model call. Line that up against the recurring audit asks and it maps cleanly. Reproducibility comes for free, because the same input yields the same output every time. Versioning and traceability are built in, because each run is tied to a specific, identifiable version of the logic. The audit trail is the execution log. And there is no model drift in production, because the frozen artifact does not change between runs unless you deliberately publish a new version.

There is even an auditing standard that points the same way. The PCAOB's benchmarking guidance treats fully automated controls as not subject to human failure, and notes the approach is especially strong when the code cannot be modified between runs (PCAOB, AS 5 Appendix B). That kind of immutable, versioned logic is close to exactly what an auditor wants to see.

The checklist

Run these before you deploy any AI automation, whoever built it. If you cannot answer a question, that is your next piece of work, not a detail to defer.

Bottom line

Compliance for AI in regulated industries comes down to a question the rules ask in a dozen different ways: can you reproduce, explain, and prove what your system did? Keep a model in the runtime loop and you will struggle to answer that honestly. Let AI build the logic once and freeze it, and the answer falls out of the architecture. Start from the checklist, be honest about the gaps, and prefer architectures that give you reproducibility as a property rather than a promise. If you want to see what a deterministic, fully logged workflow looks like in practice, we walked one through end to end, or you can build one yourself.

Regulatory references are linked inline and reflect material available as of April 2026. This is general information, not legal advice. Statuses noted as proposed or pending should be reconfirmed before you rely on them.

Back to Resources