Public reference

The Annex IV Mapping

Annex IV is the EU AI Act's technical-documentation requirement — what the regulator wants to see when a high-risk AI system is placed on the market. We translate each requirement into the engineering source where the evidence usually already exists in a disciplined ML platform, plus the gap pattern we see most often and a default severity. The Diagnostic surveys these rows; the Sprint analyses them per system; the Programme operationalises the technical file so it stays current as your models change.

Annex IV §1 — system description

The first paragraph of Annex IV asks for a general description of the system: what it is, how it ships, what it talks to. Most rows have low default severity because the artefacts are typically already documented somewhere in your engineering wiki or release process — the gap is shape, not substance.

Cite

What the regulation asks for

Where it lives in your stack

Common gap

Default severity

Annex IV §1(a)
Intended purpose; persons developing the system; date and version
Where it lives in your stack
Model registry entry; Git tag; release-gate metadata
Common gap
Often complete in MLflow or W&B; sometimes missing the 'intended purpose' prose statement
Low
Annex IV §1(a)
Intended purpose; persons developing the system; date and version
Model registry entry; Git tag; release-gate metadata
Often complete in MLflow or W&B; sometimes missing the 'intended purpose' prose statement
Low
Annex IV §1(b)
Provider details (name, address, etc.)
Where it lives in your stack
Company registration data
Common gap
Boilerplate; almost never a gap
Low
Annex IV §1(b)
Provider details (name, address, etc.)
Company registration data
Boilerplate; almost never a gap
Low
Annex IV §1(c)
How the system interacts with hardware and software not part of the AI system
Where it lives in your stack
Service architecture diagrams; interface contracts; SDK docs
Common gap
Usually exists in the engineering wiki, never in Annex IV shape
Medium
Annex IV §1(c)
How the system interacts with hardware and software not part of the AI system
Service architecture diagrams; interface contracts; SDK docs
Usually exists in the engineering wiki, never in Annex IV shape
Medium
Annex IV §1(d)
Versions of relevant software / firmware
Where it lives in your stack
Container manifests; lock files; CI artefacts
Common gap
Usually exists; sometimes only in CI logs and not in a written artefact
Low
Annex IV §1(d)
Versions of relevant software / firmware
Container manifests; lock files; CI artefacts
Usually exists; sometimes only in CI logs and not in a written artefact
Low
Annex IV §1(e)
Forms in which the system is placed on the market — boxed product, online API, embedded SDK, etc.
Where it lives in your stack
Product / release docs
Common gap
Usually clear from the product surface; needs a one-line written statement
Low
Annex IV §1(e)
Forms in which the system is placed on the market — boxed product, online API, embedded SDK, etc.
Product / release docs
Usually clear from the product surface; needs a one-line written statement
Low
Annex IV §1(f)
Description of the hardware on which the system runs
Where it lives in your stack
Infra-as-code (Terraform, Pulumi, etc.); deployment manifests
Common gap
Often complete in IaC; needs a prose summary
Low
Annex IV §1(f)
Description of the hardware on which the system runs
Infra-as-code (Terraform, Pulumi, etc.); deployment manifests
Often complete in IaC; needs a prose summary
Low
Annex IV §1(g)
Validation and testing logs, signed and dated
Where it lives in your stack
MLflow / W&B run history; CI logs
Common gap
Almost always exists; signing-and-dating discipline missing
Medium
Annex IV §1(g)
Validation and testing logs, signed and dated
MLflow / W&B run history; CI logs
Almost always exists; signing-and-dating discipline missing
Medium
Annex IV §1(h)
User-interface description (where applicable)
Where it lives in your stack
UX docs; user-facing screenshots
Common gap
Usually exists for B2C; thinner for API-only products
Low
Annex IV §1(h)
User-interface description (where applicable)
UX docs; user-facing screenshots
Usually exists for B2C; thinner for API-only products
Low
Annex IV §1(i)
Instructions for use to deployers (where applicable)
Where it lives in your stack
API docs; SDK guides; deployer-facing manuals
Common gap
Often exists; not in regulator format
Low
Annex IV §1(i)
Instructions for use to deployers (where applicable)
API docs; SDK guides; deployer-facing manuals
Often exists; not in regulator format
Low

Annex IV §2 — detailed technical description

The second paragraph is where the heavy lifting sits: design choices, data sheets, validation, predetermined changes, cybersecurity. Most rows are medium-to-high severity because the artefacts often exist informally (model cards, design discussions in Slack, evaluation notebooks) but not in a regulator-readable form.

Cite

What the regulation asks for

Where it lives in your stack

Common gap

Default severity

Annex IV §2(a)
General logic / methodology used to develop the system
Where it lives in your stack
Model card; design doc; ADR-style notes
Common gap
Most common gap — model cards rarely written with the discipline Annex IV requires
High
Annex IV §2(a)
General logic / methodology used to develop the system
Model card; design doc; ADR-style notes
Most common gap — model cards rarely written with the discipline Annex IV requires
High
Annex IV §2(b)
Design choices, key assumptions, rationale
Where it lives in your stack
Same as §2(a) plus commit history and design discussions
Common gap
Same gap pattern as §2(a) — assumptions live in tribal memory, not in writing
High
Annex IV §2(b)
Design choices, key assumptions, rationale
Same as §2(a) plus commit history and design discussions
Same gap pattern as §2(a) — assumptions live in tribal memory, not in writing
High
Annex IV §2(c)
System architecture description
Where it lives in your stack
Architecture diagrams; service docs
Common gap
Usually exists; needs an Annex IV–shape document
Medium
Annex IV §2(c)
System architecture description
Architecture diagrams; service docs
Usually exists; needs an Annex IV–shape document
Medium
Annex IV §2(d)
Data sheets — sources, training / validation / test data, lineage, biases
Where it lives in your stack
Data documentation; ETL configs; data lineage tools
Common gap
Common gap — most teams have lineage but not datasheets
High
Annex IV §2(d)
Data sheets — sources, training / validation / test data, lineage, biases
Data documentation; ETL configs; data lineage tools
Common gap — most teams have lineage but not datasheets
High
Annex IV §2(e)
Human oversight description (Article 14 alignment)
Where it lives in your stack
Product UX showing human-review affordances; reviewer training materials
Common gap
Often informal; needs a written human-oversight design
Medium
Annex IV §2(e)
Human oversight description (Article 14 alignment)
Product UX showing human-review affordances; reviewer training materials
Often informal; needs a written human-oversight design
Medium
Annex IV §2(f)
Predetermined changes / continuous-learning behaviour
Where it lives in your stack
Change-control log; feature-flag config; retraining cadence doc
Common gap
Often exists in process; not in writing
Medium
Annex IV §2(f)
Predetermined changes / continuous-learning behaviour
Change-control log; feature-flag config; retraining cadence doc
Often exists in process; not in writing
Medium
Annex IV §2(g)
Validation and testing procedures, including metrics
Where it lives in your stack
W&B reports; evaluation code; test pipelines
Common gap
Usually exists; not signed and dated
Medium
Annex IV §2(g)
Validation and testing procedures, including metrics
W&B reports; evaluation code; test pipelines
Usually exists; not signed and dated
Medium
Annex IV §2(h)
Cybersecurity measures
Where it lives in your stack
SOC 2 / ISO 27001 documentation; threat model; incident-response runbook
Common gap
Often exists for SOC 2 mature teams; needs Annex IV–shape mapping
Medium
Annex IV §2(h)
Cybersecurity measures
SOC 2 / ISO 27001 documentation; threat model; incident-response runbook
Often exists for SOC 2 mature teams; needs Annex IV–shape mapping
Medium
Annex IV §2(i)
Algorithms used (where disclosable)
Where it lives in your stack
Model cards; published architecture
Common gap
Usually disclosable in summary form
Low
Annex IV §2(i)
Algorithms used (where disclosable)
Model cards; published architecture
Usually disclosable in summary form
Low

Annex IV §3–§9 — operational and governance layers

The remaining paragraphs cover monitoring, performance metrics, the risk-management system, lifecycle changes, harmonised standards, the declaration of conformity, and post-market monitoring. The largest single gap on most engagements is §9 — the post-market monitoring plan rarely exists in writing when we walk in.

Cite

What the regulation asks for

Where it lives in your stack

Common gap

Default severity

Annex IV §3
Detailed information on monitoring, functioning and control of the system
Where it lives in your stack
Monitoring dashboards; SLO docs; on-call runbook
Common gap
Often exists in SRE docs; needs Annex IV–shape
Medium
Annex IV §3
Detailed information on monitoring, functioning and control of the system
Monitoring dashboards; SLO docs; on-call runbook
Often exists in SRE docs; needs Annex IV–shape
Medium
Annex IV §4
Performance metrics including by demographic subgroup (links to Article 15)
Where it lives in your stack
Slice-level metrics in training pipeline (fairlearn, Aequitas)
Common gap
Common gap — computed at training time but not exported per shipped system
High
Annex IV §4
Performance metrics including by demographic subgroup (links to Article 15)
Slice-level metrics in training pipeline (fairlearn, Aequitas)
Common gap — computed at training time but not exported per shipped system
High
Annex IV §5
Article 9 risk-management system description
Where it lives in your stack
Written RMS document
Common gap
Rarely exists in writing; sometimes informal in incident-review processes
High
Annex IV §5
Article 9 risk-management system description
Written RMS document
Rarely exists in writing; sometimes informal in incident-review processes
High
Annex IV §6
Description of relevant changes throughout the system's lifecycle
Where it lives in your stack
Change-control log; CHANGELOG.md
Common gap
Often complete in Git; needs a prose summary
Low
Annex IV §6
Description of relevant changes throughout the system's lifecycle
Change-control log; CHANGELOG.md
Often complete in Git; needs a prose summary
Low
Annex IV §7
Harmonised standards applied (where any)
Where it lives in your stack
ISO 42001 / harmonised standards mapping
Common gap
Rarely applicable in 2026; flag if relevant
Low
Annex IV §7
Harmonised standards applied (where any)
ISO 42001 / harmonised standards mapping
Rarely applicable in 2026; flag if relevant
Low
Annex IV §8
Copy of the EU declaration of conformity (Article 47)
Where it lives in your stack
Declaration template, signed
Common gap
Drafted as part of the Programme tier
Medium
Annex IV §8
Copy of the EU declaration of conformity (Article 47)
Declaration template, signed
Drafted as part of the Programme tier
Medium
Annex IV §9
Post-market monitoring plan (Article 72 link)
Where it lives in your stack
PMM plan document
Common gap
Largest single gap — PMM plan rarely exists in writing on engagement entry
Critical
Annex IV §9
Post-market monitoring plan (Article 72 link)
PMM plan document
Largest single gap — PMM plan rarely exists in writing on engagement entry
Critical

Operating articles bundled into the Mapping

Five Articles whose evidence trails are commonly included in the technical file even though they sit outside Annex IV proper. Article 12 (logging), Article 13 (transparency to deployers), Article 14 (human oversight), Article 15 (accuracy, robustness and cybersecurity), and Article 73 (incident reporting on the 15 / 10 / 2-day clocks).

Cite

What the regulation asks for

Where it lives in your stack

Common gap

Default severity

Art. 12
Automatic logging of events relevant to traceability
Where it lives in your stack
Inference-layer structured logs; trace IDs
Common gap
Usually exists; needs traceability proof and retention policy
Medium
Art. 12
Automatic logging of events relevant to traceability
Inference-layer structured logs; trace IDs
Usually exists; needs traceability proof and retention policy
Medium
Art. 13
Transparency / instructions to deployers
Where it lives in your stack
Deployer-facing docs
Common gap
Usually exists for SaaS; thin for embedded
Medium
Art. 13
Transparency / instructions to deployers
Deployer-facing docs
Usually exists for SaaS; thin for embedded
Medium
Art. 14
Human oversight design
Where it lives in your stack
UX patterns plus on-call runbook
Common gap
Same as §2(e); merged in practice
Medium
Art. 14
Human oversight design
UX patterns plus on-call runbook
Same as §2(e); merged in practice
Medium
Art. 15
Accuracy, robustness, cybersecurity assessment
Where it lives in your stack
Eval reports plus adversarial-test reports
Common gap
Robustness and cybersecurity tests often missing
High
Art. 15
Accuracy, robustness, cybersecurity assessment
Eval reports plus adversarial-test reports
Robustness and cybersecurity tests often missing
High
Art. 73
Serious incident reporting workflow (15 / 10 / 2-day clocks)
Where it lives in your stack
On-call rota plus incident process; classification SLA
Common gap
Detection exists; reporting SLA almost never
High
Art. 73
Serious incident reporting workflow (15 / 10 / 2-day clocks)
On-call rota plus incident process; classification SLA
Detection exists; reporting SLA almost never
High

How the Mapping is applied per engagement

Defaults are typical for a Series B/C ML team. Per-engagement, severity is overridden based on procurement materiality, engineering effort to close, and penalty exposure — Article 5 and Article 9 ties bump rows to Critical.

Engagement

What's done

Output

Diagnostic
Qualitative scan of all 30 rows for up to 3 systems. Material gaps flagged.
→ One-page gap snapshot.
Diagnostic
Qualitative scan of all 30 rows for up to 3 systems. Material gaps flagged.
One-page gap snapshot.
Readiness Sprint
Full 30-row analysis per system. Every row gets a status, severity and evidence note.
→ Annex IV gap analysis document.
Readiness Sprint
Full 30-row analysis per system. Every row gets a status, severity and evidence note.
Annex IV gap analysis document.
Programme
The Mapping becomes operational. New deployments populate the right rows automatically.
→ Maintained Annex IV technical file v1.0 plus the workflow that keeps it current.
Programme
The Mapping becomes operational. New deployments populate the right rows automatically.
Maintained Annex IV technical file v1.0 plus the workflow that keeps it current.

See the Mapping applied to your stack

The Diagnostic produces a one-page gap snapshot in 3 to 5 days. — we confirm classification and the highest-leverage gaps in the call, free.

Last reviewed · 8 May 2026