Research2026-06-19

Execution-bound advisory automation for agentic AI: a reproducible AIBOM-driven CSAF-VEX framework

arXiv:2606.19390v1 Announce Type: cross Abstract: A protocol driven framework is presented that binds SBOM and AIBOM artefacts to deterministic environment capture and structured runtime telemetry. Exploitability is computed from declared artefacts, observed activation conditions, and enforced...

What Happened

Researchers have proposed a protocol-driven framework that formalizes how to automate advisory and execution-bound decisions for agentic AI systems. The approach integrates Software Bill of Materials (SBOM) and AI Bill of Materials (AIBOM) artifacts with deterministic environment capture and structured runtime telemetry. Crucially, it leverages the Common Security Advisory Framework (CSAF) and the Vulnerability Exploitability eXchange (VEX) to compute exploitability based on declared artifacts, observed activation conditions, and enforced runtime constraints.

The framework is explicitly designed to be reproducible—meaning that the same inputs should yield the same advisory outputs, enabling consistent auditing and verification across different deployments. By binding AIBOM records (which catalog model components, training data provenance, and dependencies) to real-time telemetry, the system can determine whether a known vulnerability is actually exploitable in a given agentic context, rather than merely flagging it as present.

Why It Matters

Agentic AI systems—those that autonomously plan, reason, and execute actions—introduce a fundamentally new attack surface. Unlike traditional software, where vulnerabilities are static and environment-agnostic, agentic AI exploitability depends on dynamic factors: what tools the agent has access to, what prompts it receives, what data it can retrieve, and how its reasoning chain unfolds.

Current vulnerability management approaches treat AI components as black boxes, often relying on static scans that produce high false-positive rates. This framework addresses a critical gap: it moves from "is this vulnerability present?" to "is this vulnerability exploitable given this agent's current state and permissions?" The CSAF-VEX standard, originally designed for software, is repurposed here to express nuanced exploitability assessments that account for runtime context.

For the industry, this represents a maturation of AI security from ad hoc checklists toward structured, auditable governance. Regulators and enterprise risk officers have been demanding exactly this kind of deterministic, reproducible assurance for autonomous systems.

Implications for AI Practitioners

For MLOps and AI engineers: Expect to see AIBOM generation become a standard CI/CD step, similar to how SBOM generation is now mandated for software supply chains. You will need to instrument agentic systems to emit structured telemetry that maps back to declared artifacts. For security teams: The framework provides a mechanism to automate vulnerability triage for AI agents. Instead of manually assessing each CVE against an agent's capabilities, you can feed runtime telemetry into a VEX processor that computes actual exploitability. This reduces alert fatigue and enables faster response. For compliance and governance: Reproducible advisory outputs mean that audit trails become defensible. If an agent is compromised, you can replay the exact conditions that made a vulnerability exploitable—or prove that it was not exploitable under declared constraints. For researchers: The paper's emphasis on reproducibility is a challenge to the field. Many current AI security benchmarks are not repeatable across environments. This framework sets a higher bar for empirical rigor.

Key Takeaways

A new framework binds AIBOM artifacts to runtime telemetry, enabling context-aware exploitability assessment for agentic AI systems using CSAF-VEX standards.
The approach shifts vulnerability management from static presence checks to dynamic exploitability determination, reducing false positives in complex agent environments.
Practitioners should prepare for AIBOM generation to become a standard practice, requiring instrumentation of agentic systems for structured telemetry output.
Reproducibility of advisory outputs is a core design goal, supporting auditable governance and regulatory compliance for autonomous AI deployments.

Read Original Article on Arxiv CS.AI

arxivpapersagents