Live feed · Deduped · Updated every hour · Latest check Jun 23, 11:00 AM GMT+1

Intel Centre

Agentic AI Security stories summarised for CTOs and CISOs: what changed, why it matters, and what to review before agents get more autonomy.

Research · arXiv cs.CR agent security · Jun 23, 08:01 AM GMT+1 · Latest

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

Executive summary

arXiv cs.CR agent security reports that multi-agent systems (MAS) offer a scalable path forward for agentic AI. A MAS comprises multiple LLM-based agents, each assigned a system prompt and a position within a workflow that governs inter-agent coordination and output aggregation. Because agents consume each other's outputs, evaluator bias and bad assumptions can spread across the workflow, creating correlated failures instead of independent checks. Avoid treating another model's judgement as neutral evidence. Log provenance and diversify evaluation paths for high-impact decisions.

Why it matters

CTO/CISO takeaway: model-to-model trust can create correlated failures. Preserve evidence, diversify checks, and avoid using one agent as the sole control for another.
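One way to picture "diversify checks" and "preserve evidence" is a small approval gate. This is a minimal sketch under stated assumptions, not a method from the paper: the `Verdict` fields, the `approve_high_impact` function, and the idea of counting distinct model families are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Verdict:
    """One evaluator's judgement, kept together with its provenance."""
    evaluator: str     # hypothetical evaluator name
    model_family: str  # which underlying model family produced the verdict
    approved: bool
    evidence: str      # what the evaluator actually looked at


def approve_high_impact(verdicts, min_families=2):
    """Approve only if all verdicts approve AND they come from at least
    `min_families` distinct model families (an assumed proxy for
    independence). Always returns an audit record alongside the decision."""
    families = {v.model_family for v in verdicts}
    approved = len(families) >= min_families and all(v.approved for v in verdicts)
    audit_record = {
        "approved": approved,
        "families": sorted(families),
        "verdicts": [
            (v.evaluator, v.model_family, v.approved, v.evidence)
            for v in verdicts
        ],
    }
    return approved, audit_record
```

Two approvals from evaluators built on the same model family would be rejected here, since they may share the same bias. The audit record is what a reviewer would inspect afterwards.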

Context Firewall angle

For agent builders, the operational question is which source of context should be allowed to influence which action.
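The question of which context source may influence which action can be sketched as a deny-by-default allowlist. This is an illustrative assumption only, not a description of any product's implementation: the source labels, action names, and the `authorize` helper below are hypothetical.

```python
# Hypothetical policy: which context sources may influence which actions.
# Any source or action not listed is denied by default.
ALLOWED = {
    "system_prompt": {"read_file", "run_tool", "send_email"},
    "user_message": {"read_file", "run_tool"},
    "retrieved_web_page": set(),  # untrusted content may inform, never trigger
}


def may_influence(source, action):
    """True if `source` is permitted to influence `action`."""
    return action in ALLOWED.get(source, set())


def authorize(action, sources):
    """Allow `action` only if every contributing context source is permitted.
    Returns (allowed, blocked_sources)."""
    blocked = sorted(s for s in sources if not may_influence(s, action))
    return (not blocked, blocked)
```

For example, `authorize("send_email", ["system_prompt", "retrieved_web_page"])` would be refused because a web page contributed to the decision, while the same action driven only by the system prompt would pass.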

Intel Centre

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

Composing Verifiable Conceptual Models via Building Blocks: Towards Design-Time Verification of Agentic AI Workflows

Dissecting Agentic RAG: A Component Ablation for Multi-Hop QA with a Local 7B Model

CISA, US and International Partners Release Guide to Secure Adoption of Agentic AI

Aikido and OWASP bring agentic Code Audit to the global AppSec community

Juice Shop v20.0.0 — a fresh squeeze of features, now with AI

Temporary Cloudflare Accounts for AI agents

Introducing the Cloudflare One stack: agent-powered deployment

Defend against frontier cyber models: Cloudflare's architecture as customer zero

How we built Cloudflare's data platform and an AI agent on top of it

Trust No Skill: Integrity Verification for AI Agent Supply Chains

Dirty Frag: Linux Kernel Local Privilege Escalation via ESP and RxRPC

AutoJack: How a single page can RCE the host running your AI agent

Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving

LedgerAgent: Structured State for Policy-Adherent Tool-Calling Agents

Sovereign Execution Brokers: Enforcing Certificate-Bound Authority in Agentic Control Planes

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Efficient and Sound Probabilistic Verification for AI Agents

Contagion Networks: Evaluator Bias Propagation in Multi-Agent LLM Systems

Embedding Forbidden Text in Spyware to Discourage AI Analysis

Beyond the benchmark: Advancing security at AI speed

Safeguarding VS Code against prompt injections

AI threats in the wild: The current state of prompt injections on the web

Google Workspace’s continuous approach to mitigating indirect prompt injections

Architecting Security for Agentic Capabilities in Chrome

Mitigating prompt injection attacks with a layered defense strategy

A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry

How Endava is redesigning software delivery around AI agents