AI Agents Scale Production Despite Identity Security Gaps

AI agents are running hospital records and factory inspections at enterprise scale, but identity governance systems cannot track, scope, or revoke their access at machine speed. According to Cisco President Jeetu Patel at RSAC 2026, 85% of enterprises are running agent pilots while only 5% have reached production — an 80-point trust gap driven by identity management limitations.

Anthropic on Tuesday unveiled “dreaming”, a system that lets AI agents learn from their own past sessions and improve over time. Legal AI company Harvey saw task completion rates increase roughly 6x after implementing the feature, while medical document review company Wisedocs cut document review time by 50% using Anthropic’s outcomes feature.

Enterprise Agent Adoption Hits Security Bottlenecks

The gap between agent deployment speed and security readiness is creating incidents across industries. IANS Research found that most businesses still lack role-based access control mature enough for today’s human identities, and agents will make it significantly harder.

Gravitee’s 2026 State of AI Agent Security report, surveying more than 900 executives and practitioners, revealed that 88% of organizations reported confirmed or suspected AI agent security incidents in the past year. Only 14.4% of agentic systems went live with full security and IT approval.

The 2026 IBM X-Force Threat Intelligence Index reported a 44% increase in attacks exploiting public-facing applications, driven by missing authentication controls and AI-enabled vulnerability discovery. Apono found that 98% of cybersecurity leaders report friction between accelerating agentic AI adoption and meeting security requirements.

Attack Surface Expands Beyond Traditional LLM Boundaries

AI agents expose four distinct attack surfaces compared to standalone LLMs, according to security researchers. Traditional prompt attacks target only the input layer, but agents add tool execution, memory storage, and multi-agent coordination vectors.

The four agent attack surfaces include:

Prompt Surface: Reading external inputs and processing user commands
Tool Surface: Executing backend actions with system-level permissions
Memory Surface: Storing and retrieving data across sessions
Coordination Surface: Managing communication between multiple agent instances

Autonomous offensive security firm XBOW raised $35 million in Series C extension funding to address these expanded attack vectors. The company’s platform leverages AI reasoning and adversarial workflows to continuously test applications for vulnerabilities, operating autonomously to identify and validate security holes through real exploitation.

Anthropic’s Self-Learning Agents Show Production Promise

Anthropic’s “dreaming” capability allows Claude agents to analyze their own past sessions, identify patterns in successful and failed attempts, and adjust their approach for future tasks. The company moved two previously experimental features — outcomes and multi-agent orchestration — from research preview into public beta.

Key production results from early adopters:

Harvey (legal AI): 6x increase in task completion rates
Wisedocs (medical documents): 50% reduction in review time
Netflix: Processing logs from hundreds of builds simultaneously

Anthropic CEO Dario Amodei disclosed that the company’s growth has outpaced even its own aggressive internal projections in the first quarter of 2026. The announcements address what Anthropic identifies as the three hardest problems in running AI agents at scale: maintaining accuracy, enabling learning, and preventing bottlenecks on complex workflows.

Enterprise Architecture Shifts From Bot-Centric to Agentic

Forbes Technology Council research indicates that enterprise automation is evolving beyond traditional bot deployment metrics. Organizations that initially focused on deploying maximum numbers of bots are now facing “automation sprawl” — multiple platforms performing similar functions with fragmented governance and visibility.

The shift toward “agentic enterprises” prioritizes intelligent orchestration over bot quantity. This architectural evolution requires centralized governance models, unified credential management, and comprehensive monitoring across business units.

Common automation sprawl symptoms include:

Multiple platforms performing duplicate functions
Inconsistent governance across departments
Proliferated scripts and workflows
Fragmented visibility and reporting

What This Means

The 80-point gap between AI agent pilots and production deployments reflects a fundamental mismatch between agent capabilities and enterprise security infrastructure. While agents can now learn from their mistakes and coordinate complex workflows, most organizations lack the identity governance systems to safely manage non-human identities at machine speed.

The security challenge is architectural, not just tooling-related. Enterprises need unified frameworks for agent identity management, real-time access control, and comprehensive audit trails. Companies successfully scaling agents to production — like Harvey, Wisedocs, and Netflix — have invested heavily in these foundational security capabilities before deployment.

The emergence of autonomous security testing platforms like XBOW suggests the industry is developing AI-native approaches to validate agent security. However, the 88% incident rate indicates most organizations are learning these lessons through production failures rather than proactive security design.

FAQ

What is the main security challenge preventing AI agent adoption at scale?
Identity governance systems cannot track, scope, or revoke agent access at machine speed. Most enterprises lack role-based access control mature enough for human identities, making agent management significantly harder.

How do AI agent attack surfaces differ from traditional LLM security?
Agents expose four attack surfaces (prompt, tool, memory, coordination) compared to one for standalone LLMs (prompt only). This expanded attack surface includes backend system access, persistent memory, and multi-agent communication vectors.

What results are enterprises seeing from production AI agents?
Early adopters report significant improvements: Harvey achieved 6x higher task completion rates, Wisedocs cut document review time by 50%, and Netflix processes hundreds of build logs simultaneously using multi-agent orchestration.

Sources

AI agents are running hospital records and factory inspections. Enterprise IAM was never built for them. – VentureBeat
The AI Agent Security Surface: What Gets Exposed When You Add Tools and Memory – Towards Data Science
Anthropic introduces “dreaming,” a system that lets AI agents learn from their own mistakes – VentureBeat