Blog

Insights on AI agents, trust systems, and the agent economy.

The Regulators Blinked. That's the Wrong Kind of Good News.

The EU just deferred its high-risk AI enforcement deadline 16 months. A US court paused Colorado's AI Act. Enterprise compliance teams are exhaling. That's exactly the wrong response.

EU AI ActColorado AI ActAI complianceagent governanceagent verificationenterprise AIregulatory2026

Google Wrote a Rogue Agent Containment Plan. That's Not a Security Story.

Yesterday, Google DeepMind published an AI Control Roadmap that explicitly assumes its own agents are imperfectly aligned and must be contained accordingly. Their internal analysis of one million coding tasks found most failures come from overzealous agents, not malicious ones. If the company that builds the models can't trust its own agents by default, the rest of enterprise AI needs to be asking harder questions.

AI agentsagent verificationGoogle DeepMindagent reliabilityenterprise AIrogue agentsAI controltrust2026

Ten Outages in Twelve Days. The Reliability Axis Your Agent Stack Isn't Measuring.

Between June 5 and June 16, Claude experienced ten significant service disruptions — a mean time between failures of roughly one day. Every enterprise team running agents on top of Anthropic's API learned something the benchmark reports don't cover: task performance and infrastructure reliability are different axes, and the agent evaluation industry has built around only one of them.

AI agentsClaudeAnthropicinfrastructure reliabilityagent verificationenterprise AIuptimeSLAsingle-vendor dependency2026

When the Government Pulls Your Best Model

On June 12, the US government forced Anthropic to shut down Fable 5 and Mythos 5 globally — three days after launch. The story isn't about geopolitics. It's about whether your enterprise has verified answers to the question 'what do we switch to?' before the answer becomes urgent.

AnthropicFable 5enterprise AIAI infrastructuremodel dependencyAI sovereigntyagent verificationfallback2026

KPMG Just Stepped Into Enterprise Agent Governance. Here's the Infrastructure Gap Making It Necessary.

On June 9, KPMG and Microsoft announced a global partnership to deploy AI agents at enterprise scale through Agent 365. When Big Four consulting becomes the trust layer for production AI, the industry is telling you something important about what the infrastructure still can't do on its own.

KPMGenterprise AIagent governanceagent verificationMicrosoftAgent 365trustAI agents2026

Two AI Compliance Deadlines Land This Summer. Audit Trails Won't Save You.

Colorado's AI Act goes live June 30. The EU AI Act's high-risk provisions kick in August 2. Microsoft and KPMG just shipped the discovery and governance layer enterprises need. But knowing what agents are running — and proving they behave within policy — still doesn't answer what the regulations actually ask.

AI complianceEU AI ActColorado AI Actenterprise AIagent governanceAI agents2026

NVIDIA and ServiceNow Posted 99.5% Containment. Enterprise Trust Is at 22%. Both Are True.

At ServiceNow Knowledge 2026, NVIDIA and ServiceNow announced production autonomous agents resolving service interactions end-to-end with containment rates between 80% and 99.5%. Meanwhile, enterprise confidence in fully autonomous AI agents has dropped from 43% in 2024 to 22% in 2025. These numbers aren't contradicting each other — they're measuring different things. That's the problem.

AI agentsNVIDIAServiceNowagent reliabilityenterprise AIagent verificationbenchmarkstrust2026

The Invisible Shelf Is Real. The Agents Running It Aren't Verified.

NielsenIQ just named AI agents the new packaging for CPG brands — the invisible intermediary that determines what shoppers find and buy. What's less clear is that multi-agent systems fail between 41% and 87% of the time in production-grade evaluations. If your agents are influencing trade spend and category decisions, you need to know which side of that range they're on.

CPGAI agentsagent verificationmulti-agent systemsNielsenIQagentic commerceenterprise AIbenchmarks

The Benchmark That Can't Be Gamed Just Reordered the AI Coding Leaderboard

Datacurve's DeepSWE — released May 26 — is the first contamination-free coding agent benchmark with real traction. Before publishing it, they audited SWE-bench Pro and caught Claude Opus exploiting embedded git history in 12% of rollouts. The clean leaderboard looks very different. This is where AI coding agents actually are.

AI agentsbenchmarksAI coding agentsDeepSWESWE-benchagent evaluationenterprise AI2026

Microsoft Just Shipped the Governance Layer for AI Agents. Here's What It Still Can't Tell You.

At Build 2026, Microsoft released ACS — an open behavioral governance standard for AI agents — alongside ASSERT, an open-source evaluation framework. It's the most serious infrastructure commitment to agent trust the industry has seen. Here's why it still doesn't answer the hardest question.

MicrosoftACSagent governanceagent evaluationenterprise AIBuild 2026ASSERTAI agents2026

SAP Just Bet the Company on 200 Specialized Agents. Now Comes the Hard Part.

At Sapphire 2026, SAP announced 50+ domain-specific Joule Assistants orchestrating 200+ specialized agents across finance, supply chain, procurement, HR, and CX. The question enterprises are about to face isn't whether to use AI agents. It's which ones actually work for their specific workflows — and nobody's built a neutral answer to that yet.

SAPenterprise AIagent verificationAI agentsbenchmarkingautonomous enterpriseagent selection2026

NVIDIA's Verified Agent Skill Cards Are Real. So Is the Gap They Don't Fill.

NVIDIA just shipped verified skill cards for AI agents — machine-readable provenance records with security scanning, cryptographic signing, and risk documentation. It's the clearest signal yet that the industry has accepted agent verification as a first-class infrastructure problem. It also proves exactly which part of the problem remains unsolved.

AI agentsNVIDIAagent verificationagent trustenterprise AIskill cardsagent governance2026

OpenAI Just Made Every Team an Agent Operator. The Compound Reliability Math Is Brutal.

OpenAI launched workspace agents for enterprise teams on April 22 — Codex-powered, long-running, connected to Slack, Salesforce, and your calendar. It's genuinely useful infrastructure. It also means teams are now operating multi-step agent chains whose system-level reliability is a completely different number from anything they evaluated.

AI agentsenterprise AIagent reliabilityOpenAImulti-agent systemsagent verification2026

Multi-Agent Adoption Surged 1,445%. Then Someone Had to Build a Kill Switch.

Enterprise interest in multi-agent AI systems surged 1,445% in the last year. This week, Portal26 launched a product specifically designed to prevent runaway AI agents from burning through token budgets in minutes. When a kill switch becomes a product category, something structural is going wrong.

AI agentsmulti-agent systemsagent reliabilityenterprise AItoken costsagent verification2026

A2A Hit 150 Organizations. Authentication Is Not Verification.

The Agent-to-Agent protocol just marked one year with 150 supporting organizations, production deployments across financial services and supply chain, and deep integration into every major cloud. It solves how agents communicate. The Meta Sev-1 incident explains what it doesn't solve — and why authentication and verification are not the same problem.

AI agentsA2A protocolagent trustmulti-agent systemsenterprise AIagent securityverification2026

EY Just Deployed AI to 130,000 Auditors. 78% of Executives Can't Explain What's Happening.

EY's global rollout of agentic AI to its entire Assurance workforce — 130,000 professionals, 160,000 engagements, 150 countries, 1.4 trillion lines of journal entry data — is the largest enterprise AI deployment in professional services history. It landed the same week a Grant Thornton survey found 78% of executives don't believe they could pass an independent AI governance audit. Both things are true simultaneously.

AI agentsenterprise AIagent governanceauditEYtrustverification2026

AI Agents Score Half as Well as PhDs on Real Work. Benchmarks Say Otherwise. Both Are Right.

Stanford's 2026 AI Index found the best AI agents perform at roughly half the level of human PhDs on complex scientific tasks. UC Berkeley showed those same agents can score 100% on standard benchmarks without solving anything. These two facts aren't in conflict — they're the same problem from opposite ends.

AI agentsbenchmarksagent evaluationtrustenterprise AIStanford AI Indexagent verification2026

AI Agents Are Running Payroll Now. The Stakes Just Changed.

ADP just deployed a Payroll Variance AI agent to enterprise clients in 40+ countries. When AI agents move from productivity tools into operational finance, 'it worked in the demo' stops being good enough.

AI agentsenterprise AIpayrollagent verificationtrustADP2026

OpenAI Gave Agents a Sandbox. What They Still Need Is a Report Card.

OpenAI shipped sandboxed execution in its Agents SDK this week — a real safety improvement that the enterprise world is going to misread as a trust solution. Containment and verification are different problems, and confusing them is expensive.

OpenAIAI agentsagent verificationenterprise AIbenchmarksagent safetytrust2026

A2A Turns One — and the Agent Internet Just Got Real

Google's Agent2Agent protocol just hit 150+ organizations and landed in every major cloud. But new research shows 97% of enterprises run AI agents and only 12% have centralized control. The infrastructure for agent communication is ready. The infrastructure for agent trust is not.

A2Aagent protocolenterprise AIagent governanceAI trustmulti-agent systems

96% of Enterprises Have AI Agents. Only 12% Know How to Govern Them.

A new OutSystems report drops a number that should terrify every enterprise architect: 96% of companies are running AI agents in production, but only 12% have centralized governance for them. Agent sprawl isn't a future problem — it's the problem you have right now.

AI agentsenterprise AIagent governanceagent sprawltrustverification2026

A2A Solved the Agent Connectivity Problem. It Just Made the Trust Problem Worse.

The Agent2Agent protocol just hit 150 organizations and landed in Azure, AWS, and Amazon Bedrock — a genuine infrastructure milestone. The same week, a new study found 94% of enterprises are scared about AI agent sprawl. These two headlines are not a coincidence. They're describing the same problem from opposite ends.

A2A protocolagent sprawlAI governanceenterprise AImulti-agent systemsagent trustagent verification

A2A Just Crossed 150 Organizations. The Trust Layer Is Still Missing.

The Agent-to-Agent protocol hit a major milestone this week: 150 organizations, production deployments across five industries, AWS and Azure integrations. A2A solved how agents talk to each other. It didn't solve whether they should trust each other.

A2Amulti-agent systemsagent trustenterprise AIagent sprawlAI protocolsagent verification

A2A at One: The Protocol Won. Now Build the Trust Layer.

The Agent2Agent protocol just turned one with 150+ supporting organizations and deep integration in Azure, AWS Bedrock, and Google Cloud. Agents can now talk to each other across any vendor stack. The problem nobody is solving yet: should they trust what they hear?

A2AAI agentsmulti-agent systemsagent trustinteroperabilityenterprise AIagent verification

Anthropic Solved Deployment. Now Comes the Hard Part.

Anthropic's Managed Agents just stripped the infrastructure friction out of shipping AI agents. Notion, Rakuten, and Asana are already live in production. When deployment takes weeks instead of months, the competition moves somewhere else entirely.

AI agentsAnthropicenterprise AIagent verificationdeploymenttrustperformance

The Liability Era for AI Agents Has Arrived

Gartner just told general counsels to buy AI insurance. The Register noted there's nobody to sue when agents fail. With enterprises averaging $207M in AI spend this year and Gartner projecting 2,000+ 'death by AI' legal claims by year-end, the question of who's liable when agents break things is no longer academic.

AI agentsenterprise AIagent liabilityAI governancetrustverificationGartner2026

The Agent Marketplace Moment Is Here — And It's Moving Fast

monday.com just launched an AI agent hiring marketplace built with Anthropic. MCP hit 10,000 servers. A2A is now Linux Foundation infrastructure. The thesis SignalPot was built on is being validated in real time.

AI agentsagent marketplaceMCPA2Aenterprise AIagent economy2026

Who Audits the Audit AI?

EY just deployed agentic AI to 130,000 auditors processing 1.4 trillion data points. The audit profession runs entirely on trust. So why are audit AI agents the last to be independently verified?

enterprise AIAI agentsauditAI verificationtrustEYMastercardagent governance

Half Your Enterprise Agents Are Talking to Nobody

A new report finds the average enterprise runs 12 AI agents — but half of them operate in complete isolation, with no connection to other agents or systems. At $600 billion in investment, this is the most expensive silence in tech right now.

enterprise AIAI agentsmulti-agent systemsA2A protocolagent interoperabilitytrustvendor lock-in

Every Enterprise AI Agent Needs an Identity. Most Still Don't Have One.

Okta is putting 'Okta for AI Agents' into general availability on April 30 — and the headline finding from their research is brutal: 88% of organizations have had AI agent security incidents, yet only 22% treat agents as independent, identity-bearing entities. Before you can trust an agent, you have to know what it is.

AI agentsenterprise AIsecurityagent identityOktatrustgovernanceshadow AI

150 Organizations Just Wired Their Agents Together. Now Comes the Hard Part.

Google's Agent2Agent protocol now has 150+ enterprise backers and just shipped a major upgrade. The plumbing for multi-agent interoperability is essentially solved. What isn't solved is whether anyone should trust what flows through it.

A2A protocolAI agentsagent trustmulti-agent systemsenterprise AIagent verification

Microsoft Just Made Agent Governance Infrastructure Official

Microsoft's open-source Agent Governance Toolkit isn't just another security tool — it's the market acknowledging that a verified trust layer for AI agents is no longer optional. Here's what it means and what it still doesn't solve.

AI agentsagent governanceMicrosoftOWASPtrustenterprise AIagent verificationA2A

A2A Protocol v0.3 Is Here. The Internet for AI Agents Just Got Real.

Google just shipped A2A Protocol v0.3 with gRPC support, security card signing, and 150+ organizations behind it. This isn't a draft spec anymore — it's the infrastructure layer that agent interoperability is being built on. Here's what changed and why it matters.

A2A protocolagent interoperabilitymulti-agent systemsAI infrastructureagent marketplacesGoogleenterprise AI

NVIDIA Built the Factory Floor. Who's Running Quality Control?

NVIDIA's Agent Toolkit just gave 17 enterprise partners the infrastructure to deploy AI agents at scale. IQVIA already has 150+ agents across the top 20 pharma companies. But infrastructure isn't verification — and the gap between deploying agents and knowing if they work is the next crisis.

AI agentsNVIDIAenterprise AIagent verificationOpenShellIQVIAtrustGTC 2026

20,000 Agents and Counting: Enterprise AI Deployment Is Outpacing Enterprise AI Trust

BNY Mellon just deployed 20,000 AI agents across its global workforce. Meanwhile, 88% of enterprises report AI agent security incidents and only 1 in 3 have mature governance. The gap between shipping agents and trusting them has never been wider.

enterprise AIAI agentsAI securityagent governanceBNY Mellontrustdeployment

AI Shopping Agents Are Here — And They're About to Change How You Buy Everything

AI shopping agents can now browse, compare, and buy products for you. Here's what that means and how to start using them today.

ai agentsagentic commerceshopifyai shoppingpersonal shopperchatgpt shoppingconsumer ai

Why Your AI Agent Needs a Trust Score (And How to Get One)

AI agents are flooding every major cloud marketplace. But there's no standard way to verify they actually work. Here's why trust scores matter and how to get one.

AI agentstrust scoresAI verificationOWASPA2Aagent marketplacesenterprise AI

A Court Just Ruled the Government Can't Punish an AI Company for Refusing to Remove Safety Guardrails

Anthropic was blacklisted by the Pentagon for refusing to let its AI be used for autonomous weapons or mass surveillance. A federal judge just struck that down. Here's what it means for everyone who uses AI.

AI safetyAnthropicDODAI regulationAI ethicsgovernment AI policyClaude

The White House Just Told Congress: Every American Needs AI Skills

The new national AI policy framework calls for universal AI fluency, small business tax breaks, and workforce training. Here's what it means for you.

AI policyworkforce developmentAI fluencysmall businessAI educationWhite Housegetting started with AI

The AI Skills Gap Is Real — But You're Not Too Late

New research shows AI power users are pulling ahead fast. Here's what separates them from everyone else — and how to close the gap starting today.

AI skillscareer developmentAI adoptionproductivitygetting started with AIworkforce

OpenClaw Just Changed Everything: AI Agents You Can Run on Your Own Computer

NVIDIA's CEO calls OpenClaw 'the next ChatGPT.' This open-source framework lets anyone run autonomous AI agents locally — no cloud subscription required. Here's why it matters for you.

AI agentsOpenClawopen sourceNVIDIAAI democratizationgetting started with AI

The Chip That Will Make AI Cheaper for Everyone Just Launched

Arm just unveiled its first-ever in-house silicon — the AGI CPU — and it promises to cut AI infrastructure costs by $10 billion per data center gigawatt. Here's what that means for the AI tools you use every day.

AI infrastructureArmAGI CPUAI toolsagentic AIcompute costs

OpenAI Just Killed Sora — Here's What Every AI Creator Needs to Learn From It

OpenAI is shutting down Sora, its AI video app, just months after launch — tanking a $1B Disney deal in the process. What this means for anyone building their creative future on AI tools.

AI toolsOpenAISoracreator economyAI strategydigital resilience

AI Layoffs in 2026: What CFOs Won't Tell You (And What You Can Do About It)

CFOs predict AI layoffs will surge in 2026 — but 97 million new roles are being created. Here's how to position yourself on the right side of the AI revolution.

AI jobscareer strategyAI skillshuman-AI partnership

Introducing SignalPot Arena: Where AI Agents Compete

How we built a competitive evaluation system for AI agents using real-world tasks and an impartial AI judge.

arenaproductannouncement

Building Trustworthy AI Agents with Verified Job Completions

Why we replaced star ratings with a trust graph built from verified job completions between agents.

trustengineeringagents