Who Vouches for the Machine?
AI Agent Identity in an Automated World
If an AI conducts your business, books your flights, wires your money, and signs your contracts — how does the world know it’s actually you? This is the identity problem that keeps security engineers up at night.
01 · Background: The Problem of Trust
Before we talk about solutions, we need to feel the weight of the problem.
Every day, millions of AI agents take actions in the real world. Some book calendar appointments. Some respond to customer service emails. Others scan financial data, place trades, and file expense reports. By 2025, machine-to-human identity ratios in some enterprise environments had reached 144-to-1 — meaning for every person who needed to verify who they were, 144 automated systems or agents needed to do the same.[1]
This explosion in non-human identities creates a question that existing technology was never designed to answer cleanly: when a machine shows up at the digital door and says “I’m here on behalf of Sarah Chen to transfer $12,000 to a contractor” — how does the door know that’s actually true?
For a human, proving identity is relatively familiar. You show a passport. You type a password. Your face gets scanned. You receive a one-time code. These mechanisms work because a human is present, making choices, accepting accountability. But an AI agent acts autonomously and at scale — it can execute hundreds of actions in parallel, at any hour, on any continent, without a human watching every step.[2] It doesn’t get tired and make careless mistakes. But it also can’t “just know” when something seems wrong the way a human might.
The answer to the identity question isn’t a single technology. It’s a whole architecture — a layered system of cryptographic credentials, delegation protocols, and auditable logs that must work together so that when an AI agent acts, every party can verify three things: who the agent is, who authorized it, and what it is allowed to do.[2] Getting any one of these wrong creates a crack that bad actors can exploit.
02 · History: From Passwords to Protocols
You cannot understand where we are without understanding the half-century of improvised scaffolding that got us here.
The identity problem is not new. It’s just gotten geometrically harder. Let’s trace the engineering decisions — and their costs — that built the infrastructure we’re now racing to extend for AI agents.
OAuth looks complicated because it solves a genuinely hard problem: delegated access without shared credentials, across untrusted networks, for multiple client types.
— Dev.to / Tyson Cung, “OAuth 2.0 Explained” (2026)
03 · Definitions Glossary
The field runs on jargon. Strip it down to function and it becomes tractable.
04 · How AI Agent Identity Works Today
Three interlocking mechanisms. Each necessary. None sufficient alone.
Mechanism 1 — OAuth 2.0 Tokens (The Wristband Model)
The most common approach today adapts OAuth 2.0 — originally designed for human-approved logins — for machine-to-machine use. When a user wants to empower an AI agent, they log in and explicitly consent. The authorization server issues the agent a short-lived access token. The token encodes what the agent can do, for how long, and on whose behalf.
The June 2025 update to the MCP authorization specification formalized this approach, classifying MCP servers as OAuth Resource Servers and requiring clients to implement RFC 8707 Resource Indicators to prevent malicious servers from obtaining tokens meant for other services.[9] The token is the wristband: it doesn’t contain your passport, but it carries enough information to get you into the right rooms.
Mechanism 2 — Cryptographic Key Pairs (The Unforgeable Seal)
Every serious implementation assigns each AI agent its own public/private key pair. The agent uses its private key to sign every request it makes, generating a cryptographic signature. Any service receiving a request can verify the signature using the agent’s public key — confirming the request actually came from that specific agent and hasn’t been tampered with in transit.
Companies like Keycard have built tooling that assigns each agent its own verifiable identity at runtime through attestation, with no long-lived credentials written to disk. When a task begins, a session is created that binds every subsequent action to the originating user and request. Every token in the chain is traceable, revocable, and expires at the end of the session.[10]
Mechanism 3 — Decentralized Identifiers + Verifiable Credentials (The Self-Sovereign Stack)
The most forward-looking approach gives each AI agent a Decentralized Identifier (DID) — a permanent, cryptographically controlled identity that no company owns or can revoke unilaterally. Combined with Verifiable Credentials (VCs) issued by trusted third parties, this creates a system where agents can prove who they are, what authority they hold, and what organization deployed them — to any other system, across any platform boundary, without needing to call back to a central server.[8]
Agents can now prove ownership of their self-controlled identifiers for authentication purposes and establish cross-domain trust relationships through the spontaneous exchange of their identity-bound credentials.
— arXiv 2511.02841, “AI Agents with Decentralized Identifiers and Verifiable Credentials” (Oct 2025)
05 · How an Organization Delegates Authority to an AI Agent
Delegation is the formal act of passing a portion of your authority to someone — or something — else. The mechanics matter enormously.
When a company deploys an AI agent, it needs to answer a deceptively complex question: exactly how much of our authority do we hand over, and how do we keep track of what the agent does with it? The current answer involves multiple interlocking components:
The Agent Registry
The Cyber Security Agency of Singapore’s 2025 Addendum on Securing Agentic AI requires organizations to maintain a trusted agent registry — a centralized or federated list of every AI agent the organization has authorized, what it can do, and under whose name it operates.[7] Think of this like an employee directory — before an agent can act in the company’s name, it must be registered and credentialed.
Scoped, Short-Lived Tokens
One of the hardest-won lessons from OAuth’s history is that broad, long-lived permissions are a liability. Modern delegation protocols apply the least privilege principle aggressively: an agent booking a hotel room gets a token that says “read and write to the travel booking service, for the next 60 minutes.” It cannot access the payroll system. It cannot send emails. The token expires. The scope is mathematically enforced, not just promised.[10]
Natural Language to Formal Permissions
One emerging area of research addresses a very practical problem: users say things like “handle my travel” in natural language, but access control systems need formal, auditable permission specifications. Researchers at MIT and collaborating labs have proposed frameworks for translating flexible, natural language permissions into auditable access control configurations — so when a user says “take care of my calendar this week,” the system correctly converts that into: read/write calendar service; no email; no financial transactions; expires Sunday midnight.[11]
The Delegation Chain Diagram
06 · How an AI Proves It Represents a Specific Human
This is the deepest technical question. It’s not just identity — it’s delegated identity, with proof.
An AI agent needs to answer two distinct questions simultaneously: “Who am I?” and “Who authorized me?” These aren’t the same answer. A legitimate agent could be acting for the wrong person if the delegation chain is spoofed. This is why researchers distinguish between identity verification and authority verification.[2]
The Verifiable Credential Approach
The most technically rigorous approach uses W3C Verifiable Credentials (VCs). Here’s the functional model: an organization issues a VC to an AI agent that encodes a specific set of claims — “This agent is authorized by Sarah Chen (DID: did:example:sarah123) to access the travel booking service, with scope limited to read/write reservations, valid until 2026-06-10T23:59:00Z, issued by Nexus Frontiers AI.” The credential is cryptographically signed by the organization.
When the agent presents this credential to a travel booking service, the service doesn’t need to call Nexus Frontiers AI to check. It verifies the cryptographic signature directly. If the signature checks out, the claims are true. If the credential is expired or revoked, it fails. No central lookup needed. No phone call. Just math.[12]
The DID Binding
Each AI agent has its own Decentralized Identifier (DID) — a long string like did:web:agent.nexusfrontiers.ai:agent-42 — that is cryptographically controlled by a key pair held by the agent at runtime. This DID is registered in a verifiable data registry (which could be a public blockchain or a trusted institutional registry). When the agent signs a request, anyone can look up the DID, retrieve the agent’s public key, and verify the signature. The W3C formalized this standard in the DID specification v1.1 (Candidate Recommendation, March 2026).[8]
The Cross-Agent Problem
Things get harder when one AI agent delegates to another sub-agent. This is called a multi-agent system, and it’s increasingly common — an orchestrator agent assigns tasks to specialized worker agents. Each hop in this chain must maintain the original user’s authorization context. Researchers have identified that without careful design, these chains create attack surfaces including “confused deputy attacks” (an agent is tricked into using its privileges on behalf of a different, unauthorized principal) and “delegation hijacking” (a malicious actor inserts themselves into a delegation chain).[13]
07 · Threats and What Can Go Wrong
The attack surface for AI agent identity is not theoretical. It’s already being exploited.
The NSA published guidance on MCP security in 2026, citing real vulnerabilities: Asana’s MCP implementation exposed data across different customer instances; Microsoft 365 Copilot was vulnerable to hidden prompts that exfiltrated sensitive data (CVE-2025-32711); the mcp-remote package — downloaded over 437,000 times — contained a remote code execution vulnerability (CVE-2025-6514).[14]
The pattern is consistent: the specification often has the security model correct. What fails is enforcement at implementation time. Writing a standard is the easy part. Making every developer follow it, consistently, under deadline pressure, is the genuinely hard part.
08 · Worked Example: Step by Step
The underlying concept, concretely deployed. No assumed technical knowledge required.
Let’s walk through a realistic scenario: Maria, a small business owner, authorizes her AI assistant (“ArioAI”) to book a hotel room for a client visit next week. Here is what happens under the hood, step by step, using the current best-practice architecture.
eyJhbGciOiJSUzI1NiIsInR5cCI6IkpXVCJ9... It encodes: owner = Maria, agent = ArioAI, scope = hotel_read + hotel_book, expires = June 4 2026 12:00AM. No other permissions exist in this token — not flights, not car rentals, not Maria’s email.Check A: Is the access token valid and not expired? (It checks the token’s cryptographic signature against the authorization server’s public key.) ✓
Check B: Does the token’s scope cover the requested action (hotel booking)? ✓
Check C: Is the request signature valid — does it match ArioAI’s known public key? ✓
All three pass. The service processes the hotel search for Maria’s dates and budget.
09 · Where This Is Headed
The architecture is converging. The governance is not.
The technical standards are maturing rapidly. The W3C DID specification reached Candidate Recommendation status in March 2026. The OAuth working group continues developing OAuth 2.1 — a consolidation of best practices — with the latest draft released in May 2024.[15] Anthropic’s MCP has been extended with formal cross-app authorization controls. The EU’s eIDAS 2.0 (Regulation 2024/1183) is building cryptographic infrastructure into the European Digital Identity Wallet that will natively support AI agent identity.[7]
What remains genuinely unsolved is the governance layer. Who decides which AI agents are trustworthy enough to be issued Verifiable Credentials? Who runs the registries? How do you revoke an agent’s identity when it’s been compromised — and what happens to the actions it already took? These are legal, institutional, and political questions as much as technical ones.
Enterprises are rebuilding business functions around AI agents. Right now, the developers building these systems have to choose: give agents broad access and they’re ungovernable, or lock them down and lose what makes them valuable. Agents with their own identity, per-task delegation, and no standing privileges don’t experience this trade-off.
— Ian Livingstone, Co-founder & CEO of Keycard (2026)
The research community is clear-eyed: this is a structural identity problem, not a prompt-engineering problem. The highest-impact agentic failures — impersonation, delegation hijacking, cascading compromise — are authorization failures, not alignment failures. Fixing them requires the same rigor applied to securing human identities, applied at machine speed and machine scale.[13]
The honest assessment: we are in the middle of building the infrastructure. The math works. The protocols are being written. The implementations are inconsistent. And the agents are already acting.
// Summary: The Five Things That Must Be True for Safe AI Agent Identity
- The agent has its own unique, verifiable identity — a cryptographic key pair and/or DID, not borrowed credentials from its owner.
- The delegation is explicit and auditable — a signed credential or scoped token that records exactly who authorized what, when, and for how long.
- Permissions are minimal and time-limited — the agent can only do what the current task requires, and access expires automatically.
- Every action is signed and logged — so there’s an immutable record of what the agent did, traceable back to the authorizing principal.
- Credentials can be revoked instantly — if something goes wrong, the chain can be broken without requiring the agent to cooperate.
Sources
All sources verified for factual accuracy and accessibility at time of publication (June 2026).
- [1] Blockchain Council. “Decentralized Identity for AI Using Blockchain.” blockchain-council.org, April 6, 2026. https://www.blockchain-council.org/blockchain/decentralized-identity-for-ai-blockchain-authenticate-agents-devices-users/
- [2] Dock.io. “AI Agent Digital Identity Verification: How to Trust Autonomous Decisions.” dock.io, 2025. https://www.dock.io/post/ai-agent-digital-identity-verification
- [3] Fedotova, M. et al. “Failures of Public Key Infrastructure: A 53-Year Survey.” arXiv:2401.05239, 2024. https://arxiv.org/pdf/2401.05239
- [4] Jayasena, Achini Udari. “Understanding OAuth 2.0: A Brief History.” Medium, March 13, 2025. https://aaujayasena.medium.com/understanding-oauth-2-0-c85d05d8ed66
- [5] OAuth.net. “OAuth Introduction: History and Origins.” oauth.net, 2024. https://oauth.net/about/introduction/
- [6] Zenity. “Securing the Model Context Protocol (MCP): A Deep Dive into Emerging AI Risks.” zenity.io, June 20, 2025. https://zenity.io/blog/security/securing-the-model-context-protocol-mcp
- [7] South, Tobin et al. “Identity Management for Agentic AI: The New Frontier of Authorization, Authentication, and Security.” arXiv:2510.25819 / Executive Summary, April 25, 2026. https://arxiv.org/html/2604.23280v1
- [8] W3C. “Decentralized Identifiers (DIDs) v1.1.” W3C Candidate Recommendation Snapshot, March 5, 2026. https://www.w3.org/TR/did-1.1/
- [9] Descope. “What Is the Model Context Protocol (MCP) and How It Works.” descope.com, January 15, 2026. https://www.descope.com/learn/post/mcp
- [10] Biometric Update. “1Password, Keycard Present Tools for Secure AI Agent Credential Delegation.” biometricupdate.com, May 2026. https://www.biometricupdate.com/202605/1password-keycard-present-tools-for-secure-ai-agent-credential-delegation
- [11] arXiv. “Authenticated Delegation and Authorized AI Agents.” arXiv:2501.09674, January 2025. https://arxiv.org/pdf/2501.09674
- [12] Indicio. “Why Verifiable Credentials Will Power Real-World AI in 2026.” indicio.tech, February 12, 2026. https://indicio.tech/blog/why-verifiable-credentials-will-power-ai-in-2026/
- [13] Rodriguez Garzon, Sandro et al. “AI Agents with Decentralized Identifiers and Verifiable Credentials.” arXiv:2511.02841, November 2025. https://arxiv.org/abs/2511.02841
- [14] NSA. “Model Context Protocol (MCP) Security Guidance.” nsa.gov Cybersecurity Information Sheet, 2026. https://www.nsa.gov/Portals/75/documents/Cybersecurity/CSI_MCP_SECURITY.pdf
- [15] InstaSafe. “OAuth 2.1 vs OAuth 2.0: What’s the Difference?” instasafe.com, March 28, 2025. https://instasafe.com/blog/oauth-2-1-vs-oauth-2-0-differences/
