AI Agents Are the Next FinOps Blind Spot

Remember shadow IT? Developers spinning up EC2 instances on personal credit cards, marketing teams buying SaaS tools nobody in finance knew about, that one intern who provisioned a GPU cluster for a hackathon and forgot to shut it down?

We spent a decade building FinOps practices to tame that chaos. Tagging policies, showback dashboards, commitment discounts, anomaly alerts. The whole discipline exists because cloud spending is invisible until it isn't.

Now we have a new version of the same problem. It's faster, harder to see, and it doesn't even provision infrastructure you can find in a console.

AI agents are the next shadow IT, and most FinOps teams aren't ready.

The Problem: Agents Spend Money Differently

Traditional cloud cost management tracks compute, storage, and network. The unit of spend is a resource: a VM, a container, a database. You can tag it, right-size it, schedule it, reserve it.

AI agents don't work that way. Their cost signature is:

Token-based: billed per input/output token, not per hour or per GB
Compounding: a single agent task can chain 10-50 model calls, each with full context re-injection
Autonomous: the agent decides how many calls to make, which tools to invoke, and whether to retry
Invisible: no infrastructure shows up in your cloud bill. It's API calls to OpenAI, Anthropic, or your internal model endpoints

A basic chatbot interaction costs ~1,000 tokens. A moderately complex agent workflow (research, reasoning, tool calls, verification) routinely consumes 20,000 to 100,000+ tokens. That's a 20-100x multiplier that scales with every employee who deploys one.

The Numbers Are Already Ugly

The data from early 2026 deployments is sobering:

One 35-engineer SaaS company hit $87,000/month in agent token costs. Most of it came from autonomous coding assistants running unchecked overnight.
A single developer generated $4,200 in a weekend when a refactoring agent entered a recursive loop nobody was monitoring.
POC-to-production scaling has seen costs jump from ~$50/month to $2.5M/month when agents go from 5 users to 5,000.
Agentic workflows consume 5-30x more tokens per task than conversational AI. Enterprises are consuming 13x more tokens year-over-year.

The pattern is consistent: teams prototype an agent cheaply, get excited about the results, roll it out broadly, and discover the bill 30 days later.

Why Your Current FinOps Practice Misses This

If you're running a mature FinOps organization, you probably have:

Cloud cost dashboards (AWS Cost Explorer, Azure Cost Management)
Tagging enforcement and allocation rules
Reserved instance / savings plan coverage optimization
Anomaly detection on infrastructure spend

None of that catches agent costs because:

Token spend often hits a different GL code: it's an API subscription or a software license, not infrastructure. Finance categorizes it as "software" and nobody in FinOps sees it.
There's no resource to tag: you can't tag an API call the way you tag an EC2 instance. The agent runs on someone's laptop or in a serverless function, calls an external API, and the cost shows up as a line item on a vendor invoice 30 days later.
The spender isn't a team, it's an agent: traditional showback assumes a human team made a provisioning decision. With agents, the "decision" to make 47 API calls to complete a task was made by software, autonomously, with no approval workflow.
Agents proliferate like shadow IT: by mid-2026, over 3 million AI agents operate in enterprises, with only ~47% actively monitored. 29% of employees admit to using unsanctioned agents. Departments spin them up with low-code tools, no procurement process required.

What FinOps Teams Should Do Now

This isn't a future problem. It's a right-now problem. Here's what I'd recommend:

1. Get Visibility Into Token Spend Today

If you can't answer "how much are we spending on AI API calls, by team, by use case, per month?" then you're flying blind. Start here:

Audit all API keys and model endpoints across the organization
Centralize AI/LLM access through a gateway or proxy layer (this is the equivalent of requiring all cloud access through your organization's AWS accounts)
Build a token spend dashboard alongside your cloud cost dashboard

2. Treat Agent Deployments Like Infrastructure Deployments

An agent that autonomously makes decisions and spends money should go through the same governance as provisioning a production service:

Require registration of all agents (even internal/experimental ones)
Set token budgets per agent, per workflow, per department
Implement hard caps with human-in-the-loop approval to exceed them
Alert on anomalies: if an agent suddenly consumes 10x its baseline, something is wrong

3. Establish Unit Economics

The FinOps principle of "unit economics" matters even more here. Don't just track total token spend. Track cost per business outcome:

What does it cost to resolve one customer support ticket with an agent?
What's the cost per code review, per research task, per generated report?
Is the agent actually cheaper than the human process it replaced, or are we spending more for the convenience of automation?

If you can't answer these questions, you can't make rational build/buy/automate decisions.

4. Build a Model Routing Strategy

Not every agent task needs GPT-4 or Claude Opus. Most organizations are running premium models for tasks that a cheaper model handles just fine:

Route simple classification/extraction to smaller models
Reserve expensive reasoning models for tasks that actually need them
Cache common queries and results to avoid redundant calls
Implement tiered model selection based on task complexity

This is the AI equivalent of right-sizing your instances, and it can cut token spend by 40-60%.

5. Add AI Spend to Your FinOps Reporting

Your monthly cost review should now include:

Total token spend by department and use case
Cost per agent workflow vs. value delivered
Trend analysis: are we scaling spend linearly with value, or is it compounding?
Shadow AI audit: what unsanctioned tools and agents are we finding?

The Bigger Picture

Gartner projects 40% of enterprise applications will include task-specific AI agents by end of 2026, up from less than 5% in 2025. PwC found 88% of executives plan to increase AI budgets specifically due to agentic AI.

This isn't optional anymore. AI agents are going to be everywhere. The question is whether your FinOps practice evolves to govern them, or whether you're the last to know when your AI bill hits seven figures.

The organizations that figure this out will compound their advantage. They'll deploy agents confidently, scale them efficiently, and measure their ROI clearly. Everyone else will be writing incident reports about why Q3 costs were 300% over forecast.

Trey Morgan is a cloud FinOps leader based in Austin, Texas. He previously led FinOps product strategy at Microsoft and Walmart, and currently drives Cloud FinOps as a Service delivery. Connect with him at treymorgan.com.

AI Agents Are the Next FinOps Blind Spot

The Problem: Agents Spend Money Differently

The Numbers Are Already Ugly

Why Your Current FinOps Practice Misses This

What FinOps Teams Should Do Now

1. Get Visibility Into Token Spend Today

2. Treat Agent Deployments Like Infrastructure Deployments

3. Establish Unit Economics

4. Build a Model Routing Strategy

5. Add AI Spend to Your FinOps Reporting

The Bigger Picture

Tags:

Related Posts

The Problem: Agents Spend Money Differently

The Numbers Are Already Ugly

Why Your Current FinOps Practice Misses This

What FinOps Teams Should Do Now

1. Get Visibility Into Token Spend Today

2. Treat Agent Deployments Like Infrastructure Deployments

3. Establish Unit Economics

4. Build a Model Routing Strategy

5. Add AI Spend to Your FinOps Reporting

The Bigger Picture

Tags:

Share this post

Related Posts

Is the Cloud the Wrong Place for AI? A FinOps Perspective

FinOps Certified: FinOps for AI becomes available (July 27)

FinOps for the Agentic Enterprise