Your agents are burning money.
We know why.

Costile is an AI agent diagnostic proxy for Anthropic Claude, with OpenAI support coming soon. It turns runaway spend into a clear incident report with root cause, affected requests, business impact, and recommended fix.


Account Overview

Portfolio health across monitored AI workflows

Last 30 days
Active Incidents: 4 (2 critical · 2 warning)
Spend Reviewed: $23.4k across 7 agents
Savings Found: $8.4k projected opportunity
High Impact

Spend traced to owner and workflow

Costile connects API keys, teams, agents, and incidents so the account view shows who owns each spend source.

$18.4k/mo risk

You'll see the bill. You won't know why.

8:00 AM
You ship a new AI agent to production. Everything looks fine.
9:15 AM
An edge case triggers a retry loop. Your agent starts hammering the API.
11:30 AM
10,000 requests later, your bill reads $127. You have no idea.
2:00 PM
The provider dashboard finally refreshes. You're $77 over budget.
The bill tells you what. Costile tells you why.

✓ With Costile: incident classified, affected requests highlighted, monthly risk calculated, fix recommended.

Built for the failures provider dashboards hide

Costile turns raw AI spend into a diagnosis your team can act on.

Behavioral Incident

An agent starts behaving abnormally

Costile classifies the event, isolates the affected request window, and shows whether it is still active.

Request Trail

The spike becomes explainable

Every incident links back to the agent, session, model, tokens, cost, timestamp, and request sequence.

Costed Fix

The recommendation has a number attached

Costile estimates impact, projected monthly exposure, and the likely savings from the recommended change.

From one env variable to root-cause diagnostics

Route traffic through Costile and turn every AI request into inspectable operational data.

1

Route AI API calls through Costile

Change one environment variable to route through the Costile proxy. No SDK changes, no code refactor — your app doesn't know the difference.

ANTHROPIC_BASE_URL=https://your-proxy.com
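
As a rough illustration of what that one variable changes, the sketch below resolves the URL a client would call. The helper name `messages_endpoint` is hypothetical, `https://your-proxy.com` is the placeholder host from the line above, and the sketch assumes the proxy accepts requests on the same `/v1/messages` path as Anthropic's own API.

```python
import os

DEFAULT_BASE_URL = "https://api.anthropic.com"  # Anthropic's public API host


def messages_endpoint(env=None):
    """Return the Messages URL a client would call, honoring ANTHROPIC_BASE_URL.

    Hypothetical helper for illustration; it is not part of Costile or any SDK.
    """
    env = os.environ if env is None else env
    base = env.get("ANTHROPIC_BASE_URL", DEFAULT_BASE_URL).rstrip("/")
    return f"{base}/v1/messages"


# Without the variable, traffic goes straight to the provider.
print(messages_endpoint({}))
# With it set, the same request path is sent to the proxy (placeholder host).
print(messages_endpoint({"ANTHROPIC_BASE_URL": "https://your-proxy.com"}))
```

The application code that builds and sends the request stays the same; only the host it resolves to changes.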
2

Detect abnormal agent behavior

Requests are grouped by agent and session. Costile looks for abnormal request patterns, unusual cost movement, inefficient model usage, and other signs of runaway behavior.
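
To make the grouping step concrete, here is a minimal sketch of one possible check: group request records by agent and session, then flag any session that sends more than a set number of requests inside a sliding time window. This is not Costile's actual detection logic. The function name, the record fields, and the thresholds are all assumptions chosen for illustration.

```python
from collections import defaultdict


def flag_bursty_sessions(requests, max_requests=100, window_seconds=60):
    """Return (agent, session) keys that exceed max_requests within any window.

    Illustrative only: each record is assumed to be a dict with "agent",
    "session", and a numeric "timestamp" in seconds.
    """
    by_session = defaultdict(list)
    for r in requests:
        by_session[(r["agent"], r["session"])].append(r["timestamp"])

    flagged = []
    for key, stamps in by_session.items():
        stamps.sort()
        start = 0
        for end, ts in enumerate(stamps):
            # Advance the window start until it spans at most window_seconds.
            while ts - stamps[start] > window_seconds:
                start += 1
            if end - start + 1 > max_requests:
                flagged.append(key)
                break
    return flagged


# A retry loop: 150 requests in 30 seconds from one session.
loop = [{"agent": "support-bot", "session": "s1", "timestamp": i // 5} for i in range(150)]
# Normal traffic: one request every 30 seconds from another session.
normal = [{"agent": "support-bot", "session": "s2", "timestamp": i * 30} for i in range(10)]

flagged = flag_bursty_sessions(loop + normal)
print(flagged)
```

Only the looping session is flagged here. A real detector would likely combine several signals, such as cost movement and model choice, rather than request rate alone.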

3

Get the cause, impact, and fix

The dashboard shows the incident window, request log, projected monthly risk, and a concrete recommendation your team can ship.

Provider dashboards tell you what. Costile tells you why.

Visibility isn't enough. You need a diagnosis.

Costile vs. OpenAI and Anthropic dashboards: real-time AI cost diagnostics, root-cause analysis, per-agent monitoring

Feature | OpenAI/Anthropic | Costile
Cost updates | Delayed | Real-time
Why costs spiked | Never | Root-cause diagnostic report
Budget enforcement | None | Hard caps
Per-agent tracking | Not supported | Per agent, per session
Fix recommendations | Not available | Specific, costed, actionable
Open source | Closed | MIT licensed
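
As a sketch of what a hard cap means in practice, the example below refuses any request that would push spend past a fixed budget. The class and method names are hypothetical and do not describe Costile's implementation. The figures reuse the timeline above: $127 across 10,000 requests is $0.0127 per request, and being $77 over budget implies a budget of $50.

```python
from decimal import Decimal


class BudgetExceeded(Exception):
    """Raised when a request would push spend past the cap."""


class BudgetGuard:
    """Hypothetical hard-cap guard; amounts are decimal strings in USD."""

    def __init__(self, cap_usd):
        self.cap_usd = Decimal(cap_usd)
        self.spent_usd = Decimal("0")

    def charge(self, cost_usd):
        """Record a request's cost, or refuse it if it would breach the cap."""
        cost = Decimal(cost_usd)
        if self.spent_usd + cost > self.cap_usd:
            raise BudgetExceeded(f"cap of ${self.cap_usd} reached")
        self.spent_usd += cost


guard = BudgetGuard("50")
served = 0
try:
    for _ in range(10_000):      # the runaway loop from the timeline
        guard.charge("0.0127")   # $127 / 10,000 requests
        served += 1
except BudgetExceeded:
    pass                         # the kill switch: stop forwarding requests

print(served, guard.spent_usd)
```

With the cap in place the loop stops well short of 10,000 requests and spend never exceeds the $50 budget. `Decimal` is used so that many small charges accumulate without floating-point drift.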

Made for teams running AI in production

Different stakeholders need the same answer: what happened, what did it cost, and what should change?

For CTOs

See account-level spend, active incidents, top risks, and projected monthly exposure across agents.

For AI teams

Debug abnormal agent behavior through agent, session, token, cost, and request-level evidence.

For finance and ops

Turn AI usage into reports with cost impact, savings opportunities, and budget guardrails.

From suspicious spend to an executive-ready incident summary

Costile gives teams the context they need to decide whether to investigate now, cap spend, or change the agent.

Customer-support workflow · High-impact spend anomaly

Cause Costile isolated the abnormal behavior to one agent, one session, and a narrow incident window.
Impact $842 in preventable spend this week, projected to $3,600/month if the pattern continues.
Recommendation Add a guardrail for repeated agent turns and route routine requests away from premium models.
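
The page does not state how the monthly projection is computed. One plausible reading, shown below as an assumption, scales the weekly figure by the average number of weeks in a month (52 / 12), which puts $842 per week close to the rounded $3,600/month figure above. The function name is hypothetical.

```python
def project_monthly(weekly_usd, weeks_per_month=52 / 12):
    """Scale a weekly spend figure to an average month (assumed method)."""
    return weekly_usd * weeks_per_month


monthly = project_monthly(842)
print(f"${monthly:,.0f}/month")
```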

Start simple. Bring us in when AI spend becomes operationally important.

The public repo gives you the basic proxy and local controls. Enterprise adds the managed diagnostic layer, team workflows, and support.

Open Source

Free

Self-hosted cost-control proxy

$0 /forever
  • AI proxy for Claude requests
  • Basic per-agent tracking
  • Local cost visibility
  • Budget caps and kill switch
  • SQLite dashboard
  • MIT licensed source code
Get Started on GitHub

See It In Action

Open a preloaded workspace and see how a CTO or AI team would move from portfolio overview to a single costly agent, request evidence, projected exposure, and a recommended fix.

costile.com/dashboard?key=demo-customer
Open Full Dashboard →

Ready to track your own agents? Create an account and get an API key instantly.

Enterprise trust starts with what Costile does not need

Costile is designed around operational metadata, scoped keys, and auditable deployment choices.

Data minimization

Costile can diagnose spend from metadata: model, tokens, cost, timestamp, agent, session, and stop reason.
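
For illustration, a metadata-only record along these lines could look like the sketch below. The class name, field names, types, and sample values are assumptions based on the list above, not Costile's actual schema. Note that it has no field for prompt or response text.

```python
from dataclasses import asdict, dataclass, fields


@dataclass(frozen=True)
class RequestMetadata:
    """Hypothetical per-request record holding operational metadata only."""

    model: str
    tokens: int
    cost_usd: float
    timestamp: str    # ISO 8601, e.g. "2025-01-01T09:15:00Z"
    agent: str
    session: str
    stop_reason: str


record = RequestMetadata(
    model="example-model",        # placeholder, not a real model name
    tokens=1200,
    cost_usd=0.0127,
    timestamp="2025-01-01T09:15:00Z",
    agent="support-bot",
    session="s1",
    stop_reason="max_tokens",
)

field_names = [f.name for f in fields(RequestMetadata)]
print(field_names)
print(asdict(record)["agent"])
```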

Scoped access

Dashboard access is tied to authenticated users and their own generated Costile API keys.

Deployment choice

Run it yourself from the MIT-licensed repo or use managed cloud when you need team workflows and support.

This Isn't a Hypothetical Problem

AI agent costs are already hitting six figures — and most teams have zero visibility until the invoice lands.

The All-In Podcast
@theallinpod

What Happens When AI Tokens Cost More Than Your Employees?

@Jason: "We, with our agents, hit $300/day per agent using the Claude API, like instantly. And that was doing, maybe, 10 or 20%. That's $100k/year per agent."

View on X →

$300/day. Per agent. And they didn't see it coming.

Costile exists so you do.

Frequently Asked Questions

What is Costile?
Costile is an AI agent diagnostic layer that sits between your application and AI APIs like Anthropic and OpenAI. It tracks operational metadata and tells you why costs spiked without forcing your team to reverse-engineer the provider bill.
How does it work?
Simply point your application to Costile's proxy endpoint instead of the AI provider's API. Costile forwards your requests, records operational metadata, groups requests by agent and session, then classifies abnormal behavior and shows a diagnostic report with cause, impact, request log, and fix. Setup takes about 5 minutes.
Which AI providers does Costile support?
Costile currently supports Anthropic's Claude API. OpenAI support is coming soon. The architecture is designed to support any AI API provider, and we're adding new providers based on community demand. If you need a specific provider, open an issue on GitHub!
Self-hosted vs. Cloud: what's the difference?
The self-hosted version (free, open source) runs on your own infrastructure. You deploy it yourself, manage updates, and handle your own data. Costile Cloud is the managed version: hosting, updates, uptime, alerts, team workflows, diagnostics, and enterprise support on top of the same core technology.
Is my API data secure?
Yes. With self-hosting, your data never leaves your infrastructure, so you have complete control. Costile logs only metadata (such as cost, tokens, and timestamps), not the actual prompts or responses. Your Anthropic/OpenAI API keys stay in your environment variables and are never exposed or stored. The code is open source (MIT license), so you can audit it yourself.
How much does it cost?
The self-hosted version is completely free and open source (MIT license). Enterprise cloud pricing is custom because it depends on deployment, support, retention, security, and team needs. You can start self-hosted and move to managed cloud when the product becomes operationally important.