Your agents are burning money.
We know why.
Costile is an AI agent diagnostic proxy for Anthropic Claude and OpenAI. It turns runaway spend into a clear incident report with root cause, affected requests, business impact, and recommended fix.
Account Overview
Portfolio health across monitored AI workflows
Spend traced to owner and workflow
Costile connects API keys, teams, agents, and incidents so the account view shows who owns each spend source.
You'll see the bill. You won't know why.
✓ With Costile: incident classified, affected requests highlighted, monthly risk calculated, fix recommended.
Built for the failures provider dashboards hide
Costile turns raw AI spend into a diagnosis your team can act on.
An agent starts behaving abnormally
Costile classifies the event, isolates the affected request window, and shows whether it is still active.
The spike becomes explainable
Every incident links back to the agent, session, model, tokens, cost, timestamp, and request sequence.
The recommendation has a number attached
Costile estimates impact, projected monthly exposure, and the likely savings from the recommended change.
From one env variable to root-cause diagnostics
Route traffic through Costile and turn every AI request into inspectable operational data.
Route AI API calls through Costile
Change one environment variable to route through the Costile proxy. No SDK changes, no code refactor — your app doesn't know the difference.
ANTHROPIC_BASE_URL=https://your-proxy.com
Detect abnormal agent behavior
Requests are grouped by agent and session. Costile looks for abnormal request patterns, unusual cost movement, inefficient model usage, and other signs of runaway behavior.
Get the cause, impact, and fix
The dashboard shows the incident window, request log, projected monthly risk, and a concrete recommendation your team can ship.
Provider dashboards tell you what. Costile tells you why.
Visibility isn't enough. You need a diagnosis.
| Feature | OpenAI/Anthropic | Costile |
|---|---|---|
| Cost updates | Delayed | Real-time |
| Why costs spiked | ✗ Never | ✓ Root-cause diagnostic report |
| Budget enforcement | ✗ None | ✓ Hard caps |
| Per-agent tracking | ✗ Not supported | ✓ Per agent, per session |
| Fix recommendations | ✗ Not available | ✓ Specific, costed, actionable |
| Open source | ✗ Closed | ✓ MIT licensed |
Made for teams running AI in production
Different stakeholders need the same answer: what happened, what did it cost, and what should change?
For CTOs
See account-level spend, active incidents, top risks, and projected monthly exposure across agents.
For AI teams
Debug abnormal agent behavior through agent, session, token, cost, and request-level evidence.
For finance and ops
Turn AI usage into reports with cost impact, savings opportunities, and budget guardrails.
From suspicious spend to an executive-ready incident summary
Costile gives teams the context they need to decide whether to investigate now, cap spend, or change the agent.
Customer-support workflow · High-impact spend anomaly
Start simple. Bring us in when AI spend becomes operationally important.
The public repo gives you the basic proxy and local controls. Enterprise adds the managed diagnostic layer, team workflows, and support.
Free
Self-hosted cost-control proxy
- ✓ AI proxy for Claude requests
- ✓ Basic per-agent tracking
- ✓ Local cost visibility
- ✓ Budget caps and kill switch
- ✓ SQLite dashboard
- ✓ MIT licensed source code
Enterprise
Managed diagnostics, teams, and governance
- ✓ Managed Costile Cloud deployment
- ✓ Advanced incident classification
- ✓ Account overview and executive reports
- ✓ Alerts via email, Slack, or webhook
- ✓ API key rotation and access controls
- ✓ Retention, audit, and security support
- ✓ Dedicated onboarding and roadmap input
See It In Action
Open a preloaded workspace and see how a CTO or AI team would move from portfolio overview to a single costly agent, request evidence, projected exposure, and a recommended fix.
Ready to track your own agents? create an account and get an API key instantly.
Enterprise trust starts with what Costile does not need
Costile is designed around operational metadata, scoped keys, and auditable deployment choices.
Costile can diagnose spend from metadata: model, tokens, cost, timestamp, agent, session, and stop reason.
Dashboard access is tied to authenticated users and their own generated Costile API keys.
Run it yourself from the MIT-licensed repo or use managed cloud when you need team workflows and support.
This Isn't a Hypothetical Problem
AI agent costs are already hitting six figures — and most teams have zero visibility until the invoice lands.
What Happens When AI Tokens Cost More Than Your Employees?
@Jason: "We, with our agents, hit $300/day per agent using the Claude API, like instantly. And that was doing, maybe, 10 or 20%. That's $100k/year per agent."
View on X →$300/day. Per agent. And they didn't see it coming.
Costile exists so you do.