How does Costile diagnose runaway AI agents?

Costile routes AI traffic through a diagnostic layer, watches for abnormal spend behavior, and turns the signal into clear reports your team can act on.

AI agent diagnostics

Your agents are burning money.
We know why.

Q: Can we inspect or run Costile ourselves?

The Costile core repository is available for inspection and local experimentation. Costile Cloud is the maintained commercial product for teams that need hosted diagnostics, attribution, alerts, and support.

For AI builders and production teams. Route your AI traffic through Costile once and automatically surface cost overruns, ownership, and recommended fixes as they happen.

See the Diagnosis

Explore a real AI spend incident from detection → resolution. It’s read-only, nothing to configure or connect.

Metadata-first by default DPA available EU company EU AI Act evidence

Spend timeline

$585.68

3 spend spikes over limit

Spend

Your agents spent $585.68 in the last 24h, on pace for $18k/mo.

Cap

The dashed line shows the daily budget your agents should stay under.

Breach

Spend crossed the limit 3 times. Open the Support team marker and see Maya owns the workflow.

Traceable record captured audit log #1842

Provider dashboards tell you what. Costile tells you why.

Ownership

Attribution

Owner found

Maya Holm

Support workflow · support-bot

Spend traced $97.84

Team

Support

Key

prod-42

AI usage is linked to a specific key, owner, and workflow so you always know who generated the cost.

Detection & Diagnosis

Incident

High impact

Session loop detected

02:47 AM · support-bot

$97.84

Cause

Context resent every call with no stop condition.

Proof

9 requests in 5 minutes, 3 max-token hits.

Cost spikes are flagged with what happened, why it happened, and which requests were affected.

Fix & Evidence

Audit log

#1842

02:47

Spend spike recorded

02:49

Owner linked to workflow

02:53

Recommended fix attached

Tamper-evident · export-ready

You get recommended fixes, estimated cost savings, and a full audit log for compliance and governance.

AI Spend Investigation

See the spike. Stop the cost.

At 2:47 AM, an AI agent goes into a runaway loop. Costile catches it, shows who's responsible, projects the cost, and provides the fix that stops it.

app.costile.com / diagnostics / support-bot

High Impact · Active

Support-bot context loop incident

Repeated calls caused by context accumulation. The full conversation history is resent every request without context limits.

Impact Cost

$97.84

Monthly Risk

$2,935

Incident Window

02:47 → 02:52

Affected Requests

Session Loop

9 requests in the same session, with tokens escalating from 920 to 1,480. Max token hits occurred on 3 of 9 calls.

Loop cost

$97.84

Requests in loop

Max tokens hits

3 / 9

→ Agent re-reads its own outputs without a completion limit.

Issue

Repeated calls caused by a context accumulation loop.
Full conversation history resent without context limits.

Root Cause

Agent re-reads its own outputs.
No completion limit, so context keeps growing.

Recommended Fix

Add a max_turns guard.
Set max_turns: 6.
Reset context after tool completion.

Expected outcome Stops loop

Session cost ~85% lower

Monthly reduction ~$2,495

Owner Maya Holm · AI Operations Director · Northstar production AI workflows

EU AI Act · Regulation 2024/1689 (Phased application from 2 August 2026)

Built for the EU AI Act

Article 9 — Risk Management

Requirement: Documented risk management process for high-risk AI systems.

What regulators expect: Ongoing identification, mitigation, and tracking of AI-related risks.

Costile provides:

Live anomaly detection across agents and workflows
Root-cause capture when incidents occur (what happened + why)
Suggested fixes to resolve and prevent recurrence

Article 12 — Record-Keeping

Requirement: Automatic, traceable logs of system behavior in production.

What regulators expect: Who did what, when, and under what conditions.

Costile provides:

Automatic audit logs for every AI call
Full attribution by API key, agent, and team
Production traceability accessible to workspace admins

Costile gives you both Article 9 + Article 12 coverage out of the box.

Learn more: Costile EU AI Act compliance page

Pricing and setup

What your subscription funds

Improve AI diagnostics and cost detection
Build better governance and compliance features
Create smarter cost-saving recommendations

How it works

From trial to live traffic in three steps

Starting a Costile trial and choosing the Individual plan — 1
Pick Individual or Team based on how many people need access, sign in with email and new password to create a workspace, and start the 10-day free trial without a card.

Connecting Costile and preparing checkout after the trial — 2
Send AI calls through Costile, review incidents and spend, and only proceed to billing once the pilot proves the value.

Managing plan settings after the trial — 3
Add payment details in Billing after the trial, manage your plan in Settings, and switch or cancel it at any time as team needs evolve.

Individual

For solo builders and operators monitoring AI usage.

$79 /month

Live incident diagnostics (owner, cause, fix)
Spend visibility across agents and workflows
Abnormal spend alerts via email

Start Individual Trial

Team

For teams needing governance and shared visibility.

$299 /month

Everything in Individual, plus:
Team-wide attribution (person, API key, workflow)
Audit-ready logs for compliance & procurement
Fast alerts via Slack or webhooks

Start Team Trial

Good To Know

How does it work?

Route API traffic through Costile's proxy → it analyzes requests in real time → returns cause + impact + fix reports.

Does routing through Costile add latency?

The proxy is lightweight. It forwards requests and records cost metadata without modifying model calls, so overhead stays low. We'll measure exact latency during pilots.

Do you store my prompts or responses?

Default: No. Only operational metadata is stored. Optional prompt capture can be enabled for debugging.

Can we inspect or run Costile ourselves?

Yes. The core runs locally and is fully inspectable. Cloud adds alerts, attribution, and managed ops.

How is Costile different from Helicone or LangSmith?

Those tools show logs. Costile focuses on why something happened, who caused it, and how to fix it.

Catch AI Cost Spikes Before You're Billed

See the Diagnosis or Talk to Costile

Your agents are burning money.We know why.

Provider dashboards tell you what. Costile tells you why.

See the spike. Stop the cost.

Built for the EU AI Act

Article 9 — Risk Management

Article 12 — Record-Keeping

Pricing and setup

From trial to live traffic in three steps

Individual

Team

Good To Know

Catch AI Cost Spikes Before You're Billed

Your agents are burning money.
We know why.