What's the difference between tracking AI usage and tracking AI spend?

Usage is the raw meter — tokens, GPU-seconds, characters, requests. Spend is that meter converted to money at the provider's rates. Forty million tokens tells you how much you sent. It doesn't tell you what that cost, and it can't be set beside a GPU bill or a hosting bill measured in different units. The number you budget against is spend, which means any useful tracker has to convert every provider's meter to money and sum the results.

Can I track AI costs without changing my code?

Yes. The lowest-friction approach reads each provider's own usage or billing API directly and converts it to cost. It sits entirely outside your application — no SDK to install, nothing added to your request path. Observability tools and AI gateways track cost differently, by instrumenting your code or routing every request through a proxy; the billing-API approach is the one that needs zero code changes.

How do you total spend across providers that bill in different units?

You convert each provider's own meter to money at its published per-unit rates before adding anything up. OpenAI and Claude bill in tokens; RunPod bills in GPU-hours; ElevenLabs in characters; Cloudflare and Vercel in requests and bandwidth. Each is metered differently, but once priced into money they become directly comparable and sum into one total.

Do I need an observability SDK or an AI gateway to track costs?

Not for spend tracking alone. Observability platforms and gateways are worth the trade-off when you also need per-request tracing, prompt-level debugging, or routing. They earn that by sitting in your code or your request path — an SDK to install, or a gateway in your latency budget. If the question is just "what am I spending across my providers this month and where is it heading," reading each provider's usage API directly answers it with nothing installed in your application.

Why might a tracked total differ from the provider's invoice?

A good tracker computes cost from each provider's published rates at the moment usage is recorded. That makes the figure reproducible and stable even when a provider later changes prices. Discounts that don't appear in the raw usage (batch-API rates, automatic prompt caching, promotional credits, committed-use deals) can make the actual invoice a little lower. A tracked total is an accurate, slightly conservative running estimate.

How far ahead can I see my AI spend?

Into next month. A running total covers what you've spent so far this month. The forecast takes your recent daily burn rate, extends it across next month, and adds next month's fixed subscriptions. Provider dashboards don't give you that forward number. Because the total and forecast both recompute from your latest usage each time you refresh, a developing spike shows up the next time you pull it in — well before it reaches a bill.

Are there AI costs CostCompass can't track automatically yet?

Yes, a few. This kind of tracking only works when a provider exposes usage through an API. A handful only show billing in their own dashboard, with no API to read, so CostCompass can't pull those yet. Flat subscriptions you enter once, and CostCompass prorates them into your total. When a subscription also charges usage-based credits on top of the flat fee, that extra metered part isn't captured today. Your total covers every provider with a readable usage API plus the subscriptions you've entered; it can read low where billing is dashboard-only or a subscription adds credits CostCompass can't yet see.

Why use CostCompass instead of tracking AI costs yourself?

Tracking it yourself means logging into a console per provider, reading each one's after-the-fact usage in its own units, converting to money, and summing it by hand — every time you want to know the number — and you still have no forecast at the end. CostCompass pulls each provider's usage with one click and turns it into one live month-to-date total, a forecast of what next month will cost at your current pace, and a per-provider, per-model breakdown across model APIs and compute alike, with nothing wired into your code. You glance at one number instead of rebuilding it by hand, and the day you add another provider it's already folded in.

Track AI costs

Actually track what AI is costing you

Your AI spend is scattered across model APIs, a GPU box, a hosting bill, and a voice service — each in its own console, none of them showing where the month is headed. Here are the real ways to track it, and how to end up with one running total and a forecast in front of you.

See your own AI spend — start free

By Joubert Berger Published June 6, 2026

Tracking AI costs sounds like it should be solved already. Every provider shows you a usage page. The trouble starts when you run on more than one: a model API, a second model API, a GPU box for inference, a hosting bill, a voice service. Each one has its own login and its own unit of measure, and they bill on different cycles. No single page anywhere answers what is AI costing me this month. Until the invoices land, you're mostly guessing.

This guide walks the four real ways to do it — provider dashboards, a spreadsheet, observability tooling, a billing dashboard — and what each costs you in effort. The aim is to end with one running total and a forecast, instead of a row of consoles you add up by hand.

An antique almanac engraving: a balance scale, an hourglass, and a graduated measuring jug each feed a single line down into one open ledger book, where a running total is tallied in a single copper-inked column. — Different meters — weight, time, volume — but one running total kept in a single ledger.

What does it mean to “track” AI costs?

Tracking AI costs means turning each provider’s raw usage into money at its rates and adding it into one total. Two terms get blurred together along the way. Usage is the raw meter each provider keeps. Spend is that meter priced into money at the provider’s rates. Usage tells you that you sent forty million tokens last week; it can’t tell you what that cost, or let you set it beside a GPU bill counted in entirely different units.

Different kinds of provider meter different things, which is why usage has to be priced into money before any of it can be summed:

Provider type	Example providers	Native unit
Model APIs	OpenAI, Claude	Tokens
Rented GPUs	RunPod	GPU-seconds
Voice synthesis	ElevenLabs	Characters
Hosting / edge	Vercel, Cloudflare	Requests

The number you budget against is spend. So tracking AI costs comes down to three steps, run whenever you want the number: read each provider’s usage, price it into money at that provider’s rates, and add it into one total. The methods below are just different ways of doing those three steps. They vary in how much you do by hand, and how current the number stays.

What are the four ways to track AI costs?

They run from “log in and read it” to “it reads itself.” All four work for somebody. But once you’re a solo dev or a small team running more than two or three providers, the first three stop being worth the effort — and that’s the case this guide is going to make.

1. Provider dashboards

Every provider gives you a usage page — OpenAI’s usage view, Anthropic’s Console, the Gemini billing page — and for a single provider, that’s often enough. You can see what you’ve spent this month and, on most of them, export a CSV.

The limits show up the moment you have more than one. A dashboard is a rear-view mirror: it shows what you’ve already spent, never where the month is heading — and only its own slice, in its own units, with no roll-up. To answer “what am I spending on AI this month” you log into every console and add it up in your head. Fine with one provider. With five it’s a chore, and you’ll make mistakes.

2. A spreadsheet

The natural next step is a spreadsheet: read each console, paste the figure into a row, repeat for every provider, sum the column. It gives you the one thing the dashboards don’t — a total — and you control the layout.

It also has one fatal flaw: a spreadsheet is a snapshot, not a system — which is exactly where a spreadsheet stops keeping up. Keeping it current means re-reading every console and re-typing every figure by hand. Skip a few days and the total goes stale, usually right when a spike is happening. It won’t break spend down by model without more columns, and it gives you no forecast. So you stop updating it, and it drifts.

3. Observability SDKs and AI gateways

The developer-grade answer is to instrument the calls. Observability platforms and AI gateways capture cost per request — some by wrapping your provider SDK, others by routing every request through a proxy between your app and the provider. Done well, you get per-request cost attribution, plus tracing and prompt-level debugging.

That power has a price: these tools live in your code or your request path. There’s an SDK to install, or a gateway that now sits in your latency budget. And they’re built mostly around model tokens, so a GPU box or a hosting bill tends to fall outside what they track. If you want per-request tracing, the trade is worth making. If you just want to know your monthly spend across providers, it’s a lot of plumbing for the question. (For the named tools in this camp and the others — AI cost tools compared by how each collects billing — see the side-by-side.)

4. A billing dashboard

The last option — an AI API cost dashboard — reads each provider’s own usage or billing API directly, when you ask it to, and prices it into money — the same three steps, done in one click. Because it reads the provider’s records instead of intercepting your traffic, there’s no SDK, no gateway, nothing in your request path. It gives you one month-to-date total, a per-provider and per-model breakdown, and a forecast. Each provider documents the feed it reads from — Anthropic’s Admin API, OpenAI’s API, and Google Cloud billing.

This is the approach CostCompass takes, and the rest of this guide shows what it looks like in practice.

Here’s how the four compare:

Method	One cross-provider total	Forecast	Effort to update	Code changes needed
Provider dashboards	No	No	Read each console	None
Spreadsheet	Yes, by hand	No	Re-enter every figure	None
Observability / gateway	Yes	Sometimes	None — passive	Yes — SDK or proxy
Billing dashboard	Yes	Yes	One click	None

How do you forecast where your AI spend is heading?

You forecast it with a run rate: project your recent daily burn across next month. Every method above except the last has the same blind spot — it tells you what you’ve spent, not what you’re on track to spend. Closing that gap is nothing exotic. Take your spend over the last several days, average it per day, and project it across a full month ahead.

That’s how CostCompass forecasts: it takes your trailing seven-day burn rate, multiplies it by the days in next month, and adds next month’s fixed subscriptions. Seven days is long enough to smooth out day-to-day swings but short enough to catch a real change in pace. You get one forward number — at this rate, here’s what next month costs. Because it recomputes from your latest usage every time you refresh, a spike shows up the next time you pull it in, while you can still act on it. Seeing where the spend is heading is the first half; deciding which lever lowers it is the second — see managing LLM costs down for how to act on the breakdown once you have it.

How does CostCompass track your AI costs?

CostCompass is built around the fourth method. You connect each provider once. After that, a click on Refresh reads that provider’s usage and prices it into money, so all those different meters land in one comparable number.

The CostCompass dashboard showing a single month-to-date total of $4,318.60 across every connected provider, with a forecast, a daily burn rate, and a Refresh button to pull the latest usage. — One month-to-date total across every connected provider, with a forecast and burn rate — and the Refresh button that pulls the latest usage when you want it.

One thing here is intentional: you pull the data, not a background timer. The Refresh button fetches the latest usage from every connected provider on demand. CostCompass doesn’t poll them around the clock — there’s no point calling your providers when you aren’t looking. It stays quiet until you want the number. Then one click brings every provider’s latest usage in at once, already priced and broken down so you can see which provider and model moved it.

A by-provider breakdown of month-to-date spend — Claude, OpenAI, RunPod, ElevenLabs, and Vercel — each metered in its own unit (tokens, GPU-hours, characters, bandwidth) but combined into one running total. — Every provider in one breakdown — tokens, GPU-hours, characters, and bandwidth, all normalized to one comparable total, so your biggest line item is obvious at a glance.

Two things make it practical for a single developer. First, nothing touches your code. CostCompass reads each provider’s usage API directly, so there’s no SDK to wrap your calls and no gateway in your request path — your application runs exactly as it did before. Second, your keys are encrypted in your browser before they’re ever stored. Each provider key you connect is sealed with your vault password and saved only as ciphertext CostCompass can’t decrypt — so the App Server only ever holds that ciphertext, and your vault password stays in your browser. When usage is fetched, the key is decrypted in your browser and forwarded to the provider through a broker that holds it for the moment of the call and is built not to log it, so the plaintext stays out of the database and the logs.

A 30-day line chart of daily AI spend across all connected providers, with a steady upward slope. — A 30-day spend trend across every provider makes a developing climb visible days before the invoice would.

It also doesn’t stop at model APIs. The same dashboard rolls Claude and OpenAI up next to the GPU box, the hosting bill, and the voice service — the compute spend most LLM-only trackers ignore. (For how individual providers bill, the per-provider guides go deeper: Claude, OpenAI, Gemini, and the routed models behind OpenRouter, with the full set on the providers page and a wider overview of AI cost tracking across providers.)

Getting started takes three steps:

Connect a provider — paste the usage or admin key it gives you. It’s encrypted in your browser before it’s stored, so the server only ever holds ciphertext.
Confirm the month-to-date figure matches what you expect for that provider.
Add the rest. Each one folds into the same total, forecast, and breakdown — and a click on Refresh pulls the latest across all of them whenever you want it.

Frequently asked questions

What's the difference between tracking AI usage and tracking AI spend?: Usage is the raw meter — tokens, GPU-seconds, characters, requests. Spend is that meter converted to money at the provider's rates. Forty million tokens tells you how much you sent. It doesn't tell you what that cost, and it can't be set beside a GPU bill or a hosting bill measured in different units. The number you budget against is spend, which means any useful tracker has to convert every provider's meter to money and sum the results.
Can I track AI costs without changing my code?: Yes. The lowest-friction approach reads each provider's own usage or billing API directly and converts it to cost. It sits entirely outside your application — no SDK to install, nothing added to your request path. Observability tools and AI gateways track cost differently, by instrumenting your code or routing every request through a proxy; the billing-API approach is the one that needs zero code changes.
How do you total spend across providers that bill in different units?: You convert each provider's own meter to money at its published per-unit rates before adding anything up. OpenAI and Claude bill in tokens; RunPod bills in GPU-hours; ElevenLabs in characters; Cloudflare and Vercel in requests and bandwidth. Each is metered differently, but once priced into money they become directly comparable and sum into one total.
Do I need an observability SDK or an AI gateway to track costs?: Not for spend tracking alone. Observability platforms and gateways are worth the trade-off when you also need per-request tracing, prompt-level debugging, or routing. They earn that by sitting in your code or your request path — an SDK to install, or a gateway in your latency budget. If the question is just "what am I spending across my providers this month and where is it heading," reading each provider's usage API directly answers it with nothing installed in your application.
Why might a tracked total differ from the provider's invoice?: A good tracker computes cost from each provider's published rates at the moment usage is recorded. That makes the figure reproducible and stable even when a provider later changes prices. Discounts that don't appear in the raw usage (batch-API rates, automatic prompt caching, promotional credits, committed-use deals) can make the actual invoice a little lower. A tracked total is an accurate, slightly conservative running estimate.
How far ahead can I see my AI spend?: Into next month. A running total covers what you've spent so far this month. The forecast takes your recent daily burn rate, extends it across next month, and adds next month's fixed subscriptions. Provider dashboards don't give you that forward number. Because the total and forecast both recompute from your latest usage each time you refresh, a developing spike shows up the next time you pull it in — well before it reaches a bill.
Are there AI costs CostCompass can't track automatically yet?: Yes, a few. This kind of tracking only works when a provider exposes usage through an API. A handful only show billing in their own dashboard, with no API to read, so CostCompass can't pull those yet. Flat subscriptions you enter once, and CostCompass prorates them into your total. When a subscription also charges usage-based credits on top of the flat fee, that extra metered part isn't captured today. Your total covers every provider with a readable usage API plus the subscriptions you've entered; it can read low where billing is dashboard-only or a subscription adds credits CostCompass can't yet see.
Why use CostCompass instead of tracking AI costs yourself?: Tracking it yourself means logging into a console per provider, reading each one's after-the-fact usage in its own units, converting to money, and summing it by hand — every time you want to know the number — and you still have no forecast at the end. CostCompass pulls each provider's usage with one click and turns it into one live month-to-date total, a forecast of what next month will cost at your current pace, and a per-provider, per-model breakdown across model APIs and compute alike, with nothing wired into your code. You glance at one number instead of rebuilding it by hand, and the day you add another provider it's already folded in.

About the author

Joubert Berger builds CostCompass, a spend-intelligence dashboard that pulls usage from AI and compute providers into one month-to-date total, a forecast, and a per-provider breakdown. This guide reflects how CostCompass reads each provider's own usage API — see the security model for how your keys are handled.

Track your whole AI bill in one place

Connect each provider once and pull a single month-to-date total and forecast across all of them on demand — no SDK, no gateway, nothing in your request path.

Start free