Token Usage & Cost Savings
Agent-CoreX reduces the number of tool schemas sent to your LLM on each request. The Usage dashboard shows an estimated dollar value of the tokens you've saved this month.
Last updated: April 2026
Where to find token data
Navigate to Dashboard → Usage. Scroll to the bottom of the page — below the 30-day bar chart and the quota progress bar — to find the "Estimated cost savings this month" card.
Cost savings card
The card displays two pieces of information:
Estimated cost savings this month
Based on avg 60% token reduction across N queries.
$X.XX
saved vs. unrouted
- •$X.XX — the estimated dollar saving for the current 30-day period
- •N queries — the total query count in the last 30 days, same value shown in the "Queries this period" stat card
How it is calculated
The savings figure uses a fixed formula:
The $0.07 per query figure represents the average token cost saved when Agent-CoreX routes only the relevant 5% of tools instead of your full tool schema, assuming:
- •Average 60% reduction in tokens sent to the LLM per request
- •Typical LLM input pricing across GPT-4, Claude, and Gemini models
Why tokens matter
Most LLM providers charge per input token. When you attach 100 tool schemas to every agent request — even if the agent only uses 3 of them — you pay for all 100 schemas on every call. With a large tool registry, this can represent 40–70% of your total token spend.
Agent-CoreX intercepts that step: your agent sends a natural-language query, gets back only the top 5 (or however many you configure with top_k) relevant tools, and sends only those to the LLM. See How Tool Retrieval Works for details on the ranking algorithm.