
Top-Level Counters
Three headline metrics are displayed at the top of the page:- Total Cost — aggregate spend across all models in the selected time range, with a comparison to the previous period.
- Total LLM Calls — count of all LLM API requests, with period-over-period change.
- Total MCP Calls — count of all MCP requests, with period-over-period change.
Incoming Requests
This section breaks down the total request volume by API endpoint type. Each row shows the endpoint pattern, a friendly label, and its share of total traffic:| Endpoint | Description |
|---|---|
/mcp-server | MCP Gateway |
/chat/completions | Chat Completion |
/agent/responses | Agent Response |
/messages | Anthropic Messages |
/proxy | Proxy |
/responses | OpenAI Responses |
/embeddings | Embedding |
/completions | Completion |
/v2/rerank | Rerank |
Error Breakdowns
Error Breakdown (Model)
Displays errors from LLM providers, with two views:- By Error Type — groups errors by HTTP status code (e.g. HTTP 404, HTTP 400, HTTP 429). This tells you whether errors are client-side (4xx) or provider-side (5xx).
- By Virtual Account — shows which virtual accounts are experiencing the most errors.
Error Breakdown (MCP)
Same structure as the model error breakdown, but for MCP traffic. This helps you quickly identify if a specific MCP server or integration is experiencing reliability issues.Guardrails Summary
Shows the total number of guardrail evaluations and a ranked list of which guardrail groups are triggering most often. Each row shows the guardrail group name and the number of evaluations, with a breakdown of outcomes (blocked, flagged, mutated). This gives you a quick sense of whether your content policies are having the intended impact without needing to visit the full Guardrail Metrics tab.Top Usage Rankings
The bottom section of the Overview tab shows ranked leaderboards to help you quickly identify the biggest consumers and most active components:| Leaderboard | What it shows |
|---|---|
| Top Models | Most-used models by request count |
| Top Model Providers | Most-used providers (e.g. Google Vertex, OpenAI, AWS Bedrock) |
| Top Users by Model | Users making the most LLM requests |
| Top Virtual Accounts by Model | Virtual accounts with the most LLM traffic |
| Top Users by MCP | Users making the most MCP requests |
| Top Virtual Accounts by MCP | Virtual accounts with the most MCP traffic |
| Top MCP Servers | MCP servers receiving the most requests |
| Top Tools | Individual MCP tools called most frequently |
- Identify which models and providers are carrying the most load.
- Spot unexpected usage patterns — a user or virtual account consuming far more than expected may indicate a misconfiguration or runaway process.
- Understand MCP adoption — see which servers and tools are being actively used.