Documentation Index
Fetch the complete documentation index at: https://docs.tensorcost.com/llms.txt
Use this file to discover all available pages before exploring further.
Get started
This guide takes you from zero to your first verified recommendation. Three steps, and the activation event — your first accepted recommendation — is typically reached within 48 hours of connecting your first account.TensorCost is built around three workload classes — GPU fleets, managed inference (Bedrock and friends), and agent workloads. You don’t have to connect all three to get value; most customers start with whichever one is dominating their bill.
Prerequisites
- A TensorCost workspace. Sign up at tensorcost.com or accept your design-partner invite.
- One of:
- An AWS account with Bedrock usage (or CUR 2.0 enabled), or
- An Azure subscription with Azure OpenAI usage, or
- A GCP project with Vertex AI usage, or
- GPU instances on AWS / Azure / GCP / Kubernetes / bare-metal that you can install the unified agent on.
- An admin (or someone who can deploy a CloudFormation stack / Terraform module / Helm chart in your environment).
Step 1 — Create your tenant and invite your team
Sign up
Visit tensorcost.com or follow the design-partner invite email. Cognito-backed sign-up; SSO is enabled via your tenant admin once the tenant is provisioned.
Invite teammates
From the shell sidebar, open Settings → Members. Three roles ship by default:
| Role | What they can do |
|---|---|
member | Read dashboards, accept/dismiss recommendations, see their own team’s spend. |
admin | All of member plus connect cloud accounts, manage agents, configure alert routes, set budgets. |
owner | All of admin plus billing, tenant deletion, RBAC changes. |
Step 2 — Connect your first source
Pick the path that matches what’s burning the most money first. You can layer in the others later.Path A — Amazon Bedrock (lead managed-inference path)
This is the fastest path to a first recommendation because it requires no agent install.Open the Bedrock wizard
Integrations → Add AWS Bedrock. The wizard auto-suggests an
ExternalId — accept it.Choose onboarding mode
SingleAccount (default — one AWS account) or Organization (consolidated billing with payer + member-account jump roles). Most early customers run SingleAccount. See bedrock integration for the multi-account variant.Enable Bedrock model-invocation logging
AWS console → Bedrock → Settings → Model invocation logging → CloudWatch destination. Note the log-group ARN.
Deploy the CloudFormation stack
Click the one-click CFN link in the wizard. The stack creates exactly one IAM role (
TensorCost-BedrockReader-<ExternalId>) with read-only Bedrock + CloudWatch permissions and an external-ID-bound trust policy. Nothing else.Path B — Install the unified GPU agent
For GPU fleets running on EC2, EKS/GKE/AKS, on-prem Slurm, or Ray. Full guide in agent installation.Path C — Azure OpenAI / Vertex / OpenAI API / Anthropic API
Same pattern as Bedrock, with provider-specific credentials. Integrations → Add provider → pick the source. Each adapter ingests:- Per-request: model, input tokens, output tokens, latency, cache-hit rate
- Daily billing: cost normalized to
ai_spend_events - Tags / metadata: mapped to your
application/team/environment/owner
Step 3 — See your first recommendation
Within 48 hours of connecting your first source, the Recommendations feed populates. Each entry includes:- A specific, dollar-quantified change (“route 14% of
customer-support-agenttraffic from Claude Opus 4.6 to Haiku 4.5 — $4,200/month”). - The evidence (sample request IDs, cost breakdown, A/B plan).
- Accept / dismiss-with-reason / snooze actions.
What to do next
Set up alert routes
Slack, PagerDuty, email, Microsoft Teams, custom webhook.
Define budget hierarchies
Tenant → team → application. Burn-rate alerts at 50%, 80%, 100%.
Connect your second source
Coverage compounds. Customers with all three workload classes see 2× the recommendations.
Wire up MCP
Query TensorCost from Claude Desktop or your own agents.
When you get stuck
- Check the Sync history drawer on the connection — every error from STS, CloudWatch, or the IAM trust policy surfaces here with a remediation link.
- Common day-1 failures (and remediations) are catalogued in our customer onboarding runbook.
- Email support@tensorcost.com — design partners get a shared Slack Connect channel.