/config → status line → enable context window usage

/status Command

Shows account info, current model, and usage details.

Cost Management & Monitoring | Amir Brooks

Claude Code Mastery/Lesson 14

Preview lesson

Cost Management & Monitoring

Track spending, reduce token usage, and set up enterprise-grade observability

reading55 min9 min readFree

Cost Management & Monitoring

Claude Code is powerful, but power costs money. This final lesson teaches you how to track, reduce, and monitor your spending — from individual developer habits to enterprise-wide observability.

Cost Overview

Average Costs

Metric	Value
Average daily cost per developer	~$6
90th percentile daily cost	<$12
Monthly average (Sonnet 4.5)	~$100-200/developer

Costs vary based on codebase size, query complexity, conversation length, model choice, and number of parallel instances.

Tracking Usage

/cost Command

/cost

Total cost: $0.55
Total duration (API): 6m 19.7s
Total duration (wall): 6h 33m 10.2s
Total code changes: 0 lines added, 0 lines removed

Status Line

Configure your status line to show context usage continuously:

<details> <summary>💡 Reveal Answer</summary> </details> <details> <summary>Answer</summary> </details> <details> <summary>💡 Reveal Answer</summary> </details> <details> <summary>💡 Reveal Answer</summary> </details> <details> <summary>Answer</summary> </details> <details> <summary>Best approach</summary> </details>

Resources

Anthropic Console

Full access

Continue the full course

Unlock all 14 lessons, templates, and resources for Claude Code Mastery. Free.

Start learning free All-access membership

Back to course Member login

/config → status line → enable context window usage

Task	Recommended Model	Why
Architecture decisions	Opus	Deep reasoning needed
Daily coding	Sonnet	Good balance
Simple lookups/formatting	Haiku	Fast and cheap
Complex plan → implement	opusplan	Best of both

Instead of...	Use...
GitHub MCP server	`gh` CLI
AWS MCP server	`aws` CLI
Sentry MCP server	`sentry-cli`

Team Size	TPM per User	RPM per User
1-5	200k-300k	5-7
5-20	100k-150k	2.5-3.5
20-50	50k-75k	1.25-1.75
50-100	25k-35k	0.62-0.87

export CLAUDE_CODE_ENABLE_TELEMETRY=1
export OTEL_METRICS_EXPORTER=otlp
export OTEL_LOGS_EXPORTER=otlp
export OTEL_EXPORTER_OTLP_PROTOCOL=grpc
export OTEL_EXPORTER_OTLP_ENDPOINT=http://your-collector:4317

Variable	Default	Effect
`OTEL_LOG_USER_PROMPTS`	Disabled	Include prompt content in logs
`OTEL_LOG_TOOL_DETAILS`	Disabled	Include MCP server/tool names

{
  "env": {
    "CLAUDE_CODE_ENABLE_TELEMETRY": "1",
    "OTEL_METRICS_EXPORTER": "otlp",
    "OTEL_LOGS_EXPORTER": "otlp",
    "OTEL_EXPORTER_OTLP_PROTOCOL": "grpc",
    "OTEL_EXPORTER_OTLP_ENDPOINT": "http://your-collector:4317"
  }
}

Strategy	Impact	Effort
Use Sonnet instead of Opus for routine tasks	High	Low
Clear context between unrelated tasks	High	Low
Use CLI tools instead of MCP servers	Medium	Low
Install code intelligence plugins	Medium	Low
Enable Tool Search for MCP	Medium	Low
Write focused CLAUDE.md (not verbose)	Medium	Low
Use Haiku for simple subagents	Medium	Low
Custom compaction instructions	Medium	Medium
Filter output via hooks	High	Medium
Use skills for domain knowledge	Medium	Medium
Set up OTel monitoring	High	High

Switch default model to Sonnet (team settings):
```
{ "model": "sonnet" }
```
Biggest impact — Sonnet is significantly cheaper than Opus.

Enable Tool Search for everyone:

{ "env": { "ENABLE_TOOL_SEARCH": "auto:5" } }

Replace MCP servers with CLIs — audit which MCP servers are used vs. idle.
Install LSP plugins for your primary languages — reduces exploratory token usage.

Add compaction instructions to CLAUDE.md:

# Compact instructions
Focus on code changes and test results. Drop verbose exploration logs.

Set up OTel monitoring to identify heavy users and expensive patterns.

Add cost awareness to CLAUDE.md:

# Cost
- Use /clear between unrelated tasks
- Prefer Haiku for simple questions

Concept	One-Liner
Average cost	~$6/dev/day, ~$100-200/dev/month with Sonnet
Track usage	`/cost` for session; `/context` for what's consuming space
Biggest savings	Right model choice + clearing context between tasks
MCP overhead	CLI tools have zero persistent cost; MCP adds to every message
Tool Search	`ENABLE_TOOL_SEARCH=auto:5` auto-defers idle tools
Headless limits	`--max-budget-usd` and `--max-turns`
OTel monitoring	`CLAUDE_CODE_ENABLE_TELEMETRY=1` + OTLP exporter
Privacy	`OTEL_LOG_USER_PROMPTS` and `OTEL_LOG_TOOL_DETAILS` disabled by default
Team scaling	TPM per user decreases as team size increases

/status Command

Cost Management & Monitoring

Cost Management & Monitoring

Cost Overview

Average Costs

Tracking Usage

/cost Command

Status Line

Resources

Continue the full course

/status Command

🧠 Knowledge Check

Reducing Token Usage

1. Clear Between Tasks

2. Custom Compaction Instructions

3. Choose the Right Model

4. Reduce MCP Server Overhead

5. Use Tool Search

6. Install Code Intelligence Plugins

7. Offload to Hooks

8. Use Skills for Domain Knowledge

9. Subagent Model Selection

✍️ Recall Exercise

🧠 Knowledge Check

Agent Team Cost Management

Team Rate Limits

OpenTelemetry Monitoring

Quick Setup

What's Exported

Privacy Controls

Multi-Team Support

Managed Configuration (Enterprise)

🧠 Knowledge Check

Headless Cost Controls

✍️ Recall Exercise

Cost Optimization Checklist

🔨 Try It Yourself

🤔 What Would You Do?

📋 Key Takeaways (Review These Tomorrow)

🎓 Course Complete!

Your Next Steps