Claude Model Comparison Guide

Practical guidance on choosing the right Claude model for each task in Claude Code.

Model Family Overview

Claude Code uses the Claude 4.5/4.6 model family. The three tiers trade off speed, cost, and reasoning depth:

Model	Speed	Cost (Input/Output per 1M tokens)	Context Window	Max Output	Best For
Haiku 4.5	Fastest	$1 / $5	200K tokens	64K tokens	Simple lookups, formatting, boilerplate
Sonnet 4.6	Fast	$3 / $15	1M tokens	64K tokens	Standard implementation, editing, testing
Opus 4.6	Slower	$5 / $25	1M tokens	128K tokens	Complex reasoning, architecture, debugging

Deprecation Notice

Claude Haiku 3 (claude-3-haiku-20240307) retires April 19, 2026. Migrate to Haiku 4.5.
Opus 4.1 is still available but costs 3x more than Opus 4.6 ($15/$75 vs $5/$25). Upgrade immediately.
Legacy models (Sonnet 4.5, Opus 4.5) remain available but are superseded by the 4.6 generation.

When to Use Each Model

Haiku 4.5 — The Quick Worker

Use Haiku for tasks that need speed over depth:

File searches and exploration — “Find all files that import UserService”
Formatting and boilerplate — “Generate TypeScript interfaces from this JSON”
Simple lookups — “What’s the return type of this function?”
Code generation from templates — “Create a new React component following the existing pattern”
Documentation writing — READMEs, comments, migration notes

Don’t use Haiku for: Complex debugging, architecture decisions, multi-step reasoning, or code that needs careful security review.

Sonnet 4.6 — The Daily Driver

Sonnet handles 80% of daily coding work:

Feature implementation — Building new features across multiple files
Bug fixes — Diagnosing and fixing bugs with test verification
Code editing — Refactoring, renaming, extracting functions
Test writing — Unit tests, integration tests, test data generation
Code review — Finding issues and suggesting improvements
Build and type errors — Fixing compilation failures

Don’t use Sonnet for: Architecture-level decisions affecting the entire codebase, extremely subtle bugs requiring deep reasoning, or critical security audits.

Opus 4.6 — The Deep Thinker

Reserve Opus for tasks where getting it wrong is expensive:

Architecture decisions — System design, API design, data modeling
Complex debugging — Multi-layer bugs, race conditions, subtle logic errors
Security audits — Vulnerability analysis requiring deep understanding
Code review (critical) — API contracts, backward compatibility, edge cases
Multi-step reasoning — Tasks requiring 5+ steps of connected logic
Team lead coordination — Orchestrating multiple agents, plan verification

Don’t use Opus for: Simple edits, file searches, boilerplate generation, or any task Sonnet can handle equally well.

Sub-Agent Model Routing

When spawning sub-agents with the Agent/Task tool, set the model parameter:

# Simple file search
Agent(model: "haiku", prompt: "Find all API route handlers in src/")

# Standard implementation
Agent(model: "sonnet", prompt: "Implement the pagination for the users endpoint")

# Architecture review
Agent(model: "opus", prompt: "Review the authentication architecture for security issues")

Routing Rules of Thumb

Task Type	Model	Why
File discovery, grep, glob	haiku	Speed matters, reasoning doesn’t
Standard code changes	sonnet	Good balance of quality and cost
Writing tests	sonnet	Pattern-following, not deep reasoning
Fixing type errors	sonnet	Mechanical corrections
Writing docs	haiku	Template-based, fast iteration
Debugging (simple)	sonnet	Most bugs are straightforward
Debugging (complex)	opus	Subtle bugs need deep reasoning
Architecture review	opus	Decisions are expensive to reverse
Security review	opus	Missing a vulnerability is costly
Plan verification	opus	Catching plan flaws saves hours

Cost Optimization

The 80/20 Rule

80% of tasks should use Sonnet — it’s your default
15% of tasks can use Haiku — simple, fast, cheap
5% of tasks need Opus — complex, critical, expensive

Practical Tips

Start with Sonnet, escalate to Opus when stuck. If Sonnet can’t solve it in 2 attempts, switch to Opus.
Use Haiku for parallel fan-out. When spawning 3-4 agents for exploration, use Haiku to keep costs down.
Don’t use Opus for verification. Sonnet can verify work — Opus is for generating complex solutions.
Batch simple tasks. Instead of 4 separate Haiku calls, combine into one Sonnet call if the tasks are related.
Monitor with /context. If you’re burning through context quickly, you might be over-reasoning on simple tasks.

Cost Comparison (Approximate)

For a typical 10-file feature implementation session (~50K tokens in, ~20K tokens out):

Strategy	Input Cost	Output Cost	Total
All Opus	$0.25	$0.50	$0.75
All Sonnet	$0.15	$0.30	$0.45
Smart routing (Haiku + Sonnet + Opus)	$0.10	$0.20	$0.30
All Haiku	$0.05	$0.10	$0.15

Smart routing — using the right model for each task — typically saves 30-50% versus defaulting to the most capable model.

Model Selection Flowchart

Is it a simple lookup or search?
  → Yes: Haiku
  → No: ↓

Is it standard implementation, editing, or testing?
  → Yes: Sonnet
  → No: ↓

Does it involve architecture, security, or complex debugging?
  → Yes: Opus
  → No: Sonnet (default)

When in doubt, use Sonnet. It handles most tasks well and you can always escalate.

Extended Thinking & Adaptive Thinking

All three current models support Extended Thinking — Claude explicitly reasons through complex problems before responding. This improves accuracy on architecture decisions, debugging, and multi-step logic at the cost of additional tokens.

Opus 4.6 and Sonnet 4.6 also support Adaptive Thinking — Claude automatically adjusts reasoning depth based on task complexity. Simple tasks get fast responses; complex tasks get deeper analysis.

When to enable Extended Thinking:

Architecture decisions with multiple trade-offs
Complex debugging where the root cause isn’t obvious
Security review requiring systematic analysis
Plan verification where missing a flaw is costly

When to skip it:

File searches and exploration (use Haiku)
Standard implementation (Sonnet default is fine)
Any task where speed matters more than depth