Claude Code model tiers and effort levels, explained plainly
Choosing the wrong model or effort level in Claude Code wastes tokens silently. Here is what each setting actually controls.
Claude Code offers three model tiers and four effort levels that independently control reasoning depth and cost, but most developers never adjust either.
- — Haiku suits fast lookups and mechanical tasks; Sonnet handles most daily coding work.
- — Opus unlocks 1M token context on Max, Team, and Enterprise plans automatically.
- — Plan mode enforces read-only access at the tool level, not just as a soft guideline.
- — The opusplan alias switches from Opus to Sonnet at execution time but loses the 1M context window.
- — Effort levels (low, medium, high, max) control internal reasoning token budget before output.
- — Anthropic changed Opus 4.6 default effort from high to medium in v2.1.68, causing community friction.
- — Subagents inherit the parent session model and effort unless explicitly overridden in frontmatter.
- — Higher effort can cause overthinking on simple tasks, producing verbose output where directness was needed.
Astrobobo tool mapping
- Knowledge Capture Document your chosen model-effort defaults and the reasoning behind them so future contributors understand the cost tradeoffs without re-deriving them.
- Daily Log Note which tasks prompted you to reach for high effort or Opus today; after a week, review whether those escalations were justified or habitual.
- Focus Brief Before starting a complex multi-file refactor, write a one-paragraph scope statement and use it as the basis for a plan-mode session rather than jumping straight to edits.
- Reading Queue Queue the next article in this series on context window consumption, since effort levels directly affect how quickly the context window fills.
Frequently asked
- Model tiers determine the underlying capability of the model being used. Haiku is the fastest and least capable, Sonnet is the general-purpose default, and Opus is the most capable for complex reasoning. Effort levels are a separate control that governs how many internal reasoning tokens the model spends before producing a response. Low effort is fast and minimal; max effort removes the reasoning token ceiling entirely. The two settings are independent, so you can run Opus at low effort or Sonnet at high effort depending on the task.
cite ▸
APA
Oleg Efimov. (2026, April 19). Claude Code model tiers and effort levels, explained plainly. Astrobobo Content Engine (rewrite of hackernoon). https://astrobobo-content-engine.vercel.app/article/claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb
MLA
Oleg Efimov. "Claude Code model tiers and effort levels, explained plainly." Astrobobo Content Engine, 19 Apr 2026, https://astrobobo-content-engine.vercel.app/article/claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb. Based on "hackernoon", https://hackernoon.com/navigating-claude-code-models-tiers-and-effort?source=rss.
BibTeX
@misc{astrobobo_claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb_2026,
author = {Oleg Efimov},
title = {Claude Code model tiers and effort levels, explained plainly},
year = {2026},
url = {https://astrobobo-content-engine.vercel.app/article/claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb},
note = {Astrobobo rewrite of hackernoon, https://hackernoon.com/navigating-claude-code-models-tiers-and-effort?source=rss},
}