What is the difference between model tiers and effort levels in Claude Code?

Model tiers determine the underlying capability of the model being used. Haiku is the fastest and least capable, Sonnet is the general-purpose default, and Opus is the most capable for complex reasoning. Effort levels are a separate control that governs how many internal reasoning tokens the model spends before producing a response. Low effort is fast and minimal; max effort removes the reasoning token ceiling entirely. The two settings are independent, so you can run Opus at low effort or Sonnet at high effort depending on the task.

Does the opusplan alias give you the 1 million token context window in Claude Code?

No. According to open bug reports on the Claude Code GitHub repository, selecting opusplan shows a 200K context window even on accounts where selecting Opus directly provides the full 1 million token window. The opusplan alias automatically switches from Opus during planning to Sonnet during execution, which is convenient, but it does not guarantee the expanded context. If you need the full 1M window for a large codebase, you should select Opus explicitly rather than using the opusplan alias.

Does adding ultrathink to CLAUDE.md make Claude always use high effort?

No. The ultrathink keyword only raises effort to high for the single response in which it appears, then the session reverts to its configured effort level. Placing ultrathink in CLAUDE.md means the model applies high effort once when it processes that file at session start, not persistently throughout the session. To set a persistent effort level, use the --effort flag at launch, configure it in settings.json for low through high, or set the CLAUDE_CODE_EFFORT_LEVEL environment variable for the max level.

engineering · 7 min read · Apr 19, 2026

Claude Code model tiers and effort levels, explained plainly

Choosing the wrong model or effort level in Claude Code wastes tokens silently. Here is what each setting actually controls.

Source: hackernoon · Oleg Efimov · open original ↗

Claude Code offers three model tiers and four effort levels that independently control reasoning depth and cost, but most developers never adjust either.

— Haiku suits fast lookups and mechanical tasks; Sonnet handles most daily coding work.
— Opus unlocks 1M token context on Max, Team, and Enterprise plans automatically.
— Plan mode enforces read-only access at the tool level, not just as a soft guideline.
— The opusplan alias switches from Opus to Sonnet at execution time but loses the 1M context window.
— Effort levels (low, medium, high, max) control internal reasoning token budget before output.
— Anthropic changed Opus 4.6 default effort from high to medium in v2.1.68, causing community friction.
— Subagents inherit the parent session model and effort unless explicitly overridden in frontmatter.
— Higher effort can cause overthinking on simple tasks, producing verbose output where directness was needed.

Astrobobo tool mapping

Knowledge Capture Document your chosen model-effort defaults and the reasoning behind them so future contributors understand the cost tradeoffs without re-deriving them.
Daily Log Note which tasks prompted you to reach for high effort or Opus today; after a week, review whether those escalations were justified or habitual.
Focus Brief Before starting a complex multi-file refactor, write a one-paragraph scope statement and use it as the basis for a plan-mode session rather than jumping straight to edits.
Reading Queue Queue the next article in this series on context window consumption, since effort levels directly affect how quickly the context window fills.

Frequently asked

Model tiers determine the underlying capability of the model being used. Haiku is the fastest and least capable, Sonnet is the general-purpose default, and Opus is the most capable for complex reasoning. Effort levels are a separate control that governs how many internal reasoning tokens the model spends before producing a response. Low effort is fast and minimal; max effort removes the reasoning token ceiling entirely. The two settings are independent, so you can run Opus at low effort or Sonnet at high effort depending on the task.

Share X LinkedIn

cite ▸

APA

Oleg Efimov. (2026, April 19). Claude Code model tiers and effort levels, explained plainly. Astrobobo Content Engine (rewrite of hackernoon). https://astrobobo-content-engine.vercel.app/article/claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb

MLA

Oleg Efimov. "Claude Code model tiers and effort levels, explained plainly." Astrobobo Content Engine, 19 Apr 2026, https://astrobobo-content-engine.vercel.app/article/claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb. Based on "hackernoon", https://hackernoon.com/navigating-claude-code-models-tiers-and-effort?source=rss.

BibTeX

@misc{astrobobo_claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb_2026,
  author       = {Oleg Efimov},
  title        = {Claude Code model tiers and effort levels, explained plainly},
  year         = {2026},
  url          = {https://astrobobo-content-engine.vercel.app/article/claude-code-model-tiers-and-effort-levels-explained-plainly-4ca9fb},
  note         = {Astrobobo rewrite of hackernoon, https://hackernoon.com/navigating-claude-code-models-tiers-and-effort?source=rss},
}

#claude #llm #coding #tokens #agents #productivity

Claude Code model tiers and effort levels, explained plainly

Astrobobo tool mapping

Frequently asked

Related insights

Vibe Coding Triggers a Dopamine Loop That Undermines Engineering Judgment

Deterministic Routing Cuts Tail Latency by Aligning Requests With Data

How GCP Architects Should Actually Use Generative AI