Opus 4.7 Review: Features, Performance & How to Use It (2026 Guide)

A practical, no-hype breakdown of what Claude Opus 4.7 actually delivers — from benchmark numbers and real tester quotes to API setup, pricing, and honest trade-offs.

What Is Opus 4.7?

Opus 4.7 is Anthropic's latest flagship large language model, released on April 16, 2026, as a direct upgrade to Claude Opus 4.6. It is designed for demanding software engineering, long-horizon agentic tasks, advanced document reasoning, and high-resolution multimodal work.

Built by Anthropic, Opus 4.7 sits at the top of the Claude 4 model family, below the limited-release Claude Mythos Preview but well above Sonnet 4.6 in raw capability. It follows a clear version lineage: Claude 3 Opus → Claude 4 Opus (4.5) → Opus 4.6 → Opus 4.7.

The model is aimed primarily at developers, enterprise teams, and power users who need reliable, unsupervised execution on complex, multi-step workloads. Think: autonomous coding agents, financial analysis pipelines, legal document review, and life-sciences research tools. In each case, accuracy and instruction fidelity matter more than cost per token.

Key Features & Upgrades in Opus 4.7

Core Improvements Over Opus 4.6

Opus 4.7 isn't a minor patch. Across every dimension that matters in production — instruction adherence, visual understanding, memory, and agentic reliability — the gap over 4.6 is meaningful and consistent across independent testers.

Advanced Software Engineering

Notable gains on the hardest coding tasks, including multi-file refactors, race-condition detection, and autonomous verification of its own outputs.

High-Resolution Vision

Images up to 2,576px on the long edge (~3.75 MP) — more than 3× prior Claude models. Critical for dense screenshots, diagrams, and computer-use agents.

Precise Instruction Following

Substantially stricter literal compliance. Prompts that worked loosely on 4.6 may produce unexpected results — re-tuning is recommended.

Cross-Session Memory

Better use of file-system memory to carry important notes across multi-session, long-horizon work without re-providing context from scratch.

Loop Resistance

Opus 4.7 pushes through tool failures and ambiguous states rather than stalling — a critical improvement for production agentic pipelines.

Document & Finance Reasoning

State-of-the-art on Finance Agent and GDPval-AA benchmarks, with 21% fewer document-reasoning errors than 4.6 in enterprise evaluations.

New Capabilities & Algorithm/Engine Updates

| Capability | Opus 4.6 | Opus 4.7 |
| --- | --- | --- |
| Max image resolution | ~1 MP | 3.75 MP (2,576px long edge) |
| Effort levels | low / medium / high / max | low / medium / high / xhigh / max |
| Tokenizer | Standard Claude 4 | Updated tokenizer (1.0–1.35× tokens) |
| Cross-session memory (file system) | Partial | Substantially improved |
| Task budgets (API) | Not available | Public beta |
| Self-verification of outputs | Inconsistent | Systematic (self-checks planning logic) |
| Loop resistance in tool use | Moderate | Best-in-class (high quality per tool call) |

UI/UX, Workflow & Integration Enhancements

Beyond raw intelligence, Anthropic shipped several ecosystem updates alongside Opus 4.7:

  • Claude Code — /ultrareview command: A dedicated review session that flags bugs and design issues the way a careful senior engineer would. Pro and Max users get three free ultrareviews at launch.
  • Auto mode extended to Max users: Claude makes permission decisions autonomously, enabling longer unattended runs in Claude Code.
  • Task budgets in public beta (API): Developers can now guide Claude's token spend across long agentic runs — essential for cost-controlled production pipelines.
  • Claude Design launch: A new Anthropic Labs product for collaborative visual work (designs, prototypes, slides) ships on the same day as Opus 4.7.
  • xhigh effort default in Claude Code: The default effort in Claude Code is now raised to xhigh across all plans, balancing thoroughness with latency automatically.
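The task-budgets beta above can be sketched as a request payload. This is an illustration only: the `task_budget` field name and its shape are assumptions for this sketch, not Anthropic's documented schema — check the task-budgets beta docs for the real parameter.

```python
# Sketch: cap total token spend on a long agentic run via the
# task-budgets beta. The "task_budget" field name below is an
# assumed placeholder, not a confirmed API field.

def build_budgeted_request(prompt: str, max_total_tokens: int) -> dict:
    """Build a messages-API payload with a hypothetical task budget attached."""
    return {
        "model": "claude-opus-4-7",
        "max_tokens": 4096,  # per-response output cap
        "task_budget": {"max_total_tokens": max_total_tokens},  # assumed beta field
        "messages": [{"role": "user", "content": prompt}],
    }

payload = build_budgeted_request("Resolve open GitHub issue #1234", 500_000)
```

Keeping the budget in the payload (rather than in application logic) lets the model plan its own spend across the run, which is the point of the feature.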

Performance, Benchmarks & Accuracy

Real numbers from Anthropic's release page and independent tester evaluations. All figures sourced from Anthropic's April 16, 2026 announcement unless otherwise noted.

Speed, Resource Efficiency & Output Quality

Performance gains vs Opus 4.6:

  • +13% coding benchmark resolution lift (93-task eval)
  • 70% CursorBench score, up from 58%
  • 98.5% on XBOW's visual-acuity benchmark, up from 54.5%
  • 90.9% BigLaw Bench accuracy at high effort (Harvey)
  • 3× more production tasks resolved (Rakuten-SWE-Bench)
  • 21% fewer document-reasoning errors (Databricks OfficeQA Pro)

Head-to-Head: Opus 4.7 vs Competitors & Previous Versions

| Benchmark | Opus 4.7 | Opus 4.6 | Notes |
| --- | --- | --- | --- |
| CursorBench (Cursor) | 70% | 58% | +12 pp improvement |
| Linear coding benchmark (93 tasks) | +13% resolution | Baseline | Solved 4 tasks neither 4.6 nor Sonnet 4.6 could |
| Visual acuity (XBOW) | 98.5% | 54.5% | Near-complete elimination of prior pain point |
| BigLaw Bench (Harvey, high effort) | 90.9% | Not reported | Best-in-class legal accuracy |
| Rakuten-SWE-Bench | 3× more tasks | Baseline | Also double-digit gains in code & test quality |
| Databricks OfficeQA Pro | 21% fewer errors | Baseline | Best Claude model for enterprise document analysis |
| Finance Agent / GDPval-AA | State-of-the-art | Previous SOTA | Finance + legal + knowledge work |
| Notion multi-step workflows | +14% over Opus 4.6 | Baseline | At fewer tokens, ⅓ the tool errors |
| Factory Droids agentic tasks | 10–15% lift | Baseline | Fewer tool errors, better validation follow-through |
| CodeRabbit code review recall | >10% improvement | Baseline | Stable precision; slightly faster than GPT-5.4 xhigh |

Benchmark context: Comparisons above use the best API-available versions of GPT-5.4 and Gemini 3.1 Pro where applicable. SWE-bench memorization screens were applied; Opus 4.7's improvement margin holds after exclusions. Terminal-Bench 2.0 was run with thinking disabled, 1× guaranteed/3× ceiling resource allocation averaged over five attempts per task.

Use Cases & Industry Applications

Where Opus 4.7 earns its price — and where it doesn't.

Best For: Developers & Engineering Teams

Can Opus 4.7 be used for autonomous software engineering? Yes, this is its strongest suit. The model handles complex, long-running coding tasks with the kind of rigor you'd expect from a senior engineer: it plans before writing, verifies its outputs, pushes through tool failures, and catches its own logical errors before reporting back.

  • Scenario: A fintech platform wants to automate resolution of GitHub issues without constant human oversight.
  • Setup: Route issues through a Claude Code agent with Opus 4.7 at xhigh effort + task budgets enabled.
  • Outcome: Cognition's Devin reported Opus 4.7 "works coherently for hours, pushes through hard problems rather than giving up, and unlocks a class of deep investigation work we couldn't reliably run before."

Best For: Legal, Finance & Enterprise Professionals

Can Opus 4.7 be used for legal document review? Absolutely. Harvey reported 90.9% substantive accuracy on BigLaw Bench, with notably smarter handling of assignment versus change-of-control provisions — a distinction that has historically tripped up frontier models. Databricks saw 21% fewer document-reasoning errors on enterprise document analysis.

  • Scenario: A law firm needs to review hundreds of acquisition contracts for specific clause types.
  • Setup: Feed documents as PDFs via the API with the claude-opus-4-7 model string; use high or xhigh effort.
  • Outcome: More accurate clause identification, better citation discipline, and reduced attorney review time.
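The setup above can be sketched as a request payload. The document content block follows the Anthropic messages-API shape for base64 PDFs; the top-level `effort` field mirrors the effort levels described in this review and is an assumed parameter name here, so verify both against the current API reference before relying on them.

```python
import base64

# Sketch: contract-review request with a PDF attached as a base64
# document block. "effort" is an assumed parameter name for the
# effort control described in this review.

def build_review_request(pdf_bytes: bytes, question: str) -> dict:
    return {
        "model": "claude-opus-4-7",
        "max_tokens": 2048,
        "effort": "xhigh",  # assumed field name; see Anthropic's docs
        "messages": [{
            "role": "user",
            "content": [
                {"type": "document",
                 "source": {"type": "base64",
                            "media_type": "application/pdf",
                            "data": base64.b64encode(pdf_bytes).decode("ascii")}},
                {"type": "text", "text": question},
            ],
        }],
    }

req = build_review_request(b"%PDF-1.4 ...", "List all change-of-control clauses.")
```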

Best For: Multimodal & Life Sciences Work

The jump to 3.75 MP image support is a genuine unlock for domains that depend on fine visual detail. Solve Intelligence reported major improvements in reading chemical structures and interpreting complex technical diagrams for life sciences patent workflows. XBOW saw visual acuity jump from 54.5% to 98.5% — a near-complete resolution of their prior computer-use limitation.

Workflow Integration & API/Plugin Support

Opus 4.7 is available across every major cloud and API surface:

  • Anthropic API directly (model ID: claude-opus-4-7)
  • Amazon Bedrock
  • Google Cloud Vertex AI
  • Microsoft Foundry
  • Claude Code (CLI with /ultrareview, auto mode)
  • All Claude products (claude.ai, Claude apps)
  • AI/ML API marketplace at aimlapi.com
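For the direct-API route, a minimal call looks like the sketch below. The endpoint and header names follow Anthropic's public messages API, though the exact `anthropic-version` value you should send may differ; substitute your own API key before sending.

```python
import json

# Sketch: assemble headers and body for a direct messages-API call.
# Nothing is sent here; pass the result to your HTTP client of choice.

API_URL = "https://api.anthropic.com/v1/messages"

def build_call(api_key: str, prompt: str) -> tuple[dict, str]:
    headers = {
        "x-api-key": api_key,
        "anthropic-version": "2023-06-01",  # check docs for the current value
        "content-type": "application/json",
    }
    body = json.dumps({
        "model": "claude-opus-4-7",
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": prompt}],
    })
    return headers, body

headers, body = build_call("YOUR_API_KEY", "Summarize this changelog.")
```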

ROI & Time-Saving Examples

Real efficiency figures from the launch tester cohort:

  • Ramp reported needing "much less step-by-step guidance" for engineering agent teams, directly scaling internal automation workflows.
  • Replit observed the same output quality at lower cost — "more efficient and precise at tasks like analyzing logs, finding bugs, and proposing fixes."
  • Cognition (Devin) unlocked a class of deep-investigation coding work that was previously unreliable, representing an expansion of billable agent capability.
  • One team reported Opus 4.7 autonomously building a complete Rust text-to-speech engine from scratch — neural model, SIMD kernels, browser demo — and self-verifying via a speech recognizer. "Months of senior engineering, delivered autonomously."

Pros, Cons & Expert Verdict

A balanced assessment of Opus 4.7 pros and cons — based on tester reports and benchmark data, not marketing claims.

What Works Exceptionally Well

Strengths

What It Excels At
  • Best-in-class autonomous coding — handles complex tasks with minimal supervision
  • 98.5% visual-acuity unlocks computer-use and diagram-heavy workflows
  • Strong loop resistance and graceful error recovery in agentic systems
  • More honest about limitations — reduced hallucination risk
  • State-of-the-art document and financial reasoning
  • No price increase — remains at $5 / $25 per million tokens
  • Self-verifies outputs, catching planning and logic errors
  • More creative and polished in docs, UI, and presentations

Limitations

Trade-offs
  • ~1.0–1.35× more input tokens due to updated tokenizer
  • Stricter instruction following may require prompt re-tuning
  • Higher output token usage at high effort levels
  • Weaker harm-reduction responses vs previous version
  • Less capable than unreleased Claude Mythos Preview
  • High-res images consume more tokens — downsampling recommended
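The downsampling advice in the last trade-off reduces to simple dimension math: scale so the long edge is at most 2,576px (the limit quoted in this review) while preserving aspect ratio. A pure helper, usable with any image library:

```python
# Compute resize targets so the long edge fits within the 2,576px
# limit quoted in this review, preserving aspect ratio.

def fit_long_edge(width: int, height: int, limit: int = 2576) -> tuple[int, int]:
    long_edge = max(width, height)
    if long_edge <= limit:
        return width, height  # already within the limit; no resize needed
    scale = limit / long_edge
    return round(width * scale), round(height * scale)

print(fit_long_edge(5152, 2000))  # → (2576, 1000)
```

Images already under the limit pass through unchanged, so the helper is safe to apply unconditionally in a preprocessing step.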

Known Limitations & Workarounds

  • Prompt compatibility: The most common migration issue. Opus 4.7 takes instructions literally: where 4.6 might have skipped a step or interpreted a prompt loosely, 4.7 executes exactly what's written. Workaround: Audit your system prompts for implicit assumptions; make instructions explicit and check outputs on a test dataset before rolling out.
  • Token cost creep: The tokenizer change and increased thinking at high effort can push costs upward. Workaround: Use the effort parameter to match task complexity; use task budgets (beta) to cap spend on long runs; add "be concise" instructions to your system prompt for chatty tasks.
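One way to act on the "match effort to task complexity" workaround is a small routing helper. The tiering thresholds below are arbitrary examples for illustration, not guidance from Anthropic; only the five effort-level names come from this review.

```python
# Illustrative cost-control helper: pick the cheapest effort level
# that fits the task. Thresholds are example values, not recommendations.

EFFORT_LEVELS = ("low", "medium", "high", "xhigh", "max")

def pick_effort(steps: int, needs_verification: bool) -> str:
    if steps <= 1:
        return "low"            # single-turn lookups and rewrites
    if steps <= 3:
        return "medium"         # short multi-step tasks
    if not needs_verification:
        return "high"           # long tasks without self-checking
    return "xhigh" if steps <= 20 else "max"

print(pick_effort(2, False))  # → medium
```

Routing like this keeps high-volume simple traffic off the expensive effort tiers while reserving xhigh/max for the agentic runs that benefit from them.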

Who Should Upgrade (And Who Should Wait)

  • Upgrade immediately if you: run autonomous coding agents, rely on computer-use or dense diagram analysis, work in legal/finance document processing, or run multi-step agentic pipelines where loop resistance and error recovery matter.
  • Consider testing first if you: have extensive prompt libraries tuned for Opus 4.6 (plan a re-tuning sprint), or are highly cost-sensitive on high-volume low-complexity tasks where Sonnet 4.6 may be a better fit.
  • Probably wait if you: only use Claude for short, simple single-turn queries where Haiku or Sonnet deliver adequate results at lower cost.

Conclusion & Next Steps

Claude Opus 4.7 is a clear generational step over its predecessor, not an incremental patch. The combination of dramatically improved vision, autonomous coding reliability, document reasoning accuracy, and loop resistance in agentic workflows puts it at the top of the practical-use frontier for enterprise and developer teams, at the same price point as Opus 4.6. The main reasons to pause before upgrading are prompt compatibility (plan a re-tuning sprint) and the modest tokenizer-driven cost increase on high-volume tasks, both of which are solvable with a structured migration approach.

If your work touches any of the model's core strengths — coding agents, visual understanding, legal and financial document review, or long-horizon multi-step automation — Opus 4.7 is worth prioritizing. The tester cohort that participated in early access represents some of the most demanding production workloads in the industry, and the consensus is unusually consistent: this is a meaningful upgrade.

Ready to get started? Access claude-opus-4-7 today via AI/ML API — unified access, instant onboarding, and support for the full Anthropic model family alongside hundreds of other leading models.

Frequently Asked Questions

What's new in Opus 4.7 compared to 4.6?

The key upgrades are: substantially better software engineering on hard tasks, high-resolution vision (up to 3.75 MP vs ~1 MP), stricter instruction following, a new xhigh effort level, improved cross-session file-system memory, better loop resistance in agentic workflows, and task budgets in public beta. Pricing is unchanged.

Does Opus 4.7 support image and multimodal inputs?

Yes. Opus 4.7 accepts images up to 2,576px on the long edge (approximately 3.75 megapixels) — more than three times the resolution supported by prior Claude models. It handles JPEG, PNG, GIF, and WebP formats. Higher-resolution images consume more tokens; downsample before sending if fine detail isn't required.
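In practice, an image goes into the request as a base64 content block restricted to the four formats listed above. The block shape follows Anthropic's documented base64 image source, but verify it against the current API reference before shipping.

```python
import base64

# Sketch: build an image content block for the messages API,
# rejecting formats this FAQ doesn't list.

ALLOWED = {"image/jpeg", "image/png", "image/gif", "image/webp"}

def image_block(data: bytes, media_type: str) -> dict:
    if media_type not in ALLOWED:
        raise ValueError(f"unsupported media type: {media_type}")
    return {
        "type": "image",
        "source": {
            "type": "base64",
            "media_type": media_type,
            "data": base64.b64encode(data).decode("ascii"),
        },
    }

block = image_block(b"\x89PNG...", "image/png")
```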

How do I fix unexpected outputs when migrating from Opus 4.6 to 4.7?

Opus 4.7's stricter instruction following is the most common cause. Prompts that relied on the model loosely or partially interpreting instructions may now behave differently. Audit your system prompts, make implicit instructions explicit, and run a test dataset comparison before rolling out to production. Anthropic has a dedicated migration guide at platform.claude.com/docs.

Is Opus 4.7 available on mobile?

Yes. Opus 4.7 is accessible through Claude.ai on web, iOS, and Android apps, subject to your plan's usage limits. API-based mobile integrations can use the claude-opus-4-7 model string via any REST-capable environment.

How often is Opus 4.7 updated?

Anthropic typically releases model updates on an as-ready basis rather than a fixed schedule. Opus 4.7 is a generally available release. Minor safety and alignment patches may be applied without a version bump; major capability improvements would typically ship as a new version (e.g., a future Opus 4.8 or Opus 5.x). Check Anthropic's news page or the AI/ML API changelog for updates.
