The AI Desk
THURSDAY, 16 APRIL 2026 From the desk of Amit Singhal Vol. I · The ChatGPT Era
All news RESEARCH

Anthropic ships Claude 4.7 with a one-million-token context window for Sonnet and Opus

A million tokens of context across the Sonnet and Opus tiers, agentic-task improvements, and the first publicly-available frontier model that can hold an entire mid-sized codebase in working memory.

Anthropic ships Claude 4.7 with a one-million-token context window for Sonnet and Opus

On 16 April 2026, Anthropic released Claude 4.7. The two models in the family, Sonnet and Opus, both shipped with a one-million-token context window, matching what GPT-5.2 had set as the new commercial floor in January. The pricing held the existing Sonnet and Opus tiers, with the long-context capability included rather than priced as a premium tier.

The capability shift, in the launch post and in the independent reviews from Wired, MIT Technology Review and The Verge, sat in long-context coherence. Frontier models had supported long contexts for over a year by the time of this release. The gap was that performance on long-context tasks, particularly long-context reasoning across codebases or document collections, had typically degraded substantially as the context filled up. Claude 4.7's reported long-context-degradation curve, on the public Needle-in-a-Haystack-style evaluations, was substantially flatter than its predecessors.

What an entire codebase in context actually does

The most-tested implication was on whole-codebase software-engineering tasks. As reported in the Anthropic launch post, agentic-coding tasks that had previously required retrieval-augmented-generation pipelines to feed relevant code into the model were, with Claude 4.7, plausibly executable by simply pasting the entire mid-sized repository into the context window. For a Python codebase of around 200,000 lines (a representative mid-sized application), the token count fell within the one-million-token budget. As The Information's coverage the following week pointed out, this changes the architecture of agentic-coding products in a non-trivial way: the agent no longer needs to know which files to retrieve, because all files are already loaded.

Long-context performance, April 2026
Reasoning accuracy at full-context length, public reporting
Claude 4.7 Opus 91.2 % Claude 4.7 Sonnet 89.5 % GPT-5.5 81.4 % Gemini 3 Pro Deep Think 84.7 % Claude 4.6 Opus 76.8 %
Long-context retrieval-and-reasoning composite score, depth=1M tokens.
Whole-codebase context is a different working pattern, not a quantitative improvement on the old one.

The downstream implications for the developer-tooling category that had built up around Claude Code, GitHub Copilot, Cursor and the agentic-coding tail are, as of writing, working through. The retrieval-augmented-generation patterns that had been the standard architecture for AI-assisted coding are not deprecated, but they are no longer the only architecture. The next two quarters will tell whether the working pattern shifts substantively or not.

Originally reported by CNBC (CNBC) on 16 April 2026. Read the original report →
← Previous
Anthropic launches Dispatch and packages agentic AI as a managed workflow service
Next →
Anthropic ships Claude Design and the AI-augmented design tool gets its first credible incumbent challenger

Discussion

Email used only for your avatar. Never shown, never stored in plain text.