The AI Desk
MONDAY, 14 APRIL 2025 From the desk of Amit Singhal Vol. I · The ChatGPT Era
All news PRODUCTS

OpenAI releases GPT-4.1 and the model-naming chapter starts to look ridiculous

A three-model release at lower prices and longer contexts, with naming so confusing the launch post had to include a 'which model should I use' diagram.

OpenAI releases GPT-4.1 and the model-naming chapter starts to look ridiculous

On 14 April 2025, OpenAI released GPT-4.1, GPT-4.1 mini and GPT-4.1 nano, a three-tier API-only update to the GPT-4-class line-up. The headline features were a one-million-token context window, lower prices across all three sizes, and improved coding-benchmark numbers, including a step up to fifty-five per cent on SWE-bench Verified for the full GPT-4.1.

What the release also produced, almost immediately, was naming bewilderment. As of mid-April 2025, OpenAI's model menu included GPT-3.5 (legacy), GPT-4 (legacy), GPT-4 Turbo, GPT-4o, GPT-4o-mini, GPT-4.1, GPT-4.1 mini, GPT-4.1 nano, o1-preview, o1, o1-mini, o1-pro and o3 across various tiers. Sam Altman, in a same-day post, acknowledged the issue and announced that GPT-5, when it shipped, would unify the line-up and retire most of these names. The unification took until August.

What GPT-4.1 actually offered

The substantive interest in the release was not the capability frontier. It was the price-per-token and the long-context window. As Wired and Ars Technica wrote in coverage that week, the GPT-4.1 mini at forty cents per million input tokens was, on most benchmarks, comparable to GPT-4o at two-fifty per million. The price-performance line had moved meaningfully in the eleven months since GPT-4o.

The naming was the controversy. The pricing was the news.

The retrospective view of GPT-4.1, four months later when GPT-5 shipped, is that it was a price-cut and quota-bump release misframed as a generational change by its naming. The four hundred-thousand-token to one-million-token context jump was the most operationally consequential improvement, and one that landed quietly at a moment when the field's attention was on reasoning.

Originally reported by OpenAI (OpenAI) on 14 April 2025. Read the original report →
← Previous
Anthropic introduces Claude 3.7 Sonnet and reasoning stops being a separate product tier
Next →
Anthropic ships Claude 4 Opus and Sonnet, and the coding-benchmark frontier moves a non-trivial amount

Discussion

Email used only for your avatar. Never shown, never stored in plain text.