All news PRODUCTS

OpenAI releases GPT-4.1 and the model-naming chapter starts to look ridiculous

A three-model release at lower prices and longer contexts, with naming so confusing the launch post had to include a 'which model should I use' diagram.

MONDAY, 14 APRIL 2025 By The AI Desk

OpenAI releases GPT-4.1 and the model-naming chapter starts to look ridiculous

On 14 April 2025, OpenAI released GPT-4.1, GPT-4.1 mini and GPT-4.1 nano, a three-tier API-only update to the GPT-4-class line-up. The headline features were a one-million-token context window, lower prices across all three sizes, and improved coding-benchmark numbers, including a step up to fifty-five per cent on SWE-bench Verified for the full GPT-4.1.

What the release also produced, almost immediately, was naming bewilderment. As of mid-April 2025, OpenAI's model menu included GPT-3.5 (legacy), GPT-4 (legacy), GPT-4 Turbo, GPT-4o, GPT-4o-mini, GPT-4.1, GPT-4.1 mini, GPT-4.1 nano, o1-preview, o1, o1-mini, o1-pro and o3 across various tiers. Sam Altman, in a same-day post, acknowledged the issue and announced that GPT-5, when it shipped, would unify the line-up and retire most of these names. The unification took until August.

What GPT-4.1 actually offered

GPT-4.1 family, at launch

Context window	1,000,000 tokens
GPT-4.1 input	USD 2 per 1M tokens
GPT-4.1 mini input	USD 0.40 per 1M tokens
GPT-4.1 nano input	USD 0.10 per 1M tokens
API-only at launch	yes

The substantive interest in the release was not the capability frontier. It was the price-per-token and the long-context window. As Wired and Ars Technica wrote in coverage that week, the GPT-4.1 mini at forty cents per million input tokens was, on most benchmarks, comparable to GPT-4o at two-fifty per million. The price-performance line had moved meaningfully in the eleven months since GPT-4o.

The naming was the controversy. The pricing was the news.

The retrospective view of GPT-4.1, four months later when GPT-5 shipped, is that it was a price-cut and quota-bump release misframed as a generational change by its naming. The four hundred-thousand-token to one-million-token context jump was the most operationally consequential improvement, and one that landed quietly at a moment when the field's attention was on reasoning.

Originally reported by OpenAI (OpenAI) on 14 April 2025. Read the original report →

← Previous

Anthropic introduces Claude 3.7 Sonnet and reasoning stops being a separate product tier

Anthropic ships Claude 4 Opus and Sonnet, and the coding-benchmark frontier moves a non-trivial amount

OpenAI releases GPT-4.1 and the model-naming chapter starts to look ridiculous

What GPT-4.1 actually offered

Discussion

The AI Desk, in your inbox.

More from PRODUCTS