Products AI news
AI product news covering launches, agents, tools, and the platform moves that matter.

OpenAI releases GPT-5.5 and the reliability gap to Claude 4.7 finally closes
An incremental version-bump in the naming, a non-incremental shift in what the model can finish without supervision. SWE-bench scores at parity, agentic-task durations comparable, multi-modal capabilities broader.

Anthropic ships Claude Design and the AI-augmented design tool gets its first credible incumbent challenger
A design-and-prototyping product built on top of Claude, with vector primitives, component versioning, and an agent that turns design intent into editable artifacts. The product itself is competent. The market it enters is the more interesting story.

Anthropic launches Dispatch and packages agentic AI as a managed workflow service
Long-running, schedulable, observable agentic tasks with retry semantics, audit trails and a human-approval interface. The product is the agentic-AI equivalent of a CI runner.

Anthropic introduces Workflow and the multi-agent operations question gets a first answer
A coordination layer for agentic AI: declarative workflow definitions, agent-to-agent messaging, and a planner. The framing is workflow orchestration. The substance is closer to operating-system primitives.

Google launches Gemini 3 and the perceived capability gap with OpenAI and Anthropic effectively closes
Two model sizes, benchmark scores at parity with GPT-5 and Claude 4, and a serving infrastructure that runs entirely on Google's TPU stack. The technical question is largely settled. The commercial question is not.

OpenAI releases GPT-5.1 with a smarter router and the unified-model strategy starts to settle
Three months after GPT-5 hid the model picker, the 5.1 release tunes the routing and turns up speed and personality. The capability headline numbers are quiet. The default-experience numbers are not.

Anthropic ships Skills, and a new shape for distributing AI capabilities lands
Bundled prompts, scripts and resources Claude can pick up at runtime. The framing is modest. The implications for how AI capabilities are versioned, audited, and shipped are not.

OpenAI launches GPT-5 and the model-name chapter quietly closes
A unified model that hides the routing, retires most of the GPT-4-class line-up, and treats reasoning, multimodality and tool use as standard issue. The release was less a benchmark event than a product cleanup.

Perplexity ships Comet and the agentic browser becomes a category
A Chromium-based browser with an answer engine and an agent that fills forms, books flights, and makes payments. The utility is real. The trust model is the part to watch.

OpenAI releases GPT-4.1 and the model-naming chapter starts to look ridiculous
A three-model release at lower prices and longer contexts, with naming so confusing the launch post had to include a 'which model should I use' diagram.

Anthropic ships Computer Use and agentic AI moves from demo videos to a feature you can invoke
Claude can now move a mouse and click a button. The reliability is not where it needs to be. The threshold for 'real' was crossed regardless.

Anthropic ships Claude 3.5 Sonnet and the working model for serious developers changes
A model upgrade and a feature, called Artifacts, that turned Claude.ai into something more than a chat interface. Both shipped on the same Tuesday.

OpenAI launches GPT-4o and demonstrates real-time voice the day before Google I/O
A spring product update, deliberately timed against Google's annual developer keynote. The voice demo was a year ahead of what most users would actually get.

OpenAI's first DevDay tries to turn ChatGPT into a platform, six days before its CEO is unexpectedly fired
Custom GPTs, the Assistants API, the GPT-4 Turbo price cut, and a roadmap for an app store. The week's events were not what anyone in the keynote room expected.

OpenAI quietly opens ChatGPT to the public, and the internet rearranges itself overnight
A research preview of a chat-tuned GPT-3.5 was put behind a free login on a Wednesday afternoon. By Sunday it had crossed a million users and reshaped the conversation about AI for the rest of the decade.