Vibe Check

Taste-testing new models.

Apr 16, 2025

Vibe Check: o3 Is Here—And It’s Great

The highest praise I can give is that I’m already using it all the time

Dec 10, 2024

Vibe Check: OpenAI’s Sora

The text-to-video model is finally available

Jan 23, 2025

We Tried OpenAI’s New Agent—Here’s What We Found

Operator (Could you help me do this task?)

Aug 12, 2025

Vibe Check: Claude Sonnet 4 Now Has a 1-million Token Context Window

Fast, reliable long-context responses—for a price

Aug 5, 2025

Vibe Check: OpenAI Drops Two New Open-weight Models

OpenAI President Greg Brockman: ‘The team cooked with this one’

May 16, 2025

Vibe Check: Codex—OpenAI’s New Coding Agent

Our hands-on day-0 review of the new autonomous software engineer

May 22, 2025

Vibe Check: Claude 4 Opus

Anthropic’s new model crushes pull requests, research deep dives, and honest editing—yet o3 keeps the daily-driver crown

Apr 18, 2025

Vibe Check: OpenAI’s o3, GPT-4.1, and o4-mini

Our take on what’s powerful, what’s practical, and what’s still TBD

Jul 17, 2025

Vibe Check: OpenAI Enters the Browser Wars With ChatGPT Agent

It’s launching today! Here’s our day-zero, hands-on report.

Mar 26, 2025

Vibe Check: OpenAI’s GPT-4o Image Generation

'Finally, native images in ChatGPT!'

May 9, 2025

Vibe Check: Gemini 2.5 Pro and Gemini 2.5 Flash

Why Google might quietly win the race to be AI’s top backend provider

Jul 31, 2025

Vibe Check: Claude’s New Agents Are Confusing as Hell—And We Love Them

We spawned AI agents like crazy. Then we tried to work with them.

Feb 3, 2025

We Tried OpenAI’s New Deep Research—Here’s What We Found

Vibe check: It’s awesome.

Aug 8, 2025

Vibe Check: Genie 3, Claude 4.1, GPT-oss, and GPT-5

Four model launches, four ideas about where AI goes next

Mar 8, 2025

Vibe Check: Claude 3.7 Sonnet and Claude Code

All about the newest tools from Anthropic

Sep 29, 2025

Vibe Check: Claude Sonnet 4.5

Faster than GPT-5 Codex, smarter and more steerable than Opus 4.1

Oct 6, 2025

Vibe Check: OpenAI DevDay 2025

Apps, agents, and API updates—but where's the vision that makes you dream?

Oct 20, 2025

Vibe Check: Claude Code Now Works on Mobile and the Web

Anthropic’s coding agent promises work from anywhere. After a weekend of testing, it still feels very beta.

Oct 21, 2025

Vibe Check: OpenAI’s New AI Browser, Atlas

It feels less like learning something new than a browser that has caught up to how we already want to work with AI

Jun 23, 2025

o3-pro Vibe Check—A Slow, Steady Last Resort

OpenAI’s latest model trades speed for occasional brilliance—when nothing else works, it might

Oct 30, 2025

Vibe Check: I Canceled Two AI Max Plans for Factory’s Coding Agent Droid

The one that keeps me in flow across Anthropic and OpenAI’s models—without switching tools

Sep 15, 2025

Vibe Check: GPT-5 Codex Can Code for 35 Minutes Straight—If You Ask Nicely

It launches today—here’s our day-zero vibe check

Jul 18, 2025

Vibe Check: Grok 4 Aced Its Exams. The Real World Is a Different Story.

The smartest model isn’t always the most useful one

Oct 29, 2025

Vibe Check: Cursor 2.0 and Composer 1 Alpha

Two new things: A code editor designed to manage agents and a lightning-fast model