Vibe Check

Taste-testing new models.

Popular Newest Oldest

Dec 10, 2024

Vibe Check: OpenAI’s Sora

The text-to-video model is finally available

Apr 16, 2025

Vibe Check: o3 Is Here—And It’s Great

The highest praise I can give is that I’m already using it all the time

Aug 12, 2025

Vibe Check: Claude Sonnet 4 Now Has a 1-million Token Context Window

Fast, reliable long-context responses—for a price

Jan 23, 2025

We Tried OpenAI’s New Agent—Here’s What We Found

Operator (Could you help me do this task?)

May 16, 2025

Vibe Check: Codex—OpenAI’s New Coding Agent

Our hands-on day-0 review of the new autonomous software engineer

Jul 17, 2025

Vibe Check: OpenAI Enters the Browser Wars With ChatGPT Agent

It’s launching today! Here’s our day-zero, hands-on report.

Aug 5, 2025

Vibe Check: OpenAI Drops Two New Open-weight Models

OpenAI President Greg Brockman: ‘The team cooked with this one’

Mar 26, 2025

Vibe Check: OpenAI’s GPT-4o Image Generation

'Finally, native images in ChatGPT!'

Jul 31, 2025

Vibe Check: Claude’s New Agents Are Confusing as Hell—And We Love Them

We spawned AI agents like crazy. Then we tried to work with them.

Apr 18, 2025

Vibe Check: OpenAI’s o3, GPT-4.1, and o4-mini

Our take on what’s powerful, what’s practical, and what’s still TBD

Nov 3, 2025

Vibe Check: Claude Skills Need a ‘Share’ Button

The feature is powerful for individuals and tricky for teams—but it does lighten the cognitive load

May 22, 2025

Vibe Check: Claude 4 Opus

Anthropic’s new model crushes pull requests, research deep dives, and honest editing—yet o3 keeps the daily-driver crown

Feb 3, 2025

We Tried OpenAI’s New Deep Research—Here’s What We Found

Vibe check: It’s awesome.

Oct 6, 2025

Vibe Check: OpenAI DevDay 2025

Apps, agents, and API updates—but where's the vision that makes you dream?

Oct 20, 2025

Vibe Check: Claude Code Now Works on Mobile and the Web

Anthropic’s coding agent promises work from anywhere. After a weekend of testing, it still feels very beta.

Aug 8, 2025

Vibe Check: Genie 3, Claude 4.1, GPT-oss, and GPT-5

Four model launches, four ideas about where AI goes next

Oct 30, 2025

Vibe Check: I Canceled Two AI Max Plans for Factory’s Coding Agent Droid

The one that keeps me in flow across Anthropic and OpenAI’s models—without switching tools

Jun 23, 2025

o3-pro Vibe Check—A Slow, Steady Last Resort

OpenAI’s latest model trades speed for occasional brilliance—when nothing else works, it might

Oct 29, 2025

Vibe Check: Cursor 2.0 and Composer 1 Alpha

Two new things: A code editor designed to manage agents and a lightning-fast model

Nov 24, 2025

Vibe Check: Opus 4.5 Is the Coding Model We’ve Been Waiting For

But it’s not perfect—it failed our editing test

Oct 21, 2025

Vibe Check: OpenAI’s New AI Browser, Atlas

It feels less like learning something new than a browser that has caught up to how we already want to work with AI

Mar 8, 2025

Vibe Check: Claude 3.7 Sonnet and Claude Code

All about the newest tools from Anthropic

Sep 29, 2025

Vibe Check: Claude Sonnet 4.5

Faster than GPT-5 Codex, smarter and more steerable than Opus 4.1

Oct 23, 2025

We Tested Claude Sonnet 4.5 for Writing and Editing

Five tests across blind comparisons, editorial standards, and deadlines—here's what changed our setup