Skip to content

Peter Yang has been doing some amazing experiments with gen AI tools. There are so many models out there now, so I appreciate him going through and making this post and video.

I made a video testing Claude 4, ChatGPT O3, and Gemini 2.5 head-to-head for coding, writing, deep research, multimodal and more. What I found was that the “best” model depends on what you’re trying to do.

Here’s a handy chart to whet your appetite.

Comparison chart of popular AI tools (ChatGPT, Claude, Gemini, Grok, Perplexity) showing their capabilities across categories like writing, coding, reasoning, web search, and image/video generation, with icons indicating best performance (star), available (check), or unavailable (X). Updated June 2025.

Subscribe for updates

Get design insights in your inbox. Sent weekly (or so).