LLM reviews · 20 models

Large language models, compared

Every major LLM that working professionals are evaluating: Claude (Opus, Sonnet, Haiku), GPT-5 / GPT-4o, Gemini 2.5, Llama 4, Mistral Large 2, Grok 4. Each model gets a profession-aware review.

Anthropic · 5 models

Claude Freemium · $20/mo

Anthropic's assistant, known for long-context reasoning, careful writing, and safety-aware outputs. Strong on analysis, coding, and document-heavy work. Projects feature lets you l

Rating: 4.0

Claude Haiku 4.5 Paid · API: $1/M tokens

Anthropic's small / fast model — designed for high-volume, latency-sensitive tasks where Opus is overkill. Strong fit for classification, extraction, and short-form generation at s

Rating: 4.0

Claude Opus 4.6 Paid · API: $15/M tokens

Claude Opus 4.6 is a general-purpose AI assistant from Anthropic. Paid only; API pricing from $15/M tokens.

Rating: 4.0

Claude Opus 4.7 Paid · API: $15/M tokens

Anthropic's flagship model as of 2026 — strongest performance in the Claude family on long-context reasoning, code generation, and agentic tool use. Replaces Opus 4.6 as the defaul

Rating: 4.0

Claude Sonnet 4.6 Paid · API: $3/M tokens

Claude Sonnet 4.6 is a general-purpose AI assistant from Anthropic. Paid only; API pricing from $3/M tokens.

Rating: 4.0

OpenAI · 4 models

ChatGPT Freemium · $20/mo

OpenAI's flagship conversational AI. Used by professionals across every category for drafting, summarization, structured thinking, and as a Swiss-army-knife reasoning assistant. Pr

Rating: 4.0

GPT-4o Paid · API: $5/M tokens

GPT-4o is a general-purpose AI assistant from OpenAI. Paid only; API pricing from $5/M tokens.

Rating: 4.0

GPT-5 Paid · API: from $5/M tokens

OpenAI's headline model for 2025–2026. Multi-modal native (text, image, audio in/out), strong on reasoning and code. Default for ChatGPT Plus / Team subscribers. Pricing tiered wit

Rating: 4.0

o1 Paid · API: $15/M tokens

o1 is a general-purpose AI assistant from OpenAI. Paid only; API pricing from $15/M tokens.

Rating: 4.0

Google · 3 models

Gemini Freemium · $20/mo

Google's multimodal assistant, deeply integrated into Workspace (Docs, Gmail, Sheets, Meet). Strong choice for organizations already on Google's stack. Advanced tier (Gemini Advanc

Rating: 4.0

Gemini 2.5 Flash Paid · API: $0.075/M tokens

Gemini 2.5 Flash is a general-purpose AI assistant from Google. Paid only; API pricing from $0.075/M tokens.

Rating: 4.0

Gemini 2.5 Pro Paid · API: $1.25/M tokens

Google DeepMind's flagship Gemini model. Best-in-class context window (1M+ tokens), native multimodal across text/image/audio/video, deep Google Workspace and Search grounding. Str

Rating: 4.0

Meta · 1 model

Mistral AI

Mistral Large 2 Paid · API: $2/M tokens

Mistral AI's frontier model — European alternative to US-based frontier LLMs. Strong on multilingual European-language work, competitive English performance, available through Mist

Rating: 4.0

Mistral Le Chat Freemium · Free

Mistral Le Chat is a general-purpose AI assistant from Mistral AI. Free tier available.

Rating: 4.0

xAI · 1 model

Grok 4 Paid · $30/mo

xAI's flagship model — integrated with X (Twitter) for real-time data access. Strong on current-events queries, less guardrail-restrictive than peer models. Available via X.com Pre

Rating: 4.0

Perplexity · 1 model

Perplexity Freemium · $20/mo

Search-first AI — every answer comes with citations. Pro tier gives access to GPT-4 / Claude / Sonar models and Pro Search (multi-step research). Replaces a lot of Google searches

Rating: 4.0

Quora · 1 model

Poe Freemium · $20/mo

Poe is a general-purpose AI assistant from Quora. Free tier available; paid plans from $20/mo.

Rating: 4.0

Monica · 1 model

Monica Freemium · $8.30/mo

Monica is a general-purpose AI assistant from Monica. Free tier available; paid plans from $8.30/mo.

Rating: 4.0

You.com · 1 model

You.com Freemium · $15/mo

You.com is a general-purpose AI assistant from You.com. Free tier available; paid plans from $15/mo.

Rating: 4.0

How we evaluate LLMs

Most LLM comparisons are abstract — "GPT-5 scored 87.4 on benchmark X." That tells working professionals very little about whether a model fits their day. Our LLM reviews focus on professional fit: what kind of work each model handles best, where it falls down, and which profession pages on this site recommend it.

Active LLMs are re-reviewed every 60 days (vs. 90 days for general AI tools) because model behavior changes faster. Major version bumps trigger an immediate re-review.
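As a rough illustration of that cadence, the scheduling rule could be sketched as below. The function name, the parameters, and the decision to count from the last review date are all assumptions made for this example; they do not describe the site's actual tooling.

```python
from datetime import date, timedelta
from typing import Optional

# Cadences taken from the text above; the constant names are hypothetical.
LLM_CADENCE_DAYS = 60
GENERAL_TOOL_CADENCE_DAYS = 90


def next_review_date(
    last_review: date,
    is_llm: bool,
    major_version_bump: bool = False,
    today: Optional[date] = None,
) -> date:
    """Return the date on which a tool is next due for re-review."""
    if major_version_bump:
        # A major version bump triggers an immediate re-review.
        return today or date.today()
    days = LLM_CADENCE_DAYS if is_llm else GENERAL_TOOL_CADENCE_DAYS
    return last_review + timedelta(days=days)
```

Under these assumptions, an LLM last reviewed on 1 January 2026 would come due on 2 March 2026, while a general AI tool reviewed the same day would come due on 1 April 2026.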

Comparisons coming in Phase 2

Common comparisons get their own pages: /compare/claude-opus-4-7-vs-gpt-5/, /compare/claude-opus-vs-claude-haiku/, /compare/gemini-2-5-pro-vs-claude-opus-4-7/. Both within-vendor and cross-vendor comparisons live under one flat /compare/ URL pattern so they match user search queries directly.
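One way to derive URLs in that flat pattern from model names is sketched below. The helper names `slugify` and `compare_url` are hypothetical, and the slug rule (lowercase, with every run of non-alphanumeric characters collapsed to a hyphen) is an assumption inferred from the example paths above rather than a documented rule.

```python
import re


def slugify(model_name: str) -> str:
    """Lowercase the name and collapse non-alphanumeric runs into hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", model_name.lower()).strip("-")


def compare_url(model_a: str, model_b: str) -> str:
    """Build a flat /compare/ path, keeping the models in the order given."""
    return f"/compare/{slugify(model_a)}-vs-{slugify(model_b)}/"


print(compare_url("Claude Opus 4.7", "GPT-5"))
print(compare_url("Gemini 2.5 Pro", "Claude Opus 4.7"))
```

The sketch keeps the two models in the order they are passed, since the example paths do not appear to follow an alphabetical ordering.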