Matching models: 16. Rows visible after provider and capability filters.
Provider scope: ALL. Current provider slice for this table view.
Pricing flags: Any cache · Any batch. Use these to narrow the list to operationally usable models.
Sort logic: Default order. Choose the cost signal that matches the workload you are pricing.

Prompt-heavy workloads usually start with cheapest input. Generation-heavy workloads usually care more about cheapest output or blended price.
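
As a rough illustration of the three cost signals, the sketch below sorts a few rows by input, output, or blended price. The page does not state how the blended figure is computed. The 3:1 input-to-output weighting used here is an assumption, chosen because it reproduces the Blended / 1M values listed for the priced rows (for example, 0.75 × $0.80 + 0.25 × $4.00 = $1.60 for Claude Haiku 3.5). The names `blended_price`, `sort_models`, and `SORT_KEYS` are hypothetical.

```python
# Assumed weighting: 3 parts input to 1 part output. The page does not publish
# its formula; this ratio simply matches the Blended / 1M values it lists.
INPUT_WEIGHT = 0.75
OUTPUT_WEIGHT = 0.25


def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Weighted average of input and output price, in dollars per 1M tokens."""
    return INPUT_WEIGHT * input_per_m + OUTPUT_WEIGHT * output_per_m


# (model id, input $/1M, output $/1M), copied from three priced rows.
PRICES = [
    ("claude-haiku-3.5", 0.80, 4.00),
    ("gemini-2.5-pro", 1.25, 10.00),
    ("gemini-2.5-flash", 0.30, 2.50),
]

# One sort key per cost signal (hypothetical names).
SORT_KEYS = {
    "input": lambda row: row[1],
    "output": lambda row: row[2],
    "blended": lambda row: blended_price(row[1], row[2]),
}


def sort_models(rows, signal):
    """Return rows ordered from cheapest to most expensive for one signal."""
    return sorted(rows, key=SORT_KEYS[signal])


if __name__ == "__main__":
    for model_id, inp, out in sort_models(PRICES, "blended"):
        print(f"{model_id}: ${blended_price(inp, out):.4f} blended / 1M")
```

A prompt-heavy workload would call `sort_models(PRICES, "input")`; a generation-heavy one would pass `"output"` or `"blended"`.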

| Model | ID | Provider | Input / 1M | Output / 1M | Blended / 1M | Cache | Batch | Modality | Context (tokens) | Updated |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Claude Haiku 3.5 | claude-haiku-3.5 | Anthropic | $0.800000 | $4.000000 | $1.6000 | Cached | No batch | text | 200000 | 2026-03-30 02:54 |
| Claude Haiku 4.5 | claude-haiku-4.5 | Anthropic | $1.000000 | $5.000000 | $2.0000 | Cached | No batch | multimodal | 200000 | 2026-03-30 02:54 |
| Claude Opus 4.6 | claude-opus-4.6 | Anthropic | $5.000000 | $25.000000 | $10.0000 | Cached | No batch | multimodal | 200000 | 2026-03-30 02:54 |
| Claude Sonnet 4.6 | claude-sonnet-4.6 | Anthropic | $3.000000 | $15.000000 | $6.0000 | Cached | No batch | multimodal | 200000 | 2026-03-30 02:54 |
| Gemini 2.5 Flash | gemini-2.5-flash | Gemini | $0.300000 | $2.500000 | $0.8500 | Cached | Batch | multimodal | 1048576 | 2026-03-30 02:54 |
| Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite | Gemini | $0.100000 | $0.400000 | $0.1750 | Cached | Batch | multimodal | 1048576 | 2026-03-30 02:54 |
| Gemini 2.5 Pro | gemini-2.5-pro | Gemini | $1.250000 | $10.000000 | $3.4375 | Cached | Batch | multimodal | 1048576 | 2026-03-30 02:54 |
| GPT-4.1 | gpt-4.1 | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 1048576 | N/A |
| GPT-4.1 Mini | gpt-4.1-mini | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 1048576 | N/A |
| GPT-4.1 Nano | gpt-4.1-nano | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 1048576 | N/A |
| GPT-4o | gpt-4o | OpenAI | N/A | N/A | N/A | No cache | No batch | multimodal | 128000 | N/A |
| GPT-4o Mini | gpt-4o-mini | OpenAI | N/A | N/A | N/A | No cache | No batch | multimodal | 128000 | N/A |
| GPT-5 | gpt-5 | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 400000 | N/A |
| GPT-5 Mini | gpt-5-mini | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 400000 | N/A |
| GPT-5 Nano | gpt-5-nano | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 400000 | N/A |
| GPT-5.4 | gpt-5.4 | OpenAI | N/A | N/A | N/A | No cache | No batch | text | 400000 | N/A |

When to use the list

The list is best for scanning the market, checking freshness, and eliminating clearly expensive options before you do workload-specific math.
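
One way to picture that scanning pass is a small filter that drops rows with no published price, rows whose data is stale, and rows above a budget ceiling. This is only a sketch: the `Row` type, the `shortlist` function, the 30-day freshness window, and the $5.00 ceiling are all hypothetical choices, not settings taken from the page.

```python
from datetime import datetime, timedelta
from typing import List, NamedTuple, Optional


class Row(NamedTuple):
    model_id: str
    blended_per_m: Optional[float]  # None where the list shows N/A
    updated: Optional[datetime]     # None where the list shows N/A


# Three rows copied from the list; the N/A fields become None.
ROWS = [
    Row("claude-opus-4.6", 10.0, datetime(2026, 3, 30, 2, 54)),
    Row("gemini-2.5-flash-lite", 0.175, datetime(2026, 3, 30, 2, 54)),
    Row("gpt-5", None, None),
]


def shortlist(rows: List[Row], max_blended: float, as_of: datetime,
              max_age_days: int = 30) -> List[Row]:
    """Keep rows that are priced, fresh enough, and within the budget ceiling."""
    cutoff = as_of - timedelta(days=max_age_days)
    kept = []
    for row in rows:
        if row.blended_per_m is None or row.updated is None:
            continue  # no published price or timestamp: cannot be compared
        if row.updated < cutoff:
            continue  # stale pricing data
        if row.blended_per_m > max_blended:
            continue  # clearly too expensive for this budget
        kept.append(row)
    return kept


if __name__ == "__main__":
    for row in shortlist(ROWS, max_blended=5.0, as_of=datetime(2026, 4, 1)):
        print(row.model_id, row.blended_per_m)
```

Passing `as_of` explicitly, instead of reading the clock inside the function, keeps the freshness check reproducible.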

When to open compare

Use compare after the shortlist is small and the workload is fixed. That gives a cleaner decision than sorting one price field in isolation.
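
The workload-specific math behind such a comparison can be sketched as follows: multiply the expected input and output token volumes by each model's per-1M prices and rank the totals. The function names, the three-model shortlist, and the token volumes are illustrative assumptions; the prices are copied from the list.

```python
from typing import Dict, List, Tuple

# model id -> (input $/1M, output $/1M), copied from the list.
SHORTLIST: Dict[str, Tuple[float, float]] = {
    "claude-haiku-4.5": (1.00, 5.00),
    "gemini-2.5-pro": (1.25, 10.00),
    "claude-sonnet-4.6": (3.00, 15.00),
}


def workload_cost(input_tokens: int, output_tokens: int,
                  input_per_m: float, output_per_m: float) -> float:
    """Dollar cost of one workload at the given per-1M-token prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def compare(shortlist: Dict[str, Tuple[float, float]],
            input_tokens: int, output_tokens: int) -> List[Tuple[str, float]]:
    """Return (model id, cost) pairs for a fixed workload, cheapest first."""
    costs = {
        model_id: workload_cost(input_tokens, output_tokens, inp, out)
        for model_id, (inp, out) in shortlist.items()
    }
    return sorted(costs.items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for model_id, cost in compare(SHORTLIST, 2_000_000, 500_000):
        print(f"{model_id}: ${cost:.2f}")
```

For this hypothetical workload the totals come to $4.50, $7.50, and $13.50. Changing the input-to-output ratio changes the gaps between models, which is why the workload should be fixed before comparing.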

When to open detail

Open the model detail page when you need source links, pricing history, context limits, or a quick reference workload estimate before committing to a choice.