Compare leading models on capability and cost efficiency. This public page is optimized for teams searching for current model trade-offs.
Weekly refreshed public benchmark (static fallback mode if API unavailable).
| Provider | Model | Capability | Cost efficiency |
|---|---|---|---|
| OpenAI | gpt-4.1-mini | 86/100 | 72/100 |
| Anthropic | claude-3.5-haiku | 82/100 | 91/100 |
| gemini-2.0-flash | 79/100 | 88/100 | |
| Groq | llama-3.3-70b | 75/100 | 94/100 |