LLM
Rankings
Rankings
Compare
π
δΈζ
Model Comparison
Select 2-5 models for side-by-side comparison
β β Back to Rankings
Hold Cmd/β to select multiple models
#1 β DeepSeek: DeepSeek V4 Flash ($0.0/M)
#2 β Tencent: Hy3 preview ($0.3/M)
#3 β Anthropic: Claude Opus 4.7 ($30.0/M)
#4 β Anthropic: Claude Sonnet 4.6 ($18.0/M)
#5 β Owl Alpha ($0.0/M)
#6 β DeepSeek: DeepSeek V4 Pro ($0.0/M)
#7 β Xiaomi: MiMo-V2.5 ($0.4/M)
#8 β Xiaomi: MiMo-V2.5-Pro ($1.3/M)
#9 β DeepSeek: DeepSeek V3.2 ($0.6/M)
#10 β Google: Gemini 3 Flash Preview ($3.5/M)
# β AI21: Jamba Large 1.7 ($10.0/M)
# β AionLabs: Aion-1.0 ($12.0/M)
# β AionLabs: Aion-1.0-Mini ($2.1/M)
# β AionLabs: Aion-2.0 ($2.4/M)
# β AionLabs: Aion-RP 1.0 (8B) ($2.4/M)
# β AllenAI: Olmo 3 32B Think ($0.7/M)
# β Amazon: Nova 2 Lite ($2.8/M)
# β Amazon: Nova Lite 1.0 ($0.3/M)
# β Amazon: Nova Micro 1.0 ($0.2/M)
# β Amazon: Nova Premier 1.0 ($15.0/M)
# β Amazon: Nova Pro 1.0 ($4.0/M)
# β Magnum v4 72B ($8.0/M)
# β Anthropic: Claude 3.5 Haiku ($4.8/M)
# β Anthropic: Claude 3 Haiku ($1.5/M)
# β Anthropic: Claude Haiku 4.5 ($6.0/M)
# β Anthropic: Claude Opus 4 ($90.0/M)
# β Anthropic: Claude Opus 4.1 ($90.0/M)
# β Anthropic: Claude Opus 4.5 ($30.0/M)
# β Anthropic: Claude Opus 4.6 ($30.0/M)
# β Anthropic: Claude Opus 4.6 (Fast) ($180.0/M)
# β Anthropic: Claude Opus 4.7 (Fast) ($180.0/M)
# β Anthropic: Claude Opus 4.8 ($30.0/M)
# β Anthropic: Claude Opus 4.8 (Fast) ($60.0/M)
# β Anthropic: Claude Sonnet 4 ($18.0/M)
# β Anthropic: Claude Sonnet 4.5 ($18.0/M)
# β Arcee AI: Coder Large ($1.3/M)
# β Arcee AI: Maestro Reasoning ($4.2/M)
# β Arcee AI: Spotlight ($0.4/M)
# β Arcee AI: Trinity Large Thinking ($1.1/M)
# β Arcee AI: Trinity Mini ($0.2/M)
# β Arcee AI: Virtuoso Large ($1.9/M)
# β Baidu: ERNIE 4.5 300B A47B ($1.4/M)
# β Baidu: ERNIE 4.5 VL 28B A3B ($0.7/M)
# β Baidu: ERNIE 4.5 VL 424B A47B ($1.7/M)
# β ByteDance Seed: Seed 1.6 ($2.3/M)
# β ByteDance Seed: Seed 1.6 Flash ($0.4/M)
# β ByteDance Seed: Seed-2.0-Lite ($2.3/M)
# β ByteDance Seed: Seed-2.0-Mini ($0.5/M)
# β ByteDance: UI-TARS 7B ($0.3/M)
# β Venice: Uncensored (free) ($0.0/M)
# β Cohere: Command A ($12.5/M)
# β Cohere: Command R (08-2024) ($0.8/M)
# β Cohere: Command R+ (08-2024) ($12.5/M)
# β Cohere: Command R7B (12-2024) ($0.2/M)
# β Deep Cogito: Cogito v2.1 671B ($2.5/M)
# β DeepSeek: DeepSeek V3 ($1.1/M)
# β DeepSeek: DeepSeek V3 0324 ($1.0/M)
# β DeepSeek: DeepSeek V3.1 ($1.0/M)
# β DeepSeek: R1 0528 ($2.7/M)
# β DeepSeek: R1 Distill Llama 70B ($1.5/M)
# β DeepSeek: R1 Distill Qwen 32B ($0.6/M)
# β DeepSeek: DeepSeek V3.1 Terminus ($1.2/M)
# β DeepSeek: DeepSeek V3.2 Exp ($0.7/M)
# β DeepSeek: R1 ($0.0/M)
# β EssentialAI: Rnj 1 Instruct ($0.3/M)
# β Google: Gemma 3 27B ($0.2/M)
# β Google: Gemini 2.0 Flash ($0.5/M)
# β Google: Gemini 2.0 Flash Lite ($0.4/M)
# β Google: Gemini 2.5 Flash ($2.8/M)
# β Google: Nano Banana (Gemini 2.5 Flash Image) ($2.8/M)
# β Google: Gemini 2.5 Flash Lite ($0.5/M)
# β Google: Gemini 2.5 Flash Lite Preview 09-2025 ($0.5/M)
# β Google: Gemini 2.5 Pro ($11.3/M)
# β Google: Gemini 2.5 Pro Preview 06-05 ($11.3/M)
# β Google: Gemini 2.5 Pro Preview 05-06 ($11.3/M)
# β Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) ($3.5/M)
# β Google: Gemini 3.1 Flash Lite ($1.8/M)
# β Google: Gemini 3.1 Flash Lite Preview ($1.8/M)
# β Google: Gemini 3.1 Pro Preview ($14.0/M)
# β Google: Gemini 3.1 Pro Preview Custom Tools ($14.0/M)
# β Google: Gemini 3.5 Flash ($10.5/M)
# β Google: Nano Banana Pro (Gemini 3 Pro Image Preview) ($14.0/M)
# β Google: Gemma 2 27B ($1.3/M)
# β Google: Gemma 3 12B ($0.2/M)
# β Google: Gemma 3 4B ($0.1/M)
# β Google: Gemma 3n 4B ($0.2/M)
# β Google: Gemma 4 26B A4B ($0.4/M)
# β Google: Gemma 4 26B A4B (free) ($0.0/M)
# β Google: Gemma 4 31B ($0.5/M)
# β Google: Gemma 4 31B (free) ($0.0/M)
# β Google: Lyria 3 Clip Preview ($0.0/M)
# β Google: Lyria 3 Pro Preview ($0.0/M)
# β OpenAI: GPT-4o ($12.5/M)
# β OpenAI: GPT-4o-mini ($0.8/M)
# β MythoMax 13B ($0.1/M)
# β IBM: Granite 4.0 Micro ($0.1/M)
# β IBM: Granite 4.1 8B ($0.1/M)
# β Inception: Mercury 2 ($1.0/M)
# β inclusionAI: Ling-2.6-1T ($0.7/M)
# β inclusionAI: Ling-2.6-flash ($0.0/M)
# β inclusionAI: Ring-2.6-1T ($2.8/M)
# β Inflection: Inflection 3 Pi ($12.5/M)
# β Inflection: Inflection 3 Productivity ($12.5/M)
# β Kwaipilot: KAT-Coder-Pro V2 ($1.5/M)
# β LiquidAI: LFM2-24B-A2B ($0.1/M)
# β LiquidAI: LFM2.5-1.2B-Instruct (free) ($0.0/M)
# β LiquidAI: LFM2.5-1.2B-Thinking (free) ($0.0/M)
# β Meta: Llama 4 Maverick ($0.8/M)
# β Meta: Llama 4 Scout ($0.4/M)
# β Mancer: Weaver (alpha) ($1.8/M)
# β Meta: Llama 3.1 70B Instruct ($0.8/M)
# β Meta: Llama 3.1 8B Instruct ($0.1/M)
# β Meta: Llama 3.2 11B Vision Instruct ($0.5/M)
# β Meta: Llama 3.2 1B Instruct ($0.2/M)
# β Meta: Llama 3.2 3B Instruct ($0.4/M)
# β Meta: Llama 3.2 3B Instruct (free) ($0.0/M)
# β Meta: Llama 3.3 70B Instruct ($0.4/M)
# β Meta: Llama 3.3 70B Instruct (free) ($0.0/M)
# β Meta: Llama 3 70B Instruct ($1.3/M)
# β Meta: Llama 3 8B Instruct ($0.1/M)
# β Llama Guard 3 8B ($0.5/M)
# β Meta: Llama Guard 4 12B ($0.4/M)
# β Microsoft: Phi 4 ($0.2/M)
# β Microsoft: Phi 4 Mini Instruct ($0.4/M)
# β WizardLM-2 8x22B ($1.2/M)
# β MiniMax: MiniMax-01 ($1.3/M)
# β MiniMax: MiniMax M1 ($2.6/M)
# β MiniMax: MiniMax M2 ($1.3/M)
# β MiniMax: MiniMax M2.1 ($1.2/M)
# β MiniMax: MiniMax M2.5 ($1.3/M)
# β MiniMax: MiniMax M2.7 ($1.5/M)
# β MiniMax: MiniMax M2-her ($1.5/M)
# β Mistral Large ($8.0/M)
# β Mistral: Codestral 2508 ($1.2/M)
# β Mistral: Devstral 2 2512 ($2.4/M)
# β Mistral: Ministral 3 14B 2512 ($0.4/M)
# β Mistral: Ministral 3 3B 2512 ($0.2/M)
# β Mistral: Ministral 3 8B 2512 ($0.3/M)
# β Mistral Large 2407 ($8.0/M)
# β Mistral: Mistral Large 3 2512 ($2.0/M)
# β Mistral: Mistral Medium 3 ($2.4/M)
# β Mistral: Mistral Medium 3.1 ($2.4/M)
# β Mistral: Mistral Medium 3.5 ($9.0/M)
# β Mistral: Mistral Nemo ($0.1/M)
# β Mistral: Saba ($0.8/M)
# β Mistral: Mistral Small 3 ($0.1/M)
# β Mistral: Mistral Small 4 ($0.8/M)
# β Mistral: Mistral Small 3.1 24B ($0.9/M)
# β Mistral: Mistral Small 3.2 24B ($0.3/M)
# β Mistral: Mixtral 8x22B Instruct ($8.0/M)
# β Mistral: Voxtral Small 24B 2507 ($0.4/M)
# β MoonshotAI: Kimi K2 0711 ($2.9/M)
# β MoonshotAI: Kimi K2 0905 ($3.1/M)
# β MoonshotAI: Kimi K2.5 ($2.3/M)
# β MoonshotAI: Kimi K2.6 ($4.1/M)
# β MoonshotAI: Kimi K2.6 (free) ($0.0/M)
# β MoonshotAI: Kimi K2 Thinking ($3.1/M)
# β Morph: Morph V3 Fast ($2.0/M)
# β Morph: Morph V3 Large ($2.8/M)
# β Nex AGI: DeepSeek V3.1 Nex N1 ($0.6/M)
# β NousResearch: Hermes 2 Pro - Llama-3 8B ($0.3/M)
# β Nous: Hermes 3 405B Instruct ($2.0/M)
# β Nous: Hermes 3 405B Instruct (free) ($0.0/M)
# β Nous: Hermes 3 70B Instruct ($0.6/M)
# β Nous: Hermes 4 405B ($4.0/M)
# β Nous: Hermes 4 70B ($0.5/M)
# β NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 ($0.5/M)
# β NVIDIA: Nemotron 3 Nano 30B A3B ($0.2/M)
# β NVIDIA: Nemotron 3 Nano 30B A3B (free) ($0.0/M)
# β NVIDIA: Nemotron 3 Nano Omni (free) ($0.0/M)
# β NVIDIA: Nemotron 3 Super ($0.5/M)
# β NVIDIA: Nemotron 3 Super (free) ($0.0/M)
# β NVIDIA: Nemotron Nano 12B 2 VL (free) ($0.0/M)
# β NVIDIA: Nemotron Nano 9B V2 ($0.2/M)
# β NVIDIA: Nemotron Nano 9B V2 (free) ($0.0/M)
# β OpenAI: o3 Mini ($5.5/M)
# β OpenAI: GPT-3.5 Turbo ($2.0/M)
# β OpenAI: GPT-3.5 Turbo (older v0613) ($3.0/M)
# β OpenAI: GPT-3.5 Turbo 16k ($7.0/M)
# β OpenAI: GPT-3.5 Turbo Instruct ($3.5/M)
# β OpenAI: GPT-4 ($90.0/M)
# β OpenAI: GPT-4 (older v0314) ($90.0/M)
# β OpenAI: GPT-4.1 ($10.0/M)
# β OpenAI: GPT-4.1 Mini ($2.0/M)
# β OpenAI: GPT-4.1 Nano ($0.5/M)
# β OpenAI: GPT-4 Turbo (older v1106) ($40.0/M)
# β OpenAI: GPT-4 Turbo ($40.0/M)
# β OpenAI: GPT-4 Turbo Preview ($40.0/M)
# β OpenAI: GPT-4o (2024-05-13) ($20.0/M)
# β OpenAI: GPT-4o (2024-08-06) ($12.5/M)
# β OpenAI: GPT-4o (2024-11-20) ($12.5/M)
# β OpenAI: GPT-4o-mini (2024-07-18) ($0.8/M)
# β OpenAI: GPT-4o-mini Search Preview ($0.8/M)
# β OpenAI: GPT-4o Search Preview ($12.5/M)
# β OpenAI: GPT-5 ($11.3/M)
# β OpenAI: GPT-5.1 ($11.3/M)
# β OpenAI: GPT-5.1 Chat ($11.3/M)
# β OpenAI: GPT-5.1-Codex ($11.3/M)
# β OpenAI: GPT-5.1-Codex-Max ($11.3/M)
# β OpenAI: GPT-5.1-Codex-Mini ($2.3/M)
# β OpenAI: GPT-5.2 ($15.8/M)
# β OpenAI: GPT-5.2 Chat ($15.8/M)
# β OpenAI: GPT-5.2-Codex ($15.8/M)
# β OpenAI: GPT-5.2 Pro ($189.0/M)
# β OpenAI: GPT-5.3 Chat ($15.8/M)
# β OpenAI: GPT-5.3-Codex ($15.8/M)
# β OpenAI: GPT-5.4 ($17.5/M)
# β OpenAI: GPT-5.4 Image 2 ($23.0/M)
# β OpenAI: GPT-5.4 Mini ($5.3/M)
# β OpenAI: GPT-5.4 Nano ($1.4/M)
# β OpenAI: GPT-5.4 Pro ($210.0/M)
# β OpenAI: GPT-5.5 ($35.0/M)
# β OpenAI: GPT-5.5 Pro ($210.0/M)
# β OpenAI: GPT-5 Chat ($11.3/M)
# β OpenAI: GPT-5 Codex ($11.3/M)
# β OpenAI: GPT-5 Image ($20.0/M)
# β OpenAI: GPT-5 Image Mini ($4.5/M)
# β OpenAI: GPT-5 Mini ($2.3/M)
# β OpenAI: GPT-5 Nano ($0.4/M)
# β OpenAI: GPT-5 Pro ($135.0/M)
# β OpenAI: GPT Audio ($12.5/M)
# β OpenAI: GPT Audio Mini ($3.0/M)
# β OpenAI: GPT Chat Latest ($35.0/M)
# β OpenAI: gpt-oss-120b ($0.2/M)
# β OpenAI: gpt-oss-120b (free) ($0.0/M)
# β OpenAI: gpt-oss-20b ($0.2/M)
# β OpenAI: gpt-oss-20b (free) ($0.0/M)
# β OpenAI: gpt-oss-safeguard-20b ($0.4/M)
# β OpenAI: o1 ($75.0/M)
# β OpenAI: o1-pro ($750.0/M)
# β OpenAI: o3 ($10.0/M)
# β OpenAI: o3 Deep Research ($50.0/M)
# β OpenAI: o3 Mini High ($5.5/M)
# β OpenAI: o3 Pro ($100.0/M)
# β OpenAI: o4 Mini ($5.5/M)
# β OpenAI: o4 Mini Deep Research ($10.0/M)
# β OpenAI: o4 Mini High ($5.5/M)
# β Auto Router ($-2000000.0/M)
# β Body Builder (beta) ($-2000000.0/M)
# β Free Models Router ($0.0/M)
# β Pareto Code Router ($-2000000.0/M)
# β Perceptron: Perceptron Mk1 ($1.6/M)
# β Perplexity: Sonar ($2.0/M)
# β Perplexity: Sonar Deep Research ($10.0/M)
# β Perplexity: Sonar Pro ($18.0/M)
# β Perplexity: Sonar Pro Search ($18.0/M)
# β Perplexity: Sonar Reasoning Pro ($10.0/M)
# β Poolside: Laguna M.1 (free) ($0.0/M)
# β Poolside: Laguna XS.2 (free) ($0.0/M)
# β Prime Intellect: INTELLECT-3 ($1.3/M)
# β Qwen2.5 72B Instruct ($0.8/M)
# β Qwen: Qwen2.5 7B Instruct ($0.1/M)
# β Qwen2.5 Coder 32B Instruct ($1.7/M)
# β Qwen: Qwen-Plus ($1.0/M)
# β Qwen: Qwen Plus 0728 ($1.0/M)
# β Qwen: Qwen Plus 0728 (thinking) ($1.0/M)
# β Qwen: Qwen2.5 VL 72B Instruct ($1.0/M)
# β Qwen: Qwen3 14B ($0.0/M)
# β Qwen: Qwen3 235B A22B ($2.3/M)
# β Qwen: Qwen3 235B A22B Instruct 2507 ($0.2/M)
# β Qwen: Qwen3 235B A22B Thinking 2507 ($0.2/M)
# β Qwen: Qwen3 30B A3B ($0.5/M)
# β Qwen: Qwen3 30B A3B Instruct 2507 ($0.0/M)
# β Qwen: Qwen3 30B A3B Thinking 2507 ($0.5/M)
# β Qwen: Qwen3 32B ($0.0/M)
# β Qwen: Qwen3.5-122B-A10B ($0.0/M)
# β Qwen: Qwen3.5-27B ($0.0/M)
# β Qwen: Qwen3.5-35B-A3B ($0.0/M)
# β Qwen: Qwen3.5 397B A17B ($0.0/M)
# β Qwen: Qwen3.5-9B ($0.0/M)
# β Qwen: Qwen3.5-Flash ($0.3/M)
# β Qwen: Qwen3.5 Plus 2026-02-15 ($1.8/M)
# β Qwen: Qwen3.5 Plus 2026-04-20 ($2.1/M)
# β Qwen: Qwen3.6 27B ($0.0/M)
# β Qwen: Qwen3.6 35B A3B ($0.0/M)
# β Qwen: Qwen3.6 Flash ($1.3/M)
# β Qwen: Qwen3.6 Max Preview ($7.3/M)
# β Qwen: Qwen3.6 Plus ($2.3/M)
# β Qwen: Qwen3.7 Max ($5.0/M)
# β Qwen: Qwen3 8B ($0.0/M)
# β Qwen: Qwen3 Coder 30B A3B Instruct ($0.0/M)
# β Qwen: Qwen3 Coder Flash ($1.2/M)
# β Qwen: Qwen3 Coder 480B A35B (free) ($0.0/M)
# β Qwen: Qwen3 Coder Next ($0.9/M)
# β Qwen: Qwen3 Coder Plus ($3.9/M)
# β Qwen: Qwen3 Max ($4.7/M)
# β Qwen: Qwen3 Max Thinking ($4.7/M)
# β Qwen: Qwen3 Next 80B A3B Instruct ($1.2/M)
# β Qwen: Qwen3 Next 80B A3B Instruct (free) ($0.0/M)
# β Qwen: Qwen3 Next 80B A3B Thinking ($0.9/M)
# β Qwen: Qwen3 VL 235B A22B Instruct ($1.1/M)
# β Qwen: Qwen3 VL 235B A22B Thinking ($2.9/M)
# β Qwen: Qwen3 VL 30B A3B Instruct ($0.0/M)
# β Qwen: Qwen3 VL 30B A3B Thinking ($0.0/M)
# β Qwen: Qwen3 VL 32B Instruct ($0.0/M)
# β Qwen: Qwen3 VL 8B Instruct ($0.0/M)
# β Qwen: Qwen3 VL 8B Thinking ($0.0/M)
# β Qwen: Qwen3 Coder 480B A35B ($2.0/M)
# β Reka Edge ($0.2/M)
# β Reka Flash 3 ($0.3/M)
# β Relace: Relace Apply 3 ($2.1/M)
# β Relace: Relace Search ($4.0/M)
# β Sao10K: Llama 3.1 70B Hanami x1 ($6.0/M)
# β Sao10K: Llama 3.1 Euryale 70B v2.2 ($1.7/M)
# β Sao10K: Llama 3.3 Euryale 70B ($1.4/M)
# β Sao10k: Llama 3 Euryale 70B v2.1 ($3.0/M)
# β Sao10K: Llama 3 8B Lunaris ($0.1/M)
# β StepFun: Step 3.5 Flash ($0.4/M)
# β StepFun: Step 3.7 Flash ($1.3/M)
# β Switchpoint Router ($4.3/M)
# β Tencent: Hunyuan A13B Instruct ($0.0/M)
# β TheDrummer: Cydonia 24B V4.1 ($0.8/M)
# β TheDrummer: Rocinante 12B ($0.6/M)
# β TheDrummer: Skyfall 36B V2 ($1.4/M)
# β TheDrummer: UnslopNemo 12B ($0.8/M)
# β ReMM SLERP 13B ($1.1/M)
# β Writer: Palmyra X5 ($6.6/M)
# β xAI: Grok 4.20 ($3.8/M)
# β xAI: Grok 4.20 Multi-Agent ($8.0/M)
# β xAI: Grok 4.3 ($3.8/M)
# β xAI: Grok Build 0.1 ($3.0/M)
# β Xiaomi: MiMo-V2-Flash ($0.4/M)
# β Z.ai: GLM 4 32B ($0.2/M)
# β Z.ai: GLM 4.5 ($2.8/M)
# β Z.ai: GLM 4.5 Air ($1.0/M)
# β Z.ai: GLM 4.5 Air (free) ($0.0/M)
# β Z.ai: GLM 4.5V ($2.4/M)
# β Z.ai: GLM 4.6 ($2.2/M)
# β Z.ai: GLM 4.6V ($1.2/M)
# β Z.ai: GLM 4.7 ($2.1/M)
# β Z.ai: GLM 4.7 Flash ($0.5/M)
# β Z.ai: GLM 5 ($2.5/M)
# β Z.ai: GLM 5.1 ($4.1/M)
# β Z.ai: GLM 5 Turbo ($5.2/M)
# β Z.ai: GLM 5V Turbo ($5.2/M)
# β Anthropic Claude Haiku Latest ($6.0/M)
# β Anthropic: Claude Opus Latest ($30.0/M)
# β Anthropic Claude Sonnet Latest ($18.0/M)
# β Google Gemini Flash Latest ($10.5/M)
# β Google Gemini Pro Latest ($14.0/M)
# β MoonshotAI Kimi Latest ($4.1/M)
# β OpenAI GPT Latest ($35.0/M)
# β OpenAI GPT Mini Latest ($5.3/M)
# β accounts/fireworks/models/flux-1-dev-fp8 ($β/M)
# β accounts/fireworks/models/flux-1-schnell-fp8 ($β/M)
# β accounts/fireworks/models/flux-kontext-max ($β/M)
# β accounts/fireworks/models/flux-kontext-pro ($β/M)
# β accounts/fireworks/models/glm-5p1 ($β/M)
# β accounts/fireworks/models/gpt-oss-120b ($β/M)
# β accounts/fireworks/models/kimi-k2p5 ($β/M)
# β accounts/fireworks/models/kimi-k2p6 ($β/M)
# β BAAI/bge-large-en-v1.5 ($0.0/M)
# β BAAI/bge-large-zh-v1.5 ($0.0/M)
# β BAAI/bge-m3 ($0.0/M)
# β BAAI/bge-reranker-v2-m3 ($0.0/M)
# β baidu/ERNIE-Image-Turbo ($0.0/M)
# β ByteDance-Seed/Seed-OSS-36B-Instruct ($0.0/M)
# β codeqwen1.5-7b-chat ($β/M)
# β deepseek-ai/DeepSeek-OCR ($0.0/M)
# β deepseek-ai/DeepSeek-R1-0528-Qwen3-8B ($0.0/M)
# β deepseek-ai/DeepSeek-V3 ($0.0/M)
# β deepseek-ai/DeepSeek-V3.1-Terminus ($0.0/M)
# β deepseek-ai/DeepSeek-V3.2 ($0.0/M)
# β deepseek-r1-distill-llama-8b ($β/M)
# β deepseek-r1-distill-qwen-1.5b ($β/M)
# β deepseek-r1-distill-qwen-14b ($β/M)
# β deepseek-r1-distill-qwen-7b ($β/M)
# β deepseek-v3.1 ($β/M)
# β deepseek-v3.2 ($β/M)
# β fnlp/MOSS-TTSD-v0.5 ($0.0/M)
# β FunAudioLLM/CosyVoice2-0.5B ($0.0/M)
# β FunAudioLLM/SenseVoiceSmall ($0.0/M)
# β glm-4.7 ($β/M)
# β glm-5.1 ($β/M)
# β gui-plus ($β/M)
# β inclusionAI/Ling-flash-2.0 ($0.0/M)
# β inclusionAI/Ling-mini-2.0 ($0.0/M)
# β kimi-k2.5 ($β/M)
# β kimi-k2.6 ($β/M)
# β kimi/kimi-k2.5 ($β/M)
# β kimi/kimi-k2.6 ($β/M)
# β Kwai-Kolors/Kolors ($0.0/M)
# β LoRA/Qwen/Qwen2.5-14B-Instruct ($0.0/M)
# β LoRA/Qwen/Qwen2.5-32B-Instruct ($0.0/M)
# β LoRA/Qwen/Qwen2.5-72B-Instruct ($0.0/M)
# β LoRA/Qwen/Qwen2.5-7B-Instruct ($0.0/M)
# β MiniMax-M2.1 ($β/M)
# β MiniMax-M2.5 ($β/M)
# β MiniMax/speech-02-hd ($β/M)
# β MiniMax/speech-02-turbo ($β/M)
# β MiniMax/speech-2.8-hd ($β/M)
# β MiniMax/speech-2.8-turbo ($β/M)
# β MiniMaxAI/MiniMax-M2.5 ($0.0/M)
# β netease-youdao/bce-embedding-base_v1 ($0.0/M)
# β netease-youdao/bce-reranker-base_v1 ($0.0/M)
# β PaddlePaddle/PaddleOCR-VL-1.5 ($0.0/M)
# β Pro/BAAI/bge-m3 ($0.0/M)
# β Pro/BAAI/bge-reranker-v2-m3 ($0.0/M)
# β Pro/deepseek-ai/DeepSeek-V3 ($0.0/M)
# β Pro/deepseek-ai/DeepSeek-V3.1-Terminus ($0.0/M)
# β Pro/deepseek-ai/DeepSeek-V3.2 ($0.0/M)
# β Pro/MiniMaxAI/MiniMax-M2.5 ($0.0/M)
# β Pro/moonshotai/Kimi-K2.5 ($0.0/M)
# β Pro/moonshotai/Kimi-K2.6 ($0.0/M)
# β Pro/Qwen/Qwen2.5-7B-Instruct ($0.0/M)
# β Pro/zai-org/GLM-4.7 ($0.0/M)
# β Pro/zai-org/GLM-5 ($0.0/M)
# β Pro/zai-org/GLM-5.1 ($0.0/M)
# β qvq-max ($β/M)
# β qvq-plus ($β/M)
# β qwen-1.8b-chat ($β/M)
# β qwen-1.8b-longcontext-chat ($β/M)
# β qwen-14b-chat ($β/M)
# β qwen-72b-chat ($β/M)
# β qwen-7b-chat ($β/M)
# β qwen-coder-plus ($β/M)
# β qwen-coder-plus-1106 ($β/M)
# β qwen-coder-turbo ($β/M)
# β qwen-deep-research-2025-12-15 ($β/M)
# β qwen-deep-search-planning ($β/M)
# β qwen-flash ($β/M)
# β qwen-flash-character ($β/M)
# β qwen-flash-character-2026-02-26 ($β/M)
# β qwen-image-2.0 ($β/M)
# β qwen-image-2.0-2026-03-03 ($β/M)
# β qwen-image-2.0-pro ($β/M)
# β qwen-image-2.0-pro-2026-03-03 ($β/M)
# β qwen-image-2.0-pro-2026-04-22 ($β/M)
# β qwen-image-edit-max ($β/M)
# β qwen-image-edit-max-2026-01-16 ($β/M)
# β qwen-image-edit-plus ($β/M)
# β qwen-image-edit-plus-2025-10-30 ($β/M)
# β qwen-image-edit-plus-2025-12-15 ($β/M)
# β qwen-image-max ($β/M)
# β qwen-image-max-2025-12-30 ($β/M)
# β qwen-image-plus-2026-01-09 ($β/M)
# β qwen-long ($β/M)
# β qwen-math-plus ($β/M)
# β qwen-math-plus-0919 ($β/M)
# β qwen-math-plus-latest ($β/M)
# β qwen-math-turbo ($β/M)
# β qwen-max ($β/M)
# β qwen-max-0107 ($β/M)
# β qwen-max-1201 ($β/M)
# β qwen-max-longcontext ($β/M)
# β qwen-mt-flash ($β/M)
# β qwen-mt-lite ($β/M)
# β qwen-mt-plus ($β/M)
# β qwen-mt-turbo ($β/M)
# β qwen-omni-turbo ($β/M)
# β qwen-plus-2025-01-25 ($β/M)
# β qwen-plus-2025-04-28 ($β/M)
# β qwen-plus-2025-07-14 ($β/M)
# β qwen-plus-2025-09-11 ($β/M)
# β qwen-plus-2025-11-05 ($β/M)
# β qwen-plus-2025-12-01 ($β/M)
# β qwen-plus-latest ($β/M)
# β Qwen/Qwen-Image ($0.0/M)
# β Qwen/Qwen-Image-Edit ($0.0/M)
# β Qwen/Qwen-Image-Edit-2509 ($0.0/M)
# β Qwen/Qwen2.5-14B-Instruct ($0.0/M)
# β Qwen/Qwen2.5-32B-Instruct ($0.0/M)
# β Qwen/Qwen2.5-72B-Instruct ($0.0/M)
# β Qwen/Qwen2.5-72B-Instruct-128K ($0.0/M)
# β Qwen/Qwen2.5-7B-Instruct ($0.0/M)
# β Qwen/Qwen3.5-4B ($0.0/M)
# β Qwen/Qwen3-Embedding-0.6B ($0.0/M)
# β Qwen/Qwen3-Embedding-4B ($0.0/M)
# β Qwen/Qwen3-Embedding-8B ($0.0/M)
# β Qwen/Qwen3-Omni-30B-A3B-Captioner ($0.0/M)
# β Qwen/Qwen3-Omni-30B-A3B-Instruct ($0.0/M)
# β Qwen/Qwen3-Omni-30B-A3B-Thinking ($0.0/M)
# β Qwen/Qwen3-Reranker-0.6B ($0.0/M)
# β Qwen/Qwen3-Reranker-4B ($0.0/M)
# β Qwen/Qwen3-Reranker-8B ($0.0/M)
# β Qwen/Qwen3-VL-32B-Thinking ($0.0/M)
# β Qwen/Qwen3-VL-Embedding-8B ($0.0/M)
# β Qwen/Qwen3-VL-Reranker-8B ($0.0/M)
# β qwen-tts-2025-05-22 ($β/M)
# β qwen-turbo ($β/M)
# β qwen-turbo-0919 ($β/M)
# β qwen-turbo-2024-11-01 ($β/M)
# β qwen-vl-max ($β/M)
# β qwen-vl-ocr ($β/M)
# β qwen-vl-ocr-2025-11-20 ($β/M)
# β qwen-vl-ocr-latest ($β/M)
# β qwen-vl-plus ($β/M)
# β qwen1.5-0.5b-chat ($β/M)
# β qwen1.5-1.8b-chat ($β/M)
# β qwen1.5-110b-chat ($β/M)
# β qwen1.5-14b-chat ($β/M)
# β qwen1.5-32b-chat ($β/M)
# β qwen1.5-72b-chat ($β/M)
# β qwen1.5-7b-chat ($β/M)
# β qwen2-0.5b-instruct ($β/M)
# β qwen2-1.5b-instruct ($β/M)
# β qwen2.5-0.5b-instruct ($β/M)
# β qwen2.5-1.5b-instruct ($β/M)
# β qwen2.5-math-1.5b-instruct ($β/M)
# β qwen2-57b-a14b-instruct ($β/M)
# β qwen2-7b-instruct ($β/M)
# β qwen3-235b-a22b-instruct-2507 ($β/M)
# β qwen3.5-122b-a10b ($β/M)
# β qwen3.5-27b ($β/M)
# β qwen3.5-35b-a3b ($β/M)
# β qwen3.5-397b-a17b ($β/M)
# β qwen3.5-flash ($β/M)
# β qwen3.5-flash-2026-02-23 ($β/M)
# β qwen3.5-livetranslate-flash-realtime ($β/M)
# β qwen3.5-livetranslate-flash-realtime-2026-05-19 ($β/M)
# β qwen3.5-omni-flash ($β/M)
# β qwen3.5-omni-flash-2026-03-15 ($β/M)
# β qwen3.5-omni-flash-realtime ($β/M)
# β qwen3.5-omni-flash-realtime-2026-03-15 ($β/M)
# β qwen3.5-omni-plus ($β/M)
# β qwen3.5-omni-plus-2026-03-15 ($β/M)
# β qwen3.5-omni-plus-realtime ($β/M)
# β qwen3.5-omni-plus-realtime-2026-03-15 ($β/M)
# β qwen3.5-plus ($β/M)
# β qwen3.5-plus-2026-02-15 ($β/M)
# β qwen3.5-plus-2026-04-20 ($β/M)
# β qwen3.6-27b ($β/M)
# β qwen3.6-35b-a3b ($β/M)
# β qwen3.6-flash ($β/M)
# β qwen3.6-flash-2026-04-16 ($β/M)
# β qwen3.6-max-preview ($β/M)
# β qwen3.6-plus ($β/M)
# β qwen3.6-plus-2026-04-02 ($β/M)
# β qwen3.7-max ($β/M)
# β qwen3.7-max-2026-05-17 ($β/M)
# β qwen3.7-max-2026-05-20 ($β/M)
# β qwen3.7-max-preview ($β/M)
# β qwen3-asr-flash-2026-02-10 ($β/M)
# β qwen3-asr-flash-realtime ($β/M)
# β qwen3-asr-flash-realtime-2025-10-27 ($β/M)
# β qwen3-asr-flash-realtime-2026-02-10 ($β/M)
# β qwen3-coder-480b-a35b-instruct ($β/M)
# β qwen3-coder-plus-2025-07-22 ($β/M)
# β qwen3-coder-plus-2025-09-23 ($β/M)
# β qwen3-livetranslate-flash ($β/M)
# β qwen3-livetranslate-flash-2025-12-01 ($β/M)
# β qwen3-livetranslate-flash-realtime ($β/M)
# β qwen3-livetranslate-flash-realtime-2025-09-22 ($β/M)
# β qwen3-max-2025-09-23 ($β/M)
# β qwen3-max-2026-01-23 ($β/M)
# β qwen3-max-preview ($β/M)
# β qwen3-omni-flash ($β/M)
# β qwen3-omni-flash-2025-09-15 ($β/M)
# β qwen3-omni-flash-2025-12-01 ($β/M)
# β qwen3-omni-flash-realtime ($β/M)
# β qwen3-omni-flash-realtime-2025-09-15 ($β/M)
# β qwen3-omni-flash-realtime-2025-12-01 ($β/M)
# β qwen3-s2s-flash-realtime-2025-09-22 ($β/M)
# β qwen3-tts-flash ($β/M)
# β qwen3-tts-flash-2025-09-18 ($β/M)
# β qwen3-tts-flash-2025-11-27 ($β/M)
# β qwen3-tts-flash-realtime ($β/M)
# β qwen3-tts-flash-realtime-2025-09-18 ($β/M)
# β qwen3-tts-flash-realtime-2025-11-27 ($β/M)
# β qwen3-tts-instruct-flash ($β/M)
# β qwen3-tts-instruct-flash-2026-01-26 ($β/M)
# β qwen3-tts-instruct-flash-realtime ($β/M)
# β qwen3-tts-instruct-flash-realtime-2026-01-22 ($β/M)
# β qwen3-tts-vc-2026-01-22 ($β/M)
# β qwen3-tts-vc-realtime-2025-11-27 ($β/M)
# β qwen3-tts-vc-realtime-2026-01-15 ($β/M)
# β qwen3-tts-vd-2026-01-26 ($β/M)
# β qwen3-tts-vd-realtime-2025-12-16 ($β/M)
# β qwen3-tts-vd-realtime-2026-01-15 ($β/M)
# β qwen3-vl-flash ($β/M)
# β qwen3-vl-flash-2025-10-15 ($β/M)
# β qwen3-vl-flash-2026-01-22 ($β/M)
# β qwen3-vl-plus ($β/M)
# β qwen3-vl-plus-2025-09-23 ($β/M)
# β qwen3-vl-plus-2025-12-19 ($β/M)
# β qwq-plus ($β/M)
# β siliconflow/deepseek-r1-0528 ($β/M)
# β siliconflow/deepseek-v3-0324 ($β/M)
# β siliconflow/deepseek-v3.1-terminus ($β/M)
# β siliconflow/deepseek-v3.2 ($β/M)
# β stepfun-ai/Step-3.5-Flash ($0.0/M)
# β TeleAI/TeleSpeechASR ($0.0/M)
# β tencent/Hunyuan-MT-7B ($0.0/M)
# β THUDM/GLM-4-32B-0414 ($0.0/M)
# β THUDM/GLM-4-9B-0414 ($0.0/M)
# β THUDM/GLM-Z1-9B-0414 ($0.0/M)
# β Tongyi-MAI/Z-Image ($0.0/M)
# β Tongyi-MAI/Z-Image-Turbo ($0.0/M)
# β tongyi-xiaomi-analysis-flash ($β/M)
# β tongyi-xiaomi-analysis-pro ($β/M)
# β vanchin/deepseek-ocr ($β/M)
# β vanchin/deepseek-v3 ($β/M)
# β vanchin/deepseek-v3.1-terminus ($β/M)
# β vanchin/deepseek-v3.2-think ($β/M)
# β Wan-AI/Wan2.2-I2V-A14B ($0.0/M)
# β Wan-AI/Wan2.2-T2V-A14B ($0.0/M)
# β wan2.7-image ($β/M)
# β wan2.7-image-pro ($β/M)
# β zai-org/GLM-4.5-Air ($0.0/M)
# β zai-org/GLM-4.5V ($0.0/M)
# β ZHIPU/GLM-5 ($β/M)
# β ZHIPU/GLM-5.1 ($β/M)
Model Comparison
Metric
#1 DeepSeek: DeepSeek V4 Flash
#2 Tencent: Hy3 preview
#3 Anthropic: Claude Opus 4.7
Rank
#1
#2
#3
OR Input /1M
$0.10
$0.06
$5.00
OR Output /1M
$0.20
$0.21
$25.00
SF Input /1M
$0.00
β
β
Best Roundtrip
$0.00/M
$0.27/M
$30.00/M
Context
1,024K
256K
977K
Platforms
OR
SF
BL
OR
OR
Best In
Best Value
Largest Context
β
β