Skip to content

Supported Models

Auto-generated from provider JSON files on 2026-04-19 05:26 UTC. Do not edit manually — run python scripts/generate_model_docs.py

14 providers · 197 active models · 67 deprecated

AI21

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
AI21: Jamba 1.5 Large jamba-1-5-large $2.00 $8.00 256K
AI21: Jamba 1.5 Mini jamba-1-5-mini $0.200 $0.400 256K

Amazon

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Nova 2 Lite nova-2-lite $0.300 $2.50 1.00M
Nova Lite 1.0 nova-lite $0.060 $0.240 300K
Nova Micro 1.0 nova-micro $0.035 $0.140 128K
Nova Pro 1.0 nova-pro $0.800 $3.20 300K

Video Generation

Model Model ID
Amazon Nova Reel nova-reel

Anthropic

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Claude Opus 4.7 claude-opus-4-7 $5.00 $25.00 $0.500 1.00M
Claude Opus 4.5 claude-opus-4-5 $5.00 $25.00 $0.500 200K
Claude Sonnet 4.6 claude-sonnet-4-6 $3.00 $15.00 $0.300 1.00M
Claude Opus 4.6 claude-opus-4-6 $5.00 $25.00 $0.500 1.00M
Claude Haiku 4.5 claude-haiku-4.5 $1.00 $5.00 $0.100 200K
Claude Sonnet 4.5 claude-sonnet-4.5 $3.00 $15.00 $0.300 1.00M
Claude Opus 4.1 claude-opus-4.1 $15.00 $75.00 $1.50 200K
Claude Sonnet 4 claude-sonnet-4 $3.00 $15.00 $0.300 200K
Claude Sonnet 4 (thinking) claude-sonnet-4:thinking $3.00 $15.00 $0.300 200K
Claude Opus 4 claude-opus-4 $15.00 $75.00 $1.50 200K
Claude Opus 4 (thinking) claude-opus-4:thinking $15.00 $75.00 $1.50 200K
Claude 3.7 Sonnet claude-3.7-sonnet $3.00 $15.00 $0.300 200K
Claude 3.7 Sonnet (thinking) claude-3.7-sonnet:thinking $3.00 $15.00 $0.300 200K
Claude 3.5 Haiku claude-3.5-haiku $0.800 $4.00 $0.080 200K

Cohere

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Command R+ command-r-plus $3.00 $15.00 128K
Command R command-r $0.500 $1.50 128K

Embedding Models

Model Model ID Input/1M
Embed English v3.0 embed-english-v3.0
Embed Multilingual v3.0 embed-multilingual-v3.0

DeepSeek

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
DeepSeek V4 deepseek-v4 $0.300 $0.500 128K
DeepSeek V4 deepseek-v4 $0.300 $0.500 128K
DeepSeek LLM 67B Chat deepseek-chat $0.280 $0.420 128K
DeepSeek Reasoner deepseek-reasoner $0.280 $0.420 128K

Google

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Gemini 3.1 Flash Image Preview gemini-3.1-flash-image-preview $0.100 $0.400 $0.010 1.00M
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview $0.100 $0.400 $0.010 1.00M
Gemini 3.1 Pro Preview gemini-3.1-pro-preview $2.00 $12.00 $0.200 1.00M
Gemma 3 27B gemma-3-27b-it $0.100 $0.200 131K
Gemma 3 4B gemma-3-4b-it $0.020 $0.040 131K
Gemma 3 12B gemma-3-12b-it $0.050 $0.100 131K
Gemini 2.5 Flash Image (Nano Banana) gemini-2.5-flash-image $0.300 $2.50 32K
Nano Banana Pro (Gemini 3 Pro Image Preview) gemini-3-pro-image-preview $2.00 $12.00 65K
Gemini 2.5 Flash gemini-2.5-flash $0.300 $2.50 $0.030 1.05M
Gemini 2.5 Pro gemini-2.5-pro $1.25 $10.00 $0.125 1.05M
Gemma 3 1B gemma-3-1b-it 32K
Gemini 2.5 Flash Lite gemini-2.5-flash-lite $0.100 $0.400 $0.010 1.05M
Gemini 3 Flash Preview gemini-3-flash-preview $0.500 $3.00 $0.050 1.05M

Image Models

Model Model ID Low 1024x1024 Recommended
Imagen 3 imagen-3.0-generate-002
Imagen 4 imagen-4.0-generate-001
Imagen 4 Ultra imagen-4.0-ultra-generate-001
Imagen 4 Fast imagen-4.0-fast-generate-001

Embedding Models

Model Model ID Input/1M
Gemini Embedding 2 Preview gemini-embedding-2-preview
Gemini Embedding Experimental gemini-embedding-exp-03-07
Text Embedding 004 text-embedding-004
Embedding 001 embedding-001

Text-to-Speech

Model Model ID
Gemini 2.5 Flash Preview TTS gemini-2.5-flash-preview-tts
Gemini 2.5 Pro Preview TTS gemini-2.5-pro-preview-tts

Video Generation

Model Model ID
Veo 2 veo-2.0-generate-001
Veo 3 veo-3.0-generate-001
Veo 3 Fast veo-3.0-fast-generate-001
Veo 3.1 veo-3.1-generate-preview
Veo 3.1 Fast veo-3.1-fast-generate-preview

Luma


Meta

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Llama 4 Maverick llama-4-maverick $0.190 $0.490 1.00M
Llama 4 Scout llama-4-scout $0.170 $0.170 10.00M
Llama 3.2 1B Instruct llama3-2-1b-instruct $0.100 $0.100 131K
Llama 3.2 3B Instruct llama3-2-3b-instruct $0.150 $0.150 131K
Llama 3.2 11B Vision Instruct llama3-2-11b-instruct $0.160 $0.160 131K
Llama 3.2 90B Vision Instruct llama3-2-90b-instruct $0.720 $0.720 131K
Llama 3.3 70B Instruct llama3-3-70b-instruct $0.720 $0.720 131K

Mistral

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Devstral 2 devstral-2 $0.400 $2.00 262K
Mistral Small 4 mistral-small-4 $0.150 $0.600 262K
Devstral 2 devstral-2 $0.400 $2.00 262K
Mistral Small 4 mistral-small-4 $0.150 $0.600 262K
Ministral 3B ministral-3b-latest $0.040 $0.040 131K
Ministral 8B ministral-8b-latest $0.100 $0.100 128K
Mistral Small mistral-small $0.200 $0.600 32K
Mistral Small 2409 mistral-small-2409 $0.200 $0.600 32K
Mistral Small 2501 mistral-small-2501 $0.200 $0.600 32K
Mistral Small 2503 mistral-small-2503 $0.200 $0.600 32K
Mistral Small Latest mistral-small-latest $0.200 $0.600 32K
Mistral Medium 3 mistral-medium-3 $2.75 $8.10 32K
Mistral Large mistral-large-latest $2.00 $6.00 128K
Mistral Large 2411 mistral-large-2411 $2.00 $6.00 131K
Pixtral Large Latest pixtral-large-latest $2.00 $6.00 128K
Pixtral Large 2411 pixtral-large-2411 $2.00 $6.00 128K
Codestral 2501 codestral-2501 $0.300 $0.900 262K
Codestral Latest codestral-latest $0.300 $0.900 262K
Pixtral 12B pixtral-12b $0.100 $0.100 32K
Pixtral 12B 2409 pixtral-12b-2409 $0.100 $0.100 32K
Pixtral 12B Latest pixtral-12b-latest $0.100 $0.100 32K
Saba mistral-saba-latest $0.200 $0.600 32K
Saba 2502 mistral-saba-2502 $0.200 $0.600 32K
Mistral Medium 3.1 mistral-medium-3.1 $0.400 $2.00 131K
Mistral Medium 3.2 mistral-medium-3.2 $0.100 $0.300 131K
Mistral Large 3 mistral-large-3 $0.500 $1.50 262K
Devstral 2 2512 devstral-2512 $0.150 $0.600 262K

Embedding Models

Model Model ID Input/1M
Mistral Embed mistral-embed

OpenAI

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
GPT-4o-mini gpt-4o-mini $0.150 $0.600 $0.075 128K
GPT-4o-mini (2024-07-18) gpt-4o-mini-2024-07-18 $0.150 $0.600 $0.075 128K
GPT-4o gpt-4o $2.50 $10.00 $1.25 128K
GPT-4o (2024-08-06) gpt-4o-2024-08-06 $2.50 $10.00 $1.25 128K
ChatGPT-4o chatgpt-4o-latest $5.00 $15.00 128K
GPT-4o (2024-11-20) gpt-4o-2024-11-20 $2.50 $10.00 $1.25 128K
GPT-4.1 gpt-4.1 $2.00 $8.00 $0.500 1.05M
GPT-4.1 Mini gpt-4.1-mini $0.400 $1.60 $0.100 1.05M
GPT-4.1 Nano gpt-4.1-nano $0.100 $0.400 $0.025 1.05M
GPT-3.5 Turbo gpt-3.5-turbo $0.500 $1.50 16K
o1 o1 $15.00 $60.00 $7.50 200K
o3 o3 $10.00 $40.00 $2.50 200K
o3 Mini o3-mini $1.10 $4.40 $0.550 200K
o4 Mini o4-mini $1.10 $4.40 $0.275 200K
GPT-4 Turbo gpt-4-turbo $10.00 $30.00 128K
GPT-4o Search Preview gpt-4o-search-preview $2.50 $10.00 128K
GPT-3.5 Turbo 16k gpt-3.5-turbo-0125 $0.500 $1.50 16K
GPT-4o (2024-05-13) gpt-4o-2024-05-13 $5.00 $15.00 128K
GPT-4o-mini Search Preview gpt-4o-mini-search-preview $0.150 $0.600 128K
GPT-4 gpt-4 $30.00 $60.00 8K
GPT-4 Turbo (older v1106) gpt-4-1106-preview $10.00 $30.00 128K
GPT-3.5 Turbo 16k (older v1106) gpt-3.5-turbo-1106 $1.00 $2.00 16K
GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct $1.50 $2.00 4K
GPT-3.5 Turbo 16k gpt-3.5-turbo-16k $3.00 $4.00 16K
o1-pro o1-pro $150.00 $600.00 200K
GPT-5 gpt-5 $1.25 $10.00 $0.125 400K
GPT-5 Chat gpt-5-chat-latest $1.25 $10.00 $0.125 400K
GPT-5 Nano gpt-5-nano $0.050 $0.400 $0.0050 400K
GPT-5 Mini gpt-5-mini $0.250 $2.00 $0.025 400K
GPT-5 Pro gpt-5-pro $15.00 $120.00 $1.50 400K
GPT-5.1-Codex-Mini gpt-5.1-codex-mini $0.250 $2.00 $0.025 400K
GPT-5.1 Chat gpt-5.1-chat-latest $1.25 $10.00 $0.125 128K
GPT-5.1-Codex gpt-5.1-codex $1.25 $10.00 $0.125 400K
GPT-5.1 gpt-5.1 $1.25 $10.00 $0.125 400K
GPT-5.1-Codex-Max gpt-5.1-codex-max $1.25 $10.00 $0.125 400K
GPT-5.2 gpt-5.2 $1.75 $14.00 $0.175 400K
GPT-5.2 Chat gpt-5.2-chat $1.75 $14.00 $0.175 128K
GPT-5.2 Pro gpt-5.2-pro $21.00 $168.00 400K
GPT-5.4 gpt-5.4 $2.50 $15.00 $0.250 1.05M
GPT-5.4 Mini gpt-5.4-mini $0.750 $4.50 $0.075 400K
GPT-5.4 Nano gpt-5.4-nano $0.200 $1.25 $0.020 400K
GPT-5.4 Pro gpt-5.4-pro $30.00 $180.00 $3.00 1.05M
GPT-5.2 Codex gpt-5.2-codex $1.75 $14.00 $0.175 400K
GPT-5 Codex gpt-5-codex $1.25 $10.00 $0.125 400K

Image Models

Model Model ID Low 1024x1024 Recommended
GPT Image 1 gpt-image-1 $0.011
GPT Image 1 Mini gpt-image-1-mini $0.005
GPT Image 1.5 gpt-image-1.5 $0.009

Embedding Models

Model Model ID Input/1M
Text Embedding 3 Small text-embedding-3-small
Text Embedding 3 Large text-embedding-3-large
Text Embedding Ada 002 text-embedding-ada-002

Text-to-Speech

Model Model ID
GPT-4o mini TTS gpt-4o-mini-tts
TTS-1 HD tts-1-hd
TTS-1 tts-1

Speech-to-Text

Model Model ID
Whisper 1 whisper-1
GPT-4o Transcribe gpt-4o-transcribe
GPT-4o Mini Transcribe gpt-4o-mini-transcribe

Video Generation

Model Model ID
Sora 2 sora-2
Sora 2 Pro sora-2-pro

Qwen

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
QwQ-Plus qwq-plus $0.600 $2.40 $0.640 32K
Qwen-Max qwen-max $1.60 $6.40 $0.640 32K
Qwen-Max-Latest qwen-max-latest $1.60 $6.40 $0.640 32K
Qwen-Max-2025-01-25 qwen-max-2025-01-25 $1.60 $6.40 $0.640 32K
Qwen-Plus qwen-plus $0.400 $1.20 $0.160 131K
Qwen-Plus-Latest qwen-plus-latest $0.400 $1.20 $0.160 131K
Qwen-Plus-Thinking qwen-plus:thinking $0.400 $8.00 $0.160 131K
Qwen-Plus-2025-04-28 qwen-plus-2025-04-28 $0.400 $1.20 $0.160 131K
Qwen-Turbo qwen-turbo $0.050 $0.200 $0.020 1.00M
Qwen-Turbo-Thinking qwen-turbo:thinking $0.050 $1.00 $0.020 1.00M
Qwen VL Max qwen-vl-max $0.800 $3.20 7K
Qwen VL Max Latest qwen-vl-max-latest $0.800 $3.20 7K
Qwen VL Plus qwen-vl-plus $0.210 $0.630 7K
Qwen VL Plus Latest qwen-vl-plus-latest $0.210 $0.630 7K
Qwen3 235B A22B qwen3-235b-a22b $0.700 $2.80 131K
Qwen3 235B A22B Thinking qwen3-235b-a22b:thinking $0.700 $8.40 131K
Qwen3 32B qwen3-32b $0.700 $2.80 131K
Qwen3 30B A3B qwen3-30b-a3b $0.200 $0.800 131K
Qwen3 14B qwen3-14b $0.350 $1.40 131K
Qwen3 8B qwen3-8b $0.180 $0.700 131K
Qwen3 0.6B qwen3-0.6b $0.110 $0.220 32K
Qwen3 1.7B qwen3-1.7b $0.110 $0.220 32K
Qwen3 4B qwen3-4b $0.110 $0.220 128K
Qwen2.5 VL 32B Instruct qwen2.5-vl-32b-instruct $1.40 $4.20 8K
Qwen2.5-VL 7B Instruct qwen2.5-vl-7b-instruct $0.350 $1.05 32K
Qwen: Qwen3 VL 8B Thinking qwen3-vl-8b-thinking $0.180 $2.10 256K

Image Models

Model Model ID Low 1024x1024 Recommended
Wan2.1 T2I Turbo wan2.1-t2i-turbo
Wan2.1 T2I Plus wan2.1-t2i-plus

Embedding Models

Model Model ID Input/1M
Text Embedding V3 text-embedding-v3

Stability AI

Image Models

Model Model ID Low 1024x1024 Recommended
Stable Diffusion 3.5 Large sd3-5-large
Stable Diffusion 3 Large sd3-large
Stable Image Core stable-image-core
Stable Image Ultra stable-image-ultra

Writer

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Palmyra X4 palmyra-x4 $2.50 $10.00 122K
Palmyra X5 palmyra-x5 $0.600 $6.00 1.04M

xAI

Text Models

Model Model ID Input/1M Output/1M Cache/1M Context Reasoning
Grok 4.2 grok-4.2 $2.00 $6.00 2.00M
Grok 4.1 Fast grok-4.1-fast $0.200 $0.500 2.00M
Grok 4.2 grok-4.2 $2.00 $6.00 2.00M
Grok 4.1 Fast grok-4.1-fast $0.200 $0.500 2.00M
Grok 3 Mini Beta grok-3-mini-beta $0.300 $0.500 131K
Grok 3 Beta grok-3-beta $3.00 $15.00 131K
Grok 3 Fast grok-3-fast $5.00 $25.00 131K
Grok 3 Mini Fast grok-3-mini-fast $0.600 $4.00 131K
Grok 4 grok-4 $3.00 $15.00 256K
Grok Code Fast 1 grok-code-fast-1 $0.200 $1.50 256K
Grok 4 Fast grok-4-fast $0.200 $0.500 2.00M
Grok 4.1 Fast grok-4-1-fast $0.200 $0.500 2.00M

Deprecated Models

These models are disabled and no longer recommended for new integrations.

Provider Model Model ID
Anthropic Claude 3.5 Sonnet claude-3.5-sonnet
Anthropic Claude 3 Haiku claude-3-haiku
Anthropic Claude 3 Sonnet claude-3-sonnet
Anthropic Claude 3 Opus claude-3-opus
Anthropic Claude v2.1 claude-2.1
Anthropic Claude v2 claude-2
Anthropic Claude v2.0 claude-2.0
Anthropic Claude v1 claude-1
Cohere Command command
DeepSeek DeepSeek-Coder-V2 deepseek-coder
Google Gemini 2.0 Flash gemini-2.0-flash
Google Gemini 2.0 Flash 001 gemini-2.0-flash-001
Google Gemini 2.5 Flash Preview gemini-2.5-flash-preview-09-2025
Google Gemini 2.5 Pro Preview gemini-2.5-pro-preview
Google Gemini 2.0 Flash Lite 001 gemini-2.0-flash-lite-001
Google Gemini 2.0 Flash Lite gemini-2.0-flash-lite
Google Gemini 1.5 Pro Latest gemini-1.5-pro-latest
Google Gemini 1.5 Flash gemini-1.5-flash
Google Gemini 1.5 Flash Latest gemini-1.5-flash-latest
Google Gemini 1.5 Flash 001 gemini-1.5-flash-001
Google Gemini 1.5 Flash 002 gemini-1.5-flash-002
Google Gemini 1.5 Flash 8B gemini-1.5-flash-8b
Google Gemini 1.5 Flash-8B 001 gemini-1.5-flash-8b-001
Google Gemini 1.5 Flash-8B Latest gemini-1.5-flash-8b-latest
Google Gemini 1.5 Pro gemini-1.5-pro
Google Gemini 1.5 Pro 001 gemini-1.5-pro-001
Google Gemini 1.5 Pro 002 gemini-1.5-pro-002
Google Gemini 2.5 Flash Preview 4 17 Thinking gemini-2.5-flash-preview-04-17-thinking
Google Gemini 2.5 Flash Image Preview (AKA Nano Banana) gemini-2.5-flash-image-preview
Google Gemini 3 Pro Preview gemini-3-pro-preview
Google Gemini 2.0 Flash Lite gemini-2.0-flash-lite-001
Google Gemini 2.5 Flash Lite Preview 09-2025 gemini-2.5-flash-lite-preview-09-2025
Luma Luma Ray2 ray2
Meta Llama 3 8B Instruct llama3-8b-instruct
Meta Llama 3 70B Instruct llama3-70b-instruct
Meta Llama 3.1 8B Instruct llama3-1-8b-instruct
Meta Llama 3.1 70B Instruct llama3-1-70b-instruct
Meta Llama 3.1 405B Instruct llama3-1-405b-instruct
Mistral Mistral Tiny mistral-tiny
Mistral Mistral Tiny Latest mistral-tiny-latest
Mistral Mistral Nemo mistral-nemo
Mistral Mistral Medium mistral-medium
Mistral Mistral Medium Latest mistral-medium-latest
Mistral Mistral Large 2407 mistral-large-2407
Mistral Codestral Mamba open-codestral-mamba
Mistral Codestral Mamba Latest codestral-mamba-latest
OpenAI Codex Mini codex-mini-latest
OpenAI GPT-4.5 (Preview) gpt-4.5-preview
OpenAI o1-mini o1-mini
OpenAI o1-preview (2024-09-12) o1-preview-2024-09-12
OpenAI GPT-4 Turbo Preview gpt-4-turbo-preview
OpenAI o1-preview o1-preview
OpenAI o1-mini (2024-09-12) o1-mini-2024-09-12
OpenAI GPT-4 (older v0314) gpt-4-0314
OpenAI GPT-4 32k (older v0314) gpt-4-32k-0314
OpenAI OpenAI: GPT-5 Image Mini gpt-5-image-mini
OpenAI OpenAI: GPT-5 Image gpt-5-image
OpenAI DALL·E 3 dall-e-3
OpenAI DALL·E 2 dall-e-2
Qwen Qwen2.5 72B Instruct qwen-2.5-72b-instruct
Qwen Qwen2.5 7B Instruct qwen-2.5-7b-instruct
xAI Grok 2 Vision 1212 grok-2-vision-1212
xAI Grok 2 1212 grok-2-1212
xAI Grok 2 grok-2
xAI Grok 2 Image 1212 grok-2-image-1212
xAI Grok 2 Image grok-2-image
xAI Grok 2 Image Latest grok-2-image-latest
Documentation last built on May 23, 2026