Miavo
Models

93 models from 11 providers.

One baseURL and one sk-maas-… key reaches every model below — chat, tool calling, video.

93 of 93 models

Anthropic

Claude Haiku 4.5

claude-haiku-4-5

Anthropic’s fastest near-frontier model. 200k context, 64k output.

In / MTok
$1.00
Out / MTok
$5.00
Context
200k
StreamingToolsImage input
Anthropic

Claude Opus 4.6

claude-opus-4-6

Previous flagship — 1M context. Still served alongside 4.7.

In / MTok
$5.00
Out / MTok
$25.00
Context
1,000k
StreamingToolsImage input
Anthropic

Claude Opus 4.7

claude-opus-4-7

Top reasoning + agentic coding. 1M context, 128k output.

In / MTok
$5.00
Out / MTok
$25.00
Context
1,000k
StreamingToolsImage input
Anthropic

Claude Sonnet 4.6

claude-sonnet-4-6

Balanced flagship — best speed/intelligence ratio. 1M context.

In / MTok
$3.00
Out / MTok
$15.00
Context
1,000k
StreamingToolsImage input
Gemini

Gemini 2.5 Flash TTS

gemini-2.5-flash-tts

Cheaper prior-gen TTS — same controllability, narrower language set.

Per unit
$0.00
Provider
Gemini
Google
Async
No
Streaming
Gemini

Gemini 3 Flash

gemini-3-flash

Mid-tier multimodal. Image + video + text input. 1M context.

In / MTok
$0.50
Out / MTok
$3.00
Context
1,000k
StreamingToolsImage input
Gemini

Gemini 3.1 Flash-Lite

gemini-3.1-flash-lite

Cheapest 1M-context tier. Great for high-volume agents.

In / MTok
$0.25
Out / MTok
$1.50
Context
1,000k
StreamingToolsImage input
Gemini

Gemini 3.1 Flash Live

gemini-3.1-flash-live

Realtime conversational audio — bidirectional, sub-second latency. ~$0.005/min in, $0.018/min out.

Per unit
$0.00
Provider
Gemini
Google
Async
No
StreamingTools
Gemini

Gemini 3.1 Flash TTS

gemini-3.1-flash-tts

Controllable TTS across 70+ languages, 200+ inline emotion tags. Audio output tokens.

Per unit
$0.00
Provider
Gemini
Google
Async
No
Streaming
Gemini

Gemini 3.1 Pro

gemini-3.1-pro

Google’s flagship reasoning model + Computer Use. 1M context.

In / MTok
$2.00
Out / MTok
$12.00
Context
1,000k
StreamingToolsImage input
Gemini

Nano Banana

nano-banana

Original Nano Banana (Gemini 2.5 Flash Image). Fast, fun edits.

per image
$0.039
Provider
Gemini
Google
Async
No
Gemini

Nano Banana 2

nano-banana-2

Nano Banana 2 (Gemini 3.1 Flash Image) — Pro features at Flash speed. $0.067/1K, $0.10/2K, $0.15/4K.

per image
$0.067
Provider
Gemini
Google
Async
No
Gemini

Nano Banana Pro

nano-banana-pro

Nano Banana Pro (Gemini 3 Pro Image) — best fidelity, complex prompts, accurate text. $0.134/1K-2K, $0.24/4K.

per image
$0.134
Provider
Gemini
Google
Async
No
Gemini

Veo 3.1

veo-3.1

Cinematic text/image-to-video, 720p–1080p, optional audio.

per second
$0.40
Provider
Gemini
Google
Async
Yes
Gemini

Veo 3.1 Fast

veo-3.1-fast

Veo 3.1 Fast — $0.10/s @720p, $0.12/s @1080p, $0.30/s @4K.

per second
$0.10
Provider
Gemini
Google
Async
Yes
VertexAI

Chirp 3 (Vertex)

vertex-chirp-3

TTS with Instant Custom Voice (10s reference audio).

per 1M chars
$16.00
Provider
VertexAI
Google Vertex
Async
No
Streaming
VertexAI

Gemini 3.1 Pro (Vertex)

vertex-gemini-3.1-pro

Gemini 3.1 Pro via GCP Vertex — SLA, region pinning, audit logs.

In / MTok
$2.00
Out / MTok
$12.00
Context
1,000k
StreamingToolsImage input
VertexAI

Imagen 3 (Vertex)

vertex-imagen-3

Highest-quality Imagen text-to-image with inpainting + editing.

per image
$0.040
Provider
VertexAI
Google Vertex
Async
No
VertexAI

Lyria 3 Pro (Vertex)

vertex-lyria-3-pro

Music generation up to 184s. Public preview.

per second
$0.06
Provider
VertexAI
Google Vertex
Async
No
VertexAI

Veo 3.1 (Vertex)

vertex-veo-3.1

Veo 3.1 via GCP Vertex.

per second
$0.40
Provider
VertexAI
Google Vertex
Async
Yes
VertexAI

Veo 3.1 Fast (Vertex)

vertex-veo-3.1-fast

Veo 3.1 Fast on Vertex — enterprise routing, $0.10/s @720p.

per second
$0.10
Provider
VertexAI
Google Vertex
Async
Yes
OpenAI

GPT-4o Audio

gpt-4o-audio-preview

Text-or-audio in, text-or-audio out via /v1/chat/completions.

Per unit
$0.00
Provider
OpenAI
OpenAI
Async
No
StreamingTools
OpenAI

GPT-4o mini TTS

gpt-4o-mini-tts

Token-priced TTS — ~$0.015/min generated audio.

Per unit
$0.00
Provider
OpenAI
OpenAI
Async
No
StreamingTools
OpenAI

GPT-4o Transcribe

gpt-4o-transcribe

ASR with optional speaker diarization. Same price as Whisper.

per minute
$0.01
Provider
OpenAI
OpenAI
Async
No
Streaming
OpenAI

GPT-5.4

gpt-5.4

March 2026 release — sits between mini and 5.5. Strong cost/intel ratio.

In / MTok
$2.50
Out / MTok
$15.00
Context
1,000k
StreamingToolsImage input
OpenAI

GPT-5.4 mini

gpt-5.4-mini

GPT-5.4-class capability, fast + efficient. 400k context.

In / MTok
$0.75
Out / MTok
$4.50
Context
400k
StreamingToolsImage input
OpenAI

GPT-5.4 nano

gpt-5.4-nano

OpenAI’s smallest, cheapest model. High-volume simple tasks.

In / MTok
$0.20
Out / MTok
$1.25
Context
400k
StreamingToolsImage input
OpenAI

GPT-5.5

gpt-5.5

Flagship multimodal. 1M context. Computer use, MCP, hosted shell.

In / MTok
$5.00
Out / MTok
$30.00
Context
1,000k
StreamingToolsImage input
OpenAI

GPT-5.5 Pro

gpt-5.5-pro

Highest-tier flagship — research-grade reasoning. No cached-input discount.

In / MTok
$30.00
Out / MTok
$180.00
Context
1,000k
StreamingToolsImage input
OpenAI

GPT Image 2

gpt-image-2

OpenAI image gen — flexible sizes, high-fidelity image input.

per image
$0.040
Provider
OpenAI
OpenAI
Async
No
OpenAI

GPT Realtime

gpt-realtime

Bidirectional voice + text. ~$0.06/min audio in, $0.24/min audio out. Text in/out at $5/$20.

Per unit
$0.00
Provider
OpenAI
OpenAI
Async
No
StreamingTools
OpenAI

o4-mini

o4-mini

Cheap reasoning model — math, code, structured analysis at low cost.

In / MTok
$0.55
Out / MTok
$2.20
Context
200k
StreamingTools
OpenAI

TTS-1

tts-1

Standard TTS — same voices, lower fidelity, half the cost of HD.

per 1M chars
$15.00
Provider
OpenAI
OpenAI
Async
No
Streaming
OpenAI

TTS-1 HD

tts-1-hd

High-fidelity TTS — 6 preset voices. /v1/audio/speech endpoint.

per 1M chars
$30.00
Provider
OpenAI
OpenAI
Async
No
Streaming
OpenAI

Whisper (legacy)

whisper-1

Speech-to-text (ASR). Will be superseded by gpt-4o-transcribe.

per minute
$0.01
Provider
OpenAI
OpenAI
Async
No
Grok

Grok 4.1 Fast

grok-4.1-fast

Budget-friendly Grok — low TTFT, very cheap for high-volume tasks.

In / MTok
$0.20
Out / MTok
$0.50
Context
256k
StreamingTools
Grok

Grok 4.20

grok-4.20

Long-context variant — 2M context window for very large corpora.

In / MTok
$2.00
Out / MTok
$6.00
Context
2,000k
StreamingToolsImage input
Grok

Grok 4.3

grok-4.3

xAI’s May 2026 flagship — 1M context, multimodal.

In / MTok
$1.25
Out / MTok
$2.50
Context
1,000k
StreamingToolsImage input
Qwen

Qwen3 ASR Flash

qwen3-asr-flash

Speech-to-text — multilingual, low-latency.

per minute
$0.05
Provider
Qwen
Alibaba
Async
No
Streaming
Qwen

Qwen3 Max

qwen3-max

Flagship Qwen3 — Alibaba’s largest dense + MoE model.

In / MTok
$0.36
Out / MTok
$1.43
Context
256k
StreamingToolsImage input
Qwen

Qwen3 Omni Flash

qwen3-omni-flash

Omni-modal: text + image + audio in/out. Cheapest realtime-class tier.

Per unit
$0.00
Provider
Qwen
Alibaba
Async
No
StreamingToolsImage input
Qwen

Qwen3 VL Plus

qwen3-vl-plus

Vision-language flagship — strong OCR, document QA, GUI grounding.

In / MTok
$0.30
Out / MTok
$0.90
Context
128k
StreamingToolsImage input
Qwen

Qwen3.5

qwen3.5

Feb 2026 release — 397B params, 201 languages, 19× faster than prior gen.

In / MTok
$0.40
Out / MTok
$1.20
Context
256k
StreamingTools
Qwen

Qwen3.5 0.8B

qwen3.5-0.8b

Tiniest Qwen — extreme high-volume / on-device fallback.

In / MTok
$0.01
Out / MTok
$0.04
Context
32k
StreamingTools
Minimax

MiniMax Hailuo 2.3

minimax-hailuo-2.3

Latest Hailuo text-to-video. Billed in units; ~$0.04/s @768p.

per second
$0.04
Provider
Minimax
MiniMax
Async
Yes
Minimax

MiniMax Hailuo 2.3 Fast

minimax-hailuo-2.3-fast

Hailuo 2.3 Fast — 30% cheaper than 2.3, same family.

per second
$0.03
Provider
Minimax
MiniMax
Async
Yes
Minimax

MiniMax Image 01

minimax-image-01

Text-to-image, photoreal + stylized.

per image
$0.004
Provider
Minimax
MiniMax
Async
No
Minimax

MiniMax M2.1

minimax-m2.1

Older M-series — kept for reproducibility.

In / MTok
$0.30
Out / MTok
$1.20
Context
200k
StreamingTools
Minimax

MiniMax M2.5

minimax-m2.5

Prior-gen M-series — same base price, slightly weaker quality.

In / MTok
$0.30
Out / MTok
$1.20
Context
200k
StreamingTools
Minimax

MiniMax M2.7

minimax-m2.7

MiniMax’s newest self-iterating flagship for code + agents.

In / MTok
$0.30
Out / MTok
$1.20
Context
200k
StreamingTools
Minimax

MiniMax M2.7 Highspeed

minimax-m2.7-highspeed

M2.7 with priority routing — 2× cost for lower TTFT.

In / MTok
$0.60
Out / MTok
$2.40
Context
200k
StreamingTools
Minimax

MiniMax Music 1.5

minimax-music-1.5

Music generation from prompt — vocals + instrumentation.

per track
$0.10
Provider
Minimax
MiniMax
Async
No
Minimax

MiniMax Speech 02 HD

minimax-speech-02-hd

Prior-gen HD voice — kept for reproducibility of pipelines.

per 1M chars
$50.00
Provider
Minimax
MiniMax
Async
No
Streaming
Minimax

MiniMax Speech 2.5 Turbo

minimax-speech-2.5-turbo

HD TTS — 40 languages, accurate voice replication.

per 1M chars
$40.00
Provider
Minimax
MiniMax
Async
No
Streaming
Minimax

MiniMax Speech 2.6

minimax-speech-2.6

Latest TTS — Fluent LoRA voice cloning, prosodic naturalness across 40+ languages.

per 1M chars
$30.00
Provider
Minimax
MiniMax
Async
No
Streaming
Kimi

Kimi K2 0711

kimi-k2-0711-preview

July 2025 K2 preview snapshot — scheduled for discontinuation 2026-05-25. (Not currently listed on OpenRouter — keeping prior estimate.)

In / MTok
$0.60
Out / MTok
$2.50
Context
128k
StreamingTools
Kimi

Kimi K2 0905

kimi-k2-0905-preview

September 2025 K2 preview snapshot — scheduled for discontinuation 2026-05-25.

In / MTok
$0.60
Out / MTok
$2.50
Context
256k
StreamingTools
Kimi

Kimi K2 Thinking

kimi-k2-thinking

Explicit thinking-mode K2 — chain-of-thought reasoning for math + code.

In / MTok
$0.60
Out / MTok
$2.50
Context
256k
StreamingTools
Kimi

Kimi K2 Turbo

kimi-k2-turbo

Moonshot’s agentic K2 with priority routing — fast TTFT, 128k context.

In / MTok
$0.60
Out / MTok
$2.50
Context
128k
StreamingToolsImage input
Kimi

Kimi K2.5

kimi-k2.5

January 2026 multimodal release — cheaper than K2.6 with similar capability.

In / MTok
$0.40
Out / MTok
$1.90
Context
256k
StreamingToolsImage input
Kimi

Kimi K2.6

kimi-k2.6

Moonshot’s April 2026 flagship — multimodal, 256k context.

In / MTok
$0.73
Out / MTok
$3.49
Context
256k
StreamingToolsImage input
Kimi

Kimi Latest

kimi-latest

Auto-routes to Moonshot’s current default (K2.6 as of 2026-05). Multimodal.

In / MTok
$0.73
Out / MTok
$3.49
Context
256k
StreamingToolsImage input
Kimi

Moonshot v1 128k

moonshot-v1-128k

Legacy text-only Moonshot v1 long-context. (Not on OpenRouter — keeping prior flat pricing.)

In / MTok
$0.83
Out / MTok
$0.83
Context
128k
StreamingTools
DeepSeek

DeepSeek Chat (legacy)

deepseek-chat

Legacy alias → V4 Flash non-thinking mode. Sunsets 2026-07-24; migrate to deepseek-v4-flash.

In / MTok
$0.32
Out / MTok
$0.89
Context
163.84k
StreamingTools
DeepSeek

DeepSeek Reasoner (legacy)

deepseek-reasoner

Legacy alias → V4 Flash thinking mode. Sunsets 2026-07-24; migrate to deepseek-v4-flash with reasoning. (Not separately listed on OpenRouter — mirrors v4-flash.)

In / MTok
$0.11
Out / MTok
$0.22
Context
1,000k
StreamingTools
DeepSeek

DeepSeek V4 Flash

deepseek-v4-flash

V4 default — 1M context, 384k max output.

In / MTok
$0.11
Out / MTok
$0.22
Context
1,000k
StreamingTools
DeepSeek

DeepSeek V4 Pro

deepseek-v4-pro

V4 flagship reasoning — 1M context. OR pricing reflects 75%-off promo through 2026-05-31.

In / MTok
$0.43
Out / MTok
$0.87
Context
1,000k
StreamingTools
ChatGLM

CogVideoX

cogvideox

Zhipu’s text/image-to-video — 6s clips at 720p–1080p.

per second
$0.20
Provider
ChatGLM
Zhipu GLM
Async
Yes
ChatGLM

CogVideoX Flash

cogvideox-flash

Free video generation tier.

per second (free)
$0.00
Provider
ChatGLM
Zhipu GLM
Async
Yes
ChatGLM

CogView 3 Flash

cogview-3-flash

Free image generation tier — generous rate limits.

per image (free)
$0.000
Provider
ChatGLM
Zhipu GLM
Async
No
ChatGLM

CogView 4

cogview-4

Latest text-to-image — strong on Chinese text rendering + complex prompts.

per image
$0.050
Provider
ChatGLM
Zhipu GLM
Async
No
ChatGLM

GLM 4 AirX

glm-4-airx

Low-latency AirX variant — fastest GLM tier, smaller context.

In / MTok
$1.40
Out / MTok
$1.40
Context
8k
StreamingTools
ChatGLM

GLM 4 Flash

glm-4-flash

Free tier — high-volume, simple tasks. Generous rate limits.

In / MTok
$0.00
Out / MTok
$0.00
Context
128k
StreamingTools
ChatGLM

GLM 4.6

glm-4.6

GLM 4 family flagship — 200k context, agentic + coding focused.

In / MTok
$0.43
Out / MTok
$1.74
Context
200k
StreamingTools
ChatGLM

GLM 4.6V

glm-4.6v

GLM 4.6 vision variant — multimodal input on the 4-family flagship.

In / MTok
$0.30
Out / MTok
$0.90
Context
128k
StreamingToolsImage input
ChatGLM

GLM 4.7

glm-4.7

Previous-gen flagship before GLM 5 — solid coding + reasoning at lower cost.

In / MTok
$0.40
Out / MTok
$1.75
Context
200k
StreamingTools
ChatGLM

GLM 4.7 Flash

glm-4.7-flash

Lowest tier — high-volume simple tasks. Generous rate limits.

In / MTok
$0.06
Out / MTok
$0.40
Context
200k
StreamingTools
ChatGLM

GLM 5

glm-5

GLM 5 base flagship — released 2026-02-12, available on Pro and Max coding tiers.

In / MTok
$0.60
Out / MTok
$1.92
Context
200k
StreamingTools
ChatGLM

GLM 5 Turbo

glm-5-turbo

March 2026 turbo variant — priority routing on the GLM 5 family.

In / MTok
$1.20
Out / MTok
$4.00
Context
200k
StreamingTools
ChatGLM

GLM 5.1

glm-5.1

Zhipu’s April 2026 flagship — SOTA on SWE-Bench Pro, 200k context, 128k max output.

In / MTok
$0.98
Out / MTok
$3.08
Context
200k
StreamingTools
ChatGLM

GLM 5V Turbo

glm-5v-turbo

GLM 5 family vision-capable turbo — multimodal with priority routing.

In / MTok
$1.20
Out / MTok
$4.00
Context
200k
StreamingToolsImage input
ChatGLM

GLM Realtime

glm-realtime

End-to-end voice + video understanding with singing + 2-min memory. Function calls supported.

Per unit
$0.00
Provider
ChatGLM
Zhipu GLM
Async
No
StreamingToolsImage input
ChatGLM

GLM TTS

glm-tts

Controllable + emotion-expressive zero-shot voice cloning. Open-sourced Dec 2025.

per 1M chars
$20.00
Provider
ChatGLM
Zhipu GLM
Async
No
Streaming
Bedrock

Claude Haiku 3.5 (AWS Bedrock US CR)

bedrock-claude-haiku-3-5-us-cr

Claude Haiku 3.5 through AWS Bedrock US cross-region profile. Includes the documented 10% premium.

In / MTok
$0.88
Out / MTok
$4.40
Context
200k
StreamingToolsImage input
Bedrock

Claude Haiku 4.5 (AWS Bedrock)

bedrock-claude-haiku-4-5

Claude Haiku 4.5 through AWS Bedrock Global application inference profile.

In / MTok
$1.00
Out / MTok
$5.00
Context
200k
StreamingToolsImage input
Bedrock

Claude Opus 4.1 (AWS Bedrock US CR)

bedrock-claude-opus-4-1-us-cr

Claude Opus 4.1 through AWS Bedrock US cross-region profile. Includes the documented 10% premium.

In / MTok
$5.50
Out / MTok
$27.50
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Opus 4.5 (AWS Bedrock)

bedrock-claude-opus-4-5

Claude Opus 4.5 through AWS Bedrock Global application inference profile.

In / MTok
$5.00
Out / MTok
$25.00
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Opus 4.6 (AWS Bedrock)

bedrock-claude-opus-4-6

Claude Opus 4.6 through AWS Bedrock Global application inference profile.

In / MTok
$5.00
Out / MTok
$25.00
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Opus 4.7 (AWS Bedrock)

bedrock-claude-opus-4-7

Claude Opus 4.7 through AWS Bedrock Global application inference profile.

In / MTok
$5.00
Out / MTok
$25.00
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Opus 4 (AWS Bedrock US CR)

bedrock-claude-opus-4-us-cr

Claude Opus 4 through AWS Bedrock US cross-region profile. Includes the documented 10% premium.

In / MTok
$5.50
Out / MTok
$27.50
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Sonnet 4 (AWS Bedrock)

bedrock-claude-sonnet-4

Claude Sonnet 4 through AWS Bedrock Global application inference profile.

In / MTok
$3.00
Out / MTok
$15.00
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Sonnet 4.5 (AWS Bedrock)

bedrock-claude-sonnet-4-5

Claude Sonnet 4.5 through AWS Bedrock Global application inference profile.

In / MTok
$3.00
Out / MTok
$15.00
Context
1,000k
StreamingToolsImage input
Bedrock

Claude Sonnet 4.6 (AWS Bedrock)

bedrock-claude-sonnet-4-6

Claude Sonnet 4.6 through AWS Bedrock Global application inference profile.

In / MTok
$3.00
Out / MTok
$15.00
Context
1,000k
StreamingToolsImage input