Google · audio

Gemini 3.1 Flash TTS

gemini-3.1-flash-tts

Controllable text-to-speech (TTS) across 70+ languages, with 200+ inline emotion tags. Output tokens are audio.

Pricing

Input: $1.00 / MTok
Output: $20.00 / MTok

Context

32k tokens

Provider

Gemini (Google)

Capabilities

Chat completions

Multi-turn dialogue, system prompts.
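As a rough sketch of what a multi-turn conversation with a system prompt might look like, the snippet below builds the `messages` array that would be passed to `client.chat.completions.create`. It assumes the gateway accepts the OpenAI-compatible message format; the helper names `startConversation` and `addTurn` are hypothetical, and the assistant reply is a hand-written placeholder rather than real model output.

```typescript
type Role = 'system' | 'user' | 'assistant';

interface Message {
  role: Role;
  content: string;
}

// Start a conversation with a single system prompt (hypothetical helper).
function startConversation(systemPrompt: string): Message[] {
  return [{ role: 'system', content: systemPrompt }];
}

// Return a new history with one more turn appended; the input is not mutated.
function addTurn(history: Message[], role: Role, content: string): Message[] {
  return [...history, { role, content }];
}

let conversation = startConversation('You are a calm narrator.');
conversation = addTurn(conversation, 'user', 'Greet the listener.');
// Placeholder reply for illustration only; in practice this comes from the API response.
conversation = addTurn(conversation, 'assistant', 'Hello, and welcome.');
conversation = addTurn(conversation, 'user', 'Now say goodbye.');

// Show the order of roles that would be sent on the next request.
console.log(conversation.map((m) => m.role).join(' > '));
```

Sending the full history on every request is what lets the model see earlier turns; the API itself is assumed to be stateless between calls.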

Streaming

SSE chunks for incremental output.
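To make the SSE framing concrete, here is a minimal sketch that pulls text deltas out of a raw event stream. It assumes the gateway follows the OpenAI-compatible convention of `data: <json>` lines terminated by `data: [DONE]`; the function name `extractDeltas` and the sample payload are hypothetical and hand-written, not captured from the API.

```typescript
// Shape of one streamed chunk, assuming the OpenAI-compatible format.
interface StreamChunk {
  choices: { delta: { content?: string } }[];
}

// Extract text deltas from a complete SSE payload (hypothetical helper).
// Assumes every event is a whole "data: ..." line; partial lines are not handled.
function extractDeltas(ssePayload: string): string[] {
  const deltas: string[] = [];
  for (const line of ssePayload.split(/\r?\n/)) {
    const match = /^data:\s?(.*)$/.exec(line);
    if (!match) continue; // skip blank separator lines and SSE comments
    const data = match[1].trim();
    if (data === '[DONE]') break; // end-of-stream sentinel
    const chunk = JSON.parse(data) as StreamChunk;
    const text = chunk.choices[0]?.delta?.content;
    if (text) deltas.push(text);
  }
  return deltas;
}

// Hand-written sample payload for illustration only.
const sample = [
  'data: {"choices":[{"delta":{"content":"Hel"}}]}',
  '',
  'data: {"choices":[{"delta":{"content":"lo"}}]}',
  '',
  'data: [DONE]',
].join('\n');

console.log(extractDeltas(sample).join(''));
```

With the `openai` SDK you would not normally parse SSE by hand: passing `stream: true` to `client.chat.completions.create` returns an async iterable of chunks, each exposing `choices[0].delta`.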

Tool / function calling

Structured arguments via JSON schemas.
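A small sketch of what "structured arguments via JSON schemas" could look like in practice: a tool definition whose `parameters` field is a JSON Schema, plus a dispatcher that decodes the arguments the model sends back. This assumes the gateway uses the OpenAI-compatible `tools` format; `get_weather`, `getWeather`, `dispatch`, and the sample tool call are all hypothetical.

```typescript
// Tool definition in the OpenAI-compatible "function" format (assumed).
const tools = [
  {
    type: 'function' as const,
    function: {
      name: 'get_weather', // hypothetical tool name
      description: 'Look up the current weather for a city.',
      parameters: {
        type: 'object',
        properties: { city: { type: 'string' } },
        required: ['city'],
      },
    },
  },
];

// Shape of a tool call in the response, assuming the OpenAI-compatible format.
interface ToolCall {
  id: string;
  type: 'function';
  function: { name: string; arguments: string }; // arguments is a JSON-encoded string
}

// Hypothetical local implementation of the tool.
function getWeather(args: { city: string }): string {
  return `Sunny in ${args.city}`;
}

// Decode the JSON arguments and route the call to the matching local function.
function dispatch(call: ToolCall): string {
  if (call.function.name !== 'get_weather') {
    throw new Error(`Unknown tool: ${call.function.name}`);
  }
  const args = JSON.parse(call.function.arguments) as { city: string };
  return getWeather(args);
}

// Hand-written sample tool call for illustration only.
const sampleCall: ToolCall = {
  id: 'call_1',
  type: 'function',
  function: { name: 'get_weather', arguments: '{"city":"Lisbon"}' },
};

console.log(dispatch(sampleCall));
```

The `tools` array would be passed alongside `messages` in the request. Since the arguments are produced by the model, validating them against the schema before use is a sensible precaution.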

Image input

Pass images alongside the prompt.
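One way an image might be passed alongside the prompt is as a list of content parts on a user message. The sketch below assumes the gateway accepts the OpenAI-compatible `image_url` content part; the helper `imageMessage` is hypothetical and the URL is a placeholder.

```typescript
// Content parts, assuming the OpenAI-compatible multimodal message format.
type ContentPart =
  | { type: 'text'; text: string }
  | { type: 'image_url'; image_url: { url: string } };

interface UserMessage {
  role: 'user';
  content: ContentPart[];
}

// Build a user message that pairs a text prompt with one image (hypothetical helper).
function imageMessage(prompt: string, imageUrl: string): UserMessage {
  return {
    role: 'user',
    content: [
      { type: 'text', text: prompt },
      { type: 'image_url', image_url: { url: imageUrl } },
    ],
  };
}

// Placeholder URL for illustration only.
const msg = imageMessage('Describe this picture aloud.', 'https://example.com/cat.png');

console.log(JSON.stringify(msg, null, 2));
```

The resulting object would take the place of a plain-string user message in the `messages` array.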

Video generation

Text and image to short video, async.

JSON mode

Coming soon.

Make your first call.

Set your sk-maas-… key as the MACAW_API_KEY environment variable and you're ready to go.

chat.ts
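import OpenAI from 'openai';

// Point the OpenAI SDK at the Miavo gateway instead of the default endpoint.
const client = new OpenAI({
  baseURL: 'https://api.miavo.xyz/v1',
  // Read the key from the environment rather than hard-coding it.
  apiKey: process.env.MACAW_API_KEY!,
});

// Top-level await requires an ES module context.
const res = await client.chat.completions.create({
  model: 'gemini-3.1-flash-tts',
  messages: [
    { role: 'user', content: 'Write me a haiku about gateways.' },
  ],
});

// Print the first choice returned by the model.
console.log(res.choices[0].message.content);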
