Gemini 3.1 Flash Lite

Google

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

What is Gemini 3.1 Flash Lite?

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Specifications

Developer: Google
Context window: 1.0M tokens
Max output: 65.5K tokens
Input modalities: text, image, video, file, audio
Output modalities: text
Input price: $0.2500 per 1M tokens
Output price: $1.50 per 1M tokens
Knowledge cutoff: —
Supported parameters: include_reasoning, max_tokens, reasoning, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p

Use Gemini 3.1 Flash Lite with PromptLayer

PromptLayer lets teams manage, evaluate, and observe prompts that run on Gemini 3.1 Flash Lite alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.

Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.