Gemini 3.1 Flash Lite
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
What is Gemini 3.1 Flash Lite?
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Specifications
- Developer: Google
- Context window: 1.0M tokens
- Max output: 65.5K tokens
- Input modalities: text, image, video, file, audio
- Output modalities: text
- Input price: $0.2500 per 1M tokens
- Output price: $1.50 per 1M tokens
- Knowledge cutoff: —
- Supported parameters: include_reasoning, max_tokens, reasoning, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p
Use Gemini 3.1 Flash Lite with PromptLayer
PromptLayer lets teams manage, evaluate, and observe prompts that run on Gemini 3.1 Flash Lite alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.
Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.