Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite

Google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

What is Gemini 2.5 Flash Lite?

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Specifications

  • Developer: Google
  • Context window: 1.0M tokens
  • Max output: 65.5K tokens
  • Input modalities: text, image, file, audio, video
  • Output modalities: text
  • Input price: $0.1000 per 1M tokens
  • Output price: $0.4000 per 1M tokens
  • Knowledge cutoff: 2025-01-31
  • Supported parameters: include_reasoning, max_tokens, reasoning, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p

Use Gemini 2.5 Flash Lite with PromptLayer

PromptLayer lets teams manage, evaluate, and observe prompts that run on Gemini 2.5 Flash Lite alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.

Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026