Gemini 2.0 Flash Lite
Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...
What is Gemini 2.0 Flash Lite?
Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...
Specifications
- Developer: Google
- Context window: 1.0M tokens
- Max output: 8.2K tokens
- Input modalities: text, image, file, audio, video
- Output modalities: text
- Input price: $0.0750 per 1M tokens
- Output price: $0.3000 per 1M tokens
- Knowledge cutoff: 2024-08-31
- Supported parameters: max_tokens, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p
Use Gemini 2.0 Flash Lite with PromptLayer
PromptLayer lets teams manage, evaluate, and observe prompts that run on Gemini 2.0 Flash Lite alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.
Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.