Ling-2.6-flash
Inclusion AI
What is Ling-2.6-flash?
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Specifications
- Developer: Inclusion AI
- Context window: 262.1K tokens
- Max output: 32.8K tokens
- Input modalities: text
- Output modalities: text
- Input price: $0.0800 per 1M tokens
- Output price: $0.2400 per 1M tokens
- Knowledge cutoff: not specified
- Supported parameters: frequency_penalty, max_tokens, presence_penalty, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p
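The listed prices and limits lend themselves to a quick back-of-the-envelope check before sending a request. The following is a minimal sketch, not an official calculator: the function names `estimate_cost` and `fits_limits` are hypothetical, the rounded figures "262.1K" and "32.8K" are taken literally, and it assumes output tokens count toward the context window, which the listing does not confirm.

```python
# Figures copied from the specifications list above.
INPUT_PRICE_PER_M = 0.08    # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.24   # USD per 1M output tokens
CONTEXT_WINDOW = 262_100    # "262.1K" taken literally; the exact limit may differ (assumption)
MAX_OUTPUT = 32_800         # "32.8K" taken literally; the exact limit may differ (assumption)


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the listed limits.

    Assumes output tokens share the context window with the input,
    which is an assumption and not stated in the listing.
    """
    within_output = output_tokens <= MAX_OUTPUT
    within_context = input_tokens + output_tokens <= CONTEXT_WINDOW
    return within_output and within_context


if __name__ == "__main__":
    prompt_tokens, completion_tokens = 100_000, 2_000
    print(f"Estimated cost: ${estimate_cost(prompt_tokens, completion_tokens):.4f}")
    print(f"Fits limits: {fits_limits(prompt_tokens, completion_tokens)}")
```

Under these prices, a request with 100,000 input tokens and 2,000 output tokens comes to about $0.0085, with the input accounting for most of the cost.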
Use Ling-2.6-flash with PromptLayer
PromptLayer lets teams manage, evaluate, and observe prompts that run on Ling-2.6-flash alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.