Ling-2.6-flash

Inclusion AI

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

What is Ling-2.6-flash?

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Specifications

Developer: Inclusion AI
Context window: 262.1K tokens
Max output: 32.8K tokens
Input modalities: text
Output modalities: text
Input price: $0.0800 per 1M tokens
Output price: $0.2400 per 1M tokens
Knowledge cutoff: —
Supported parameters: frequency_penalty, max_tokens, presence_penalty, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_p

Use Ling-2.6-flash with PromptLayer

PromptLayer lets teams manage, evaluate, and observe prompts that run on Ling-2.6-flash alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.

Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.