Qwen3.5-Flash

Qwen3.5-Flash

Alibaba (Qwen)

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

What is Qwen3.5-Flash?

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Specifications

  • Developer: Alibaba (Qwen)
  • Context window: 1M tokens
  • Max output: 65.5K tokens
  • Input modalities: text, image, video
  • Output modalities: text
  • Input price: $0.0650 per 1M tokens
  • Output price: $0.2600 per 1M tokens
  • Knowledge cutoff:
  • Supported parameters: include_reasoning, max_tokens, presence_penalty, reasoning, response_format, seed, structured_outputs, temperature, tool_choice, tools, top_p

Use Qwen3.5-Flash with PromptLayer

PromptLayer lets teams manage, evaluate, and observe prompts that run on Qwen3.5-Flash alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.

Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026